Satellite attitude control method based on deep reinforcement learning

doi:10.16708/j.cnki.1000-758X.2019.0027

Chinese Space Science and Technology, 2019, Vol. 39, Issue (4): 36-. doi: 10.16708/j.cnki.1000-758X.2019.0027

WANG Yue-Jiao, MA Zhong, YANG Yi-Dai, WANG Zhu-Ping, TANG Lei

Xi'an Microelectronics Technology Institute, Xi'an 710065, China

Received: 2018-11-01 Revision received: 2019-01-08 Online: 2019-04-22 Published: 2019-08-25

Abstract

Satellites can experience sudden attitude changes while performing complex tasks such as releasing a payload or capturing a target. To restore the satellite to a stable state after such a disturbance, a satellite attitude control method based on deep reinforcement learning is proposed. First, an attitude dynamics environment for the satellite is established, and the continuous control torque output is discretized. A Deep Q-Network (DQN) algorithm is then used to train autonomous attitude control, with the reward tied to stabilization of the attitude angular velocity so that the agent learns the best discrete action. Finally, the method is validated in simulation. The results show that the deep reinforcement learning controller can stabilize the satellite attitude after a sudden random disturbance, and that it effectively addresses the dependence of the traditional PD controller on the mass parameters of the controlled object. Because the proposed method controls the satellite attitude through self-learning, it is broadly applicable and has strong potential for the intelligent control of satellites performing complex space tasks.
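
The environment side of this pipeline can be illustrated with a minimal sketch. It is not the authors' implementation: the inertia values, torque level, time step, the 27-action grid, and the reward (negative angular-speed magnitude) are all assumptions chosen for illustration, and a one-step greedy lookahead stands in for the trained Deep Q-Network so that the example stays self-contained.

```python
import itertools
import math

# Hypothetical parameters, chosen for illustration only (not from the paper).
INERTIA = (10.0, 12.0, 8.0)  # principal moments of inertia, kg*m^2
TORQUE_LEVEL = 0.5           # torque magnitude available on each axis, N*m
DT = 0.1                     # integration step, s

# Discretize the continuous torque: each axis takes -T, 0, or +T (27 actions).
ACTIONS = [tuple(TORQUE_LEVEL * s for s in signs)
           for signs in itertools.product((-1, 0, 1), repeat=3)]


def step(omega, action_index):
    """Advance the body rates by one explicit-Euler step.

    Uses Euler's rigid-body equations in principal axes:
    I * d(omega)/dt = tau - omega x (I * omega).
    Returns the new angular velocity and an assumed reward, the negative
    magnitude of the angular velocity.
    """
    wx, wy, wz = omega
    ix, iy, iz = INERTIA
    tx, ty, tz = ACTIONS[action_index]
    dwx = (tx - (iz - iy) * wy * wz) / ix
    dwy = (ty - (ix - iz) * wz * wx) / iy
    dwz = (tz - (iy - ix) * wx * wy) / iz
    new_omega = (wx + DT * dwx, wy + DT * dwy, wz + DT * dwz)
    reward = -math.sqrt(sum(w * w for w in new_omega))
    return new_omega, reward


def greedy_action(omega):
    """Stand-in for a learned Q-function: pick the action with the best one-step reward."""
    return max(range(len(ACTIONS)), key=lambda a: step(omega, a)[1])


def rollout(omega, n_steps):
    """Apply the greedy stand-in policy for n_steps and return the final body rates."""
    for _ in range(n_steps):
        omega, _ = step(omega, greedy_action(omega))
    return omega


if __name__ == "__main__":
    disturbed = (0.2, -0.1, 0.15)  # rad/s, an assumed post-disturbance state
    final = rollout(disturbed, 200)
    print("final angular speed:", math.sqrt(sum(w * w for w in final)))
```

Note that the greedy stand-in reads `INERTIA` through `step`, so it depends on the mass parameters; a DQN agent would instead learn action values from interaction with the environment, which is the property the abstract contrasts with the PD controller.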

Key words: deep reinforcement learning, satellite attitude control, dynamic environment, autonomous attitude control, mass parameters

WANG Yue-Jiao, MA Zhong, YANG Yi-Dai, WANG Zhu-Ping, TANG Lei. Satellite attitude control method based on deep reinforcement learning[J]. Chinese Space Science and Technology, 2019, 39(4): 36-.

URL: https://journal26.magtechjournal.com/kjkxjs/EN/10.16708/j.cnki.1000-758X.2019.0027

https://journal26.magtechjournal.com/kjkxjs/EN/Y2019/V39/I4/36
