Research of dynamic beam resource scheduling of LEO constellation based on A2C algorithm (基于A2C算法的低轨星座动态波束资源调度研究)

doi:10.16708/j.cnki.1000-758X.2023.0045

Chinese Space Science and Technology, 2023, Vol. 43, Issue (3): 123-133.

• Special column on mega-constellations / large-scale LEO constellations •

LIU Wei (刘伟), ZHENG Runze (郑润泽), ZHANG Lei (张磊), GAO Zihe (高梓贺), TAO Ying (陶滢), CUI Kaixin (崔楷欣)

1 Innovation Center of Satellite Communication System, CNSA, Beijing 100094, China
2 Institute of Telecommunication and Navigation Satellites, China Academy of Space Technology, Beijing 100094, China
3 Northwestern Polytechnical University, Xi'an 710072, China
4 Beijing Institute of Technology, Beijing 100081, China

Online: 2023-05-23 Published: 2023-06-25

Abstract

Giant low-Earth-orbit (LEO) constellations provide low-latency, large-capacity communication channels for user spacecraft such as crewed spacecraft, space stations and remote sensing satellites, which raises the problem of optimizing the allocation of satellite beam resources. This work studies an intelligent optimization framework built on A2C (advantage actor-critic), a discrete-time deep reinforcement learning algorithm. By borrowing the concepts of individuals and genes from genetic algorithms, it develops a beam resource scheduling algorithm that meets multi-user, dynamic and concurrent access demands. Simulation results indicate that the proposed algorithm is applicable in a variety of typical scenarios: it produces effective schedules for more than 3000 tasks within 20 s, with a task success rate of no less than 91%. Algorithmic optimization reduces complexity, saving more than 45% of the running time compared with a traditional genetic algorithm. The convergence behaviour of the traditional A2C framework is also improved: the method resolves the failure of the traditional fully connected A2C algorithm to converge, and it converges more than 38% faster than the DQN (deep Q-network) framework.

Key words: LEO constellation, beam scheduling, task planning, deep reinforcement learning (DRL), A2C algorithm
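For readers unfamiliar with A2C, the following minimal sketch shows the quantities a one-step advantage actor-critic update is built from. It is not the authors' algorithm: the abstract does not specify the state, action or reward design, so treating each action as the choice of one candidate beam, the reward value, and every name below (`a2c_losses`, `logits`, `value`, `next_value`) are illustrative assumptions.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over a 1-D array of action scores."""
    z = logits - np.max(logits)
    e = np.exp(z)
    return e / e.sum()


def a2c_losses(logits, action, reward, value, next_value, gamma=0.99, done=False):
    """One-step advantage actor-critic losses for a single transition.

    logits     : actor scores, one per action (assumed: one per candidate beam)
    action     : index of the action that was taken
    reward     : scalar reward (assumed: e.g. 1.0 when a task is scheduled)
    value      : critic estimate V(s) for the current state
    next_value : critic estimate V(s') for the next state
    """
    # Bootstrapped one-step target; no bootstrap on terminal transitions.
    target = reward + (0.0 if done else gamma * next_value)
    # Advantage: how much better the outcome was than the critic expected.
    advantage = target - value
    probs = softmax(np.asarray(logits, dtype=float))
    # Policy-gradient loss; the advantage is treated as a constant weight.
    actor_loss = -np.log(probs[action]) * advantage
    # Critic regresses V(s) toward the target.
    critic_loss = 0.5 * advantage ** 2
    return advantage, actor_loss, critic_loss


if __name__ == "__main__":
    # Hypothetical transition: three candidate beams, beam 1 chosen.
    adv, a_loss, c_loss = a2c_losses(
        logits=[0.2, 1.0, -0.5], action=1, reward=1.0,
        value=0.4, next_value=0.6,
    )
    print(adv, a_loss, c_loss)
```

In a full implementation both losses would be differentiated with respect to network parameters by an automatic differentiation framework; the sketch only computes their values for one transition.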

LIU Wei, ZHENG Runze, ZHANG Lei, GAO Zihe, TAO Ying, CUI Kaixin. Research of dynamic beam resource scheduling of LEO constellation based on A2C algorithm[J]. Chinese Space Science and Technology, 2023, 43(3): 123-133.

Link to this article: https://journal26.magtechjournal.com/kjkxjs/CN/10.16708/j.cnki.1000-758X.2023.0045

https://journal26.magtechjournal.com/kjkxjs/CN/Y2023/V43/I3/123
