中国空间科学技术 ›› 2025, Vol. 45 ›› Issue (5): 60-74.doi: 10.16708/j.cnki.1000-758X.2025.0076

• 智能遥感应用专题Ⅱ • 上一篇    下一篇

遥感卫星视频目标跟踪方法综述

李洋帆1,2,李伟1,2,*,田静1,2,沈清1,2   

  1. 1.天基智能信息处理全国重点实验室,北京100081
    2.北京理工大学 信息与电子学院,北京100081
  • 收稿日期:2024-08-02 修回日期:2025-02-17 录用日期:2025-03-01 发布日期:2025-09-17 出版日期:2025-10-01

Object tracking in satellite videos: a survey

LI Yangfan1,2,LI Wei1,2,*, TIAN Jing1,2, SHEN Qing1,2   

  1. 1.National Key Laboratory of Science and Technology on Space-Born Intelligent Information Processing, Beijing 100081, China
    2.School of Information and Electronics, Beijing Institute of Technology, Beijing 100081, China
  • Received:2024-08-02 Revision received:2025-02-17 Accepted:2025-03-01 Online:2025-09-17 Published:2025-10-01

摘要: 梳理遥感卫星视频单目标跟踪技术的研究进展,分析现有方法的优缺点,并探讨未来发展方向。通过文献调研与对比分析,系统总结了近五年国内外在该领域的研究成果。将现有方法分为基于相关滤波和基于深度学习两类,分别分析其技术特点与性能。结合公开数据集,对代表性方法的跟踪精度进行了对比评价,并探讨了不同方法的适用性与局限性。实验结果表明,基于相关滤波的方法在计算速度与跟踪精度方面表现优异。在公开的SatSOT数据集上,其跟踪精确率最高可达到69.8%,平均帧率超过30frame/s,展现出较强的实用性与实时性。这类方法通过利用目标的表观特征和运动信息,能够在较低计算成本下实现高效跟踪,尤其适用于资源受限的星载平台。相比之下,基于深度学习的方法在特征表达和复杂场景适应性方面具有显著优势,但由于遥感领域缺乏大规模标注数据,其在相同数据集上的最高跟踪精确率目前为66.9%,低于相关滤波方法。总结了遥感卫星视频单目标跟踪的研究进展,相关滤波方法成熟且实时性强,适用于当前任务;深度学习方法潜力巨大,是未来重要方向。因此未来研究需聚焦高性能深度学习目标跟踪方法及其实时性能优化。

关键词: 卫星视频, 单目标跟踪, 相关滤波, 深度学习

Abstract: This paper aims to review the research progress in remote sensing satellite video single-target tracking technologies, analyze the advantages and disadvantages of existing methods, and explore future development directions. Through literature review and comparative analysis, the research achievements in this field over the past five years were systematically summarized. The existing methods are categorized into two types: correlation filtering-based methods and deep learning-based methods. The technical features and performance of each category were analyzed. Tracking accuracy of representative methods was evaluated based on publicly available datasets, and the applicability and limitations of different methods were discussed. Experimental results show that correlation filtering-based methods perform excellently in terms of computation speed and tracking accuracy. On the publicly available SatSOT dataset, the highest tracking accuracy can reach 69.8%, with an average frame rate exceeding 30frame/s, demonstrating strong practicality and real-time performance. These methods efficiently track targets with low computationalcost by utilizing appearance features and motion information, making them particularly suitable for resource-constrained onboard platforms. In contrast, deep learning-based methods have significant advantages in feature representation and adaptability to complex scenes, but due to the lack of large-scale annotated data in the remote sensing domain, their highest tracking accuracy on the SatSOT dataset is currently 66.9%, slightly lower than correlation filtering methods. This paper summarizes the research progress in remote sensing satellite video single-target tracking. Correlation filtering methods are mature and highly real-time, suitable for current tasks. Deep learning methods show great potential but require further improvements in model optimization.

Key words: satellite videos, single object tracking, correlation filter, deep learning