Lightweight unmanned aerial vehicle video object detection based on spatial-temporal correlation

被引:8
|
作者
Zhou, Pei [1 ]
Liu, GuanJun [1 ]
Wang, Jiacun [2 ]
Weng, QianLi [1 ]
Zhang, KaiWen [1 ]
Zhou, ZiYuan [1 ]
机构
[1] Tongji Univ, Dept Comp Sci, Shanghai 201800, Peoples R China
[2] Monmouth Univ, West Long Branch, NJ USA
关键词
computing capacity; spatial-temporal correlation; UAV; video object detection;
D O I
10.1002/dac.5334
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Intelligent unmanned aerial vehicles (UAVs) are drawing more and more attention from industry to academia. UAV navigation plays an important role in the cooperative scenario where multiple UAVs are deployed, while image data that capture the information of the UAV area are often used as input for UAV navigation. Deep learning is a common and powerful technique for UAV image processing, but a complex model generated by deep learning technique is hardly suitable for the limited computing capacity of edge computing devices such as UAVs. Therefore, this paper designs an efficient deep learning model on UAVs to fit the restriction of low computational powers and low power consumption. Traditional UAV object detection methods mostly use static images as the basis for object recognition, or collect images for offline detection. Our method combines the existing fast single-frame detection methods with the spatial-temporal relationship of video sequences, to build an efficient end-to-end model. In addition, the convolutional LSTM module is used to propagate the temporal context of the video frame sequences. Based on the temporal context, we propose a module for calculating spatial correlation. At the same time, we establish our experimental dataset in our real application and conduct the experiment, which shows that the proposed method reduces the size of models and meanwhile maintains the detection rate. Compared with the existing static images approaches, our method is faster and more accurate. Inference speeds of nearly 20fps can be achieved while performing real-time tasks.
引用
收藏
页数:13
相关论文
共 50 条
  • [11] Multilevel Spatial-Temporal Feature Aggregation for Video Object Detection
    Xu, Chao
    Zhang, Jiangning
    Wang, Mengmeng
    Tian, Guanzhong
    Liu, Yong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7809 - 7820
  • [12] LightUAV-YOLO: a lightweight object detection model for unmanned aerial vehicle image
    Lyu, Yifan
    Zhang, Tianze
    Li, Xin
    Liu, Aixun
    Shi, Gang
    [J]. Journal of Supercomputing, 2025, 81 (01):
  • [13] LUD-YOLO: A novel lightweight object detection network for unmanned aerial vehicle
    Fan, Qingsong
    Li, Yiting
    Deveci, Muhammet
    Zhong, Kaiyang
    Kadry, Seifedine
    [J]. INFORMATION SCIENCES, 2025, 686
  • [14] Model-based approach to spatial-temporal sampling of video clips for video object detection by classification
    Chuang, Chi-Han
    Cheng, Shyi-Chyi
    Chang, Chin-Chun
    Chen, Yi-Ping Phoebe
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (05) : 1018 - 1030
  • [15] Deep Spatial-Temporal Joint Feature Representation for Video Object Detection
    Zhao, Baojun
    Zhao, Boya
    Tang, Linbo
    Han, Yuqi
    Wang, Wenzheng
    [J]. SENSORS, 2018, 18 (03)
  • [16] Learning Complementary Spatial-Temporal Transformer for Video Salient Object Detection
    Liu, Nian
    Nan, Kepan
    Zhao, Wangbo
    Yao, Xiwen
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 10663 - 10673
  • [17] End-to-End Video Object Detection with Spatial-Temporal Transformers
    He, Lu
    Zhou, Qianyu
    Li, Xiangtai
    Niu, Li
    Cheng, Guangliang
    Li, Xiao
    Liu, Wenxuan
    Tong, Yunhai
    Ma, Lizhuang
    Zhang, Liqing
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1507 - 1516
  • [18] Application of Deep Learning Based Object Detection on Unmanned Aerial Vehicle
    Ipek, Burak
    Akpinar, Mustafa
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 74 - 78
  • [19] The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking
    Du, Dawei
    Qi, Yuankai
    Yu, Hongyang
    Yang, Yifan
    Duan, Kaiwen
    Li, Guorong
    Zhang, Weigang
    Huang, Qingming
    Tian, Qi
    [J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 375 - 391
  • [20] Object Detection Technique for Small Unmanned Aerial Vehicle
    Bin Ramli, M. Faiz
    Legowo, Ari
    Shamsudin, Syariful Syafiq
    [J]. 6TH INTERNATIONAL CONFERENCE ON MECHATRONICS (ICOM'17), 2017, 260