Lightweight unmanned aerial vehicle video object detection based on spatial-temporal correlation

被引:8
|
作者
Zhou, Pei [1 ]
Liu, GuanJun [1 ]
Wang, Jiacun [2 ]
Weng, QianLi [1 ]
Zhang, KaiWen [1 ]
Zhou, ZiYuan [1 ]
机构
[1] Tongji Univ, Dept Comp Sci, Shanghai 201800, Peoples R China
[2] Monmouth Univ, West Long Branch, NJ USA
关键词
computing capacity; spatial-temporal correlation; UAV; video object detection;
D O I
10.1002/dac.5334
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Intelligent unmanned aerial vehicles (UAVs) are drawing more and more attention from industry to academia. UAV navigation plays an important role in the cooperative scenario where multiple UAVs are deployed, while image data that capture the information of the UAV area are often used as input for UAV navigation. Deep learning is a common and powerful technique for UAV image processing, but a complex model generated by deep learning technique is hardly suitable for the limited computing capacity of edge computing devices such as UAVs. Therefore, this paper designs an efficient deep learning model on UAVs to fit the restriction of low computational powers and low power consumption. Traditional UAV object detection methods mostly use static images as the basis for object recognition, or collect images for offline detection. Our method combines the existing fast single-frame detection methods with the spatial-temporal relationship of video sequences, to build an efficient end-to-end model. In addition, the convolutional LSTM module is used to propagate the temporal context of the video frame sequences. Based on the temporal context, we propose a module for calculating spatial correlation. At the same time, we establish our experimental dataset in our real application and conduct the experiment, which shows that the proposed method reduces the size of models and meanwhile maintains the detection rate. Compared with the existing static images approaches, our method is faster and more accurate. Inference speeds of nearly 20fps can be achieved while performing real-time tasks.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Haze removal for unmanned aerial vehicle aerial video based on spatial-temporal coherence optimisation
    Zhao, Xintao
    Ding, Wenrui
    Liu, Chunhui
    Li, Hongguang
    [J]. IET IMAGE PROCESSING, 2018, 12 (01) : 88 - 97
  • [2] Lightweight Spatial-Temporal Contextual Aggregation Siamese Network for Unmanned Aerial Vehicle Tracking
    Chen, Qiqi
    Liu, Jinghong
    Liu, Faxue
    Xu, Fang
    Liu, Chenglong
    Gonzalez-Aguilera, Diego
    [J]. DRONES, 2024, 8 (01)
  • [3] Lightweight vehicle object detection network for unmanned aerial vehicles aerial images
    Liu, Lu-Chen
    Jia, Xiang-Yu
    Han, Dong-Nuo
    Li, Zhen-Dong
    Sun, Hong-Mei
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
  • [4] Object Tracking Based on Unmanned Aerial Vehicle Video
    Tan, Xiong
    Yu, Xuchu
    Liu, Jingzheng
    Huang, Weijie
    [J]. 2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL I, 2010, : 244 - 247
  • [5] Object Tracking Based on Unmanned Aerial Vehicle Video
    Tan, Xiong
    Yu, Xuchu
    Liu, Jingzheng
    Huang, Weijie
    [J]. APPLIED INFORMATICS AND COMMUNICATION, PT I, 2011, 224 : 432 - 438
  • [6] Robust Spatial-Temporal Autoencoder for Unsupervised Anomaly Detection of Unmanned Aerial Vehicle With Flight Data
    Jiang, Guoqian
    Nan, Pengcheng
    Zhang, Jingchao
    Li, Yingwei
    Li, Xiaoli
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [7] Object Detection-Based Video Retargeting With Spatial-Temporal Consistency
    Lee, Seung Joon
    Lee, Siyeong
    Cho, Sung In
    Kang, Suk-Ju
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (12) : 4434 - 4439
  • [8] Video Object Detection with an Aligned Spatial-Temporal Memory
    Xiao, Fanyi
    Lee, Yong Jae
    [J]. COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 494 - 510
  • [9] Object Detection and Tracking of Unmanned Surface Vehicles Based on Spatial-temporal Information Fusion
    Zhou Zhiguo
    Jing Zhao
    Wang Qiuling
    Qu Chong
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1698 - 1705
  • [10] SPATIAL-TEMPORAL FEATURE AGGREGATION NETWORK FOR VIDEO OBJECT DETECTION
    Chen, Zhu
    Li, Weihai
    Fei, Chi
    Liu, Bin
    Yu, Nenghai
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1858 - 1862