Lightweight unmanned aerial vehicle video object detection based on spatial-temporal correlation

被引：8

作者：

Zhou, Pei ^{[1
]}

Liu, GuanJun ^{[1
]}

Wang, Jiacun ^{[2
]}

Weng, QianLi ^{[1
]}

Zhang, KaiWen ^{[1
]}

Zhou, ZiYuan ^{[1
]}

机构：

[1] Tongji Univ, Dept Comp Sci, Shanghai 201800, Peoples R China

[2] Monmouth Univ, West Long Branch, NJ USA

来源：

INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS | 2022年 / 35卷 / 17期

关键词：

computing capacity; spatial-temporal correlation; UAV; video object detection;

D O I：

10.1002/dac.5334

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Intelligent unmanned aerial vehicles (UAVs) are drawing more and more attention from industry to academia. UAV navigation plays an important role in the cooperative scenario where multiple UAVs are deployed, while image data that capture the information of the UAV area are often used as input for UAV navigation. Deep learning is a common and powerful technique for UAV image processing, but a complex model generated by deep learning technique is hardly suitable for the limited computing capacity of edge computing devices such as UAVs. Therefore, this paper designs an efficient deep learning model on UAVs to fit the restriction of low computational powers and low power consumption. Traditional UAV object detection methods mostly use static images as the basis for object recognition, or collect images for offline detection. Our method combines the existing fast single-frame detection methods with the spatial-temporal relationship of video sequences, to build an efficient end-to-end model. In addition, the convolutional LSTM module is used to propagate the temporal context of the video frame sequences. Based on the temporal context, we propose a module for calculating spatial correlation. At the same time, we establish our experimental dataset in our real application and conduct the experiment, which shows that the proposed method reduces the size of models and meanwhile maintains the detection rate. Compared with the existing static images approaches, our method is faster and more accurate. Inference speeds of nearly 20fps can be achieved while performing real-time tasks.

引用

页数：13

共 50 条

[11] Multilevel Spatial-Temporal Feature Aggregation for Video Object Detection
Xu, Chao
Zhang, Jiangning
Wang, Mengmeng
Tian, Guanzhong
Liu, Yong
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7809 - 7820
[12] LightUAV-YOLO: a lightweight object detection model for unmanned aerial vehicle image
Lyu, Yifan
Zhang, Tianze
Li, Xin
Liu, Aixun
Shi, Gang
[J]. Journal of Supercomputing, 2025, 81 (01):
[13] LUD-YOLO: A novel lightweight object detection network for unmanned aerial vehicle
Fan, Qingsong
Li, Yiting
Deveci, Muhammet
Zhong, Kaiyang
Kadry, Seifedine
[J]. INFORMATION SCIENCES, 2025, 686
[14] Model-based approach to spatial-temporal sampling of video clips for video object detection by classification
Chuang, Chi-Han
Cheng, Shyi-Chyi
Chang, Chin-Chun
Chen, Yi-Ping Phoebe
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (05) : 1018 - 1030
[15] Deep Spatial-Temporal Joint Feature Representation for Video Object Detection
Zhao, Baojun
Zhao, Boya
Tang, Linbo
Han, Yuqi
Wang, Wenzheng
[J]. SENSORS, 2018, 18 (03)
[16] Learning Complementary Spatial-Temporal Transformer for Video Salient Object Detection
Liu, Nian
Nan, Kepan
Zhao, Wangbo
Yao, Xiwen
Han, Junwei
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 10663 - 10673
[17] End-to-End Video Object Detection with Spatial-Temporal Transformers
He, Lu
Zhou, Qianyu
Li, Xiangtai
Niu, Li
Cheng, Guangliang
Li, Xiao
Liu, Wenxuan
Tong, Yunhai
Ma, Lizhuang
Zhang, Liqing
[J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1507 - 1516
[18] Application of Deep Learning Based Object Detection on Unmanned Aerial Vehicle
Ipek, Burak
Akpinar, Mustafa
[J]. 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 74 - 78
[19] The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking
Du, Dawei
Qi, Yuankai
Yu, Hongyang
Yang, Yifan
Duan, Kaiwen
Li, Guorong
Zhang, Weigang
Huang, Qingming
Tian, Qi
[J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 375 - 391
[20] Object Detection Technique for Small Unmanned Aerial Vehicle
Bin Ramli, M. Faiz
Legowo, Ari
Shamsudin, Syariful Syafiq
[J]. 6TH INTERNATIONAL CONFERENCE ON MECHATRONICS (ICOM'17), 2017, 260

← 1 2 3 4 5 →