Toward Real-Time UAV Multi-Target Tracking Using Joint Detection and Tracking

被引:3
|
作者
Keawboontan, Tinnakorn [1 ]
Thammawichai, Mason [2 ]
机构
[1] Navaminda Kasatriyadhiraj Royal Air Force Acad, Grad Sch, Bangkok 10220, Thailand
[2] Navaminda Kasatriyadhiraj Royal Air Force Acad, Elect Engn Dept, Bangkok 10220, Thailand
关键词
Multi-object detection; UAV; deep learning; target tracking; MODEL;
D O I
10.1109/ACCESS.2023.3283411
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multiple object tracking (MOT) of unmanned aerial vehicle (UAV) systems is essential for both defense and civilian applications. As drone technology moves towards real-time, conventional tracking algorithms cannot be directly applied to UAV videos due to limited computational resources and the unstable movements of UAVs in dynamic environments. These challenges lead to blurry video frames, object occlusion, scale changes, and biased data distribution of object classes and samples, resulting in poor tracking accuracy for non-representative classes. Therefore, in this study, we present a deep learning multiple object tracking model for UAV aerial videos to achieve real-time performance. Our approach combines detection and tracking methods using adjacent frame pairs as inputs with shared features to reduce computational time. We also employed a multi-loss function to address the imbalance between the challenging classes and samples. To associate objects between frames, a dual regression bounding box method that considers the center distance of objects rather than just their areas was adopted. This enables the proposed model to perform object ID verification and movement forecasting via single regression. In addition, our model can perform online tracking by predicting the position of an object within the next video frame. By exploiting both low- and high-quality detection techniques to locate the same object across frames, more accurate tracking of objects within the video is attained. The proposed method achieved real-time tracking with a running time of 77 frames per second. The testing results have demonstrated that our approach outperformed the state-of-the-art on the VisDrone2019 test-dev dataset for all ten object categories. In particular, the multiple object tracking accuracy (MOTA) score and the F1 score both increased in comparison to earlier work by 8.7 and 5.3 percent, respectively.
引用
收藏
页码:65238 / 65254
页数:17
相关论文
共 50 条
  • [31] Joint tracking sequence and dwell time allocation for multi-target tracking with phased array radar
    Yuan, Ye
    Yi, Wei
    Kong, Lingjiang
    [J]. SIGNAL PROCESSING, 2022, 192
  • [32] REAL-TIME TARGET TRACKING
    BAUMELA, L
    MARAVALL, D
    [J]. IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 1995, 10 (07) : 4 - 7
  • [33] Real-time infrared multi-class multi-target anchor-free tracking network
    Song, Zizhuang
    Yang, Jiawei
    Zhang, Dongfang
    Wang, Shiqiang
    Zhang, Shuo
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2022, 44 (02): : 401 - 409
  • [34] Real-Time Automatic Target Detection and Tracking using Visual Feedback
    Khan, A.
    Ali, S. S. A.
    Omer, M.
    Raza, K.
    [J]. 2014 5TH INTERNATIONAL CONFERENCE ON INTELLIGENT AND ADVANCED SYSTEMS (ICIAS 2014), 2014,
  • [35] A lightweight multi-target real-time detection model
    Qiu, Bo
    Liu, Xiang
    Shi, Yunyu
    Shang, Yanfeng
    [J]. Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2020, 46 (09): : 1778 - 1785
  • [36] Multi-target visual tracking using occlusion prediction and detection
    Lee, H
    Ko, H
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4050 - 4050
  • [37] Real-time Multi-Target Tracking at 210 Megapixels/second in Wide Area Motion Imagery
    Basharat, Arslan
    Turek, Matt
    Xu, Yiliang
    Atkins, Chuck
    Stoup, David
    Fieldhouse, Keith
    Tunison, Paul
    Hoogs, Anthony
    [J]. 2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 839 - 846
  • [38] Real-time and Online Segmentation Multi-target Tracking with Track Revival Re-identification
    Ahrnbom, Martin
    Nilsson, Mikael
    Ardo, Hakan
    [J]. VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 777 - 784
  • [39] Real-time cooperative multi-target tracking by dense communication among active vision agents
    Ukita, N
    [J]. 2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2005, : 664 - 671
  • [40] Multi-target Joint Tracking and Classification Using the Trajectory PHD Filter
    Wei, Shaoxiu
    Zhang, Boxiang
    Yi, Wei
    [J]. 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2021, : 1094 - 1101