Joint Detection and Association for End-to-End Multi-object Tracking

Authors
Ye Li
Xiaoyu Luo
Junyu Shi
Xinzhong Wang
Guangqiang Yin
Zhiguo Wang
Affiliations
[1] Shenzhen Institute of Information Technology
[2] University of Electronic Science and Technology of China
[3] Kashi Institute of Electronics and Information Industry
[4] Shenzhen University
Source
Neural Processing Letters | 2023 / Vol. 55
Keywords
Multi-object tracking; Joint detection and association; End-to-end
DOI
Not available
Abstract
Multi-object tracking (MOT) is mainly used for detecting and tracking objects across multiple cameras, and is widely applied in intelligent video surveillance and intelligent security. The MOT process generally involves three important parts: feature extraction, multi-task learning, and object matching. Unfortunately, existing methods still have some drawbacks. First, the feature extraction module cannot effectively fuse shallow and deep features. Second, the multi-task learning module cannot strike a good balance between detection and re-identification. Third, the object matching module associates pedestrians using a traditional method rather than a trained model. To address these problems, we propose a joint detection and association (JDA) network for end-to-end multi-object tracking, which involves multi-scale feature extraction and learnable object association. It first combines a feature extraction backbone based on multi-scale feature fusion with a point-based multi-task object detection branch to handle feature extraction and object detection. Then, a learnable object motion association module is embedded, which uses information from historical frames to infer the position of each object and to associate object identities between previous and subsequent frames. In addition, JDA can be trained end-to-end across the detection and matching tasks. The proposed JDA is evaluated through a series of experiments on MOT16 and MOT17. The results show that JDA outperforms the existing methods in terms of precision and stability of MOT.
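The association step described in the abstract (infer each object's position from historical frames, then carry identities forward to the current frame) can be illustrated with a minimal sketch. This is not the paper's method: the JDA association module is learned, whereas the sketch below substitutes a hand-written constant-velocity prediction and greedy nearest-neighbour matching purely to show the data flow. The function names `predict_position` and `associate`, the `max_dist` gate, and the sample coordinates are all hypothetical.

```python
from math import hypot

def predict_position(history):
    """Extrapolate the next (x, y) center from a track's past centers.

    Assumption: constant velocity between the last two frames. This is a
    hand-written stand-in for a learned motion module, not the paper's model.
    """
    if len(history) < 2:
        return history[-1]
    (x0, y0), (x1, y1) = history[-2], history[-1]
    return (2 * x1 - x0, 2 * y1 - y0)

def associate(tracks, detections, max_dist=50.0):
    """Greedily match detections to tracks by distance to the predicted center.

    tracks: dict of integer track_id -> list of past (x, y) centers, oldest first.
    detections: list of (x, y) centers in the current frame.
    Returns dict of detection_index -> track_id; unmatched detections get new ids.
    """
    predictions = {tid: predict_position(h) for tid, h in tracks.items()}
    # Candidate (distance, track, detection) triples within the gate, closest first.
    pairs = sorted(
        (hypot(px - dx, py - dy), tid, di)
        for tid, (px, py) in predictions.items()
        for di, (dx, dy) in enumerate(detections)
        if hypot(px - dx, py - dy) <= max_dist
    )
    assignment, used_tracks = {}, set()
    for _, tid, di in pairs:
        if tid not in used_tracks and di not in assignment:
            assignment[di] = tid
            used_tracks.add(tid)
    # Any detection left unmatched starts a new track with a fresh id.
    next_id = max(tracks, default=-1) + 1
    for di in range(len(detections)):
        if di not in assignment:
            assignment[di] = next_id
            next_id += 1
    return assignment

# Hypothetical example: two existing tracks and three detections in the new frame.
tracks = {0: [(10, 10), (20, 10)], 1: [(100, 50), (100, 60)]}
detections = [(101, 71), (29, 11), (300, 300)]
result = associate(tracks, detections)
```

In this toy input, the first two detections fall close to the extrapolated positions of tracks 1 and 0 respectively, and the third lies outside the gate and is given a new identity. A learned module would replace both the extrapolation and the distance-based matching with trained components.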
Pages: 11823 - 11844
Page count: 21