CSMOT: Make One-Shot Multi-Object Tracking in Crowded Scenes Great Again

被引:2
|
作者
Hou, Haoxiong [1 ,2 ]
Shen, Chao [1 ,2 ]
Zhang, Ximing [1 ]
Gao, Wei [1 ]
机构
[1] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Xian 710119, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
关键词
one-shot; multi-object tracking; re-ID; coordinate attention; angle-center loss; data association;
D O I
10.3390/s23073782
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The current popular one-shot multi-object tracking (MOT) algorithms are dominated by the joint detection and embedding paradigm, which have high inference speeds and accuracy, but their tracking performance is unstable in crowded scenes. Not only does the detection branch have difficulty in obtaining the accurate object position, but the ambiguous appearance of features extracted by the re-identification (re-ID) branch also leads to identity switches. Focusing on the above problems, this paper proposes a more robust MOT algorithm, named CSMOT, based on FairMOT. First, on the basis of the encoder-decoder network, a coordinate attention module is designed to enhance the information interaction between channels (horizontal and vertical coordinates), which improves its object-detection abilities. Then, an angle-center loss that effectively maximizes intra-class similarity is proposed to optimize the re-ID branch, and the extracted re-ID features are made more discriminative. We further redesign the re-ID feature dimension to balance the detection and re-ID tasks. Finally, a simple and effective data association mechanism is introduced, which associates each detection instead of just the high-score detections during the tracking process. The experimental results show that our one-shot MOT algorithm achieves excellent tracking performance on multiple public datasets and can be effectively applied to crowded scenes. In particular, CSMOT decreases the number of ID switches by 11.8% and 33.8% on the MOT16 and MOT17 test datasets, respectively, compared to the baseline.
引用
收藏
页数:17
相关论文
共 42 条
  • [21] Similarity-based Calibration Method for Zero-Shot Recognition in Multi-Object Scenes
    Chang, Doo Soo
    Cho, Gun Hee
    Choi, Yong Suk
    PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 1096 - 1103
  • [22] Unreal mask: one-shot multi-object class-based pose estimation for robotic manipulation using keypoints with a synthetic dataset
    Zabihifar, S. H.
    Semochkin, A. N.
    Seliverstova, E. V.
    Efimov, A. R.
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (19): : 12283 - 12300
  • [23] Unreal mask: one-shot multi-object class-based pose estimation for robotic manipulation using keypoints with a synthetic dataset
    S. H. Zabihifar
    A. N. Semochkin
    E. V. Seliverstova
    A. R. Efimov
    Neural Computing and Applications, 2021, 33 : 12283 - 12300
  • [24] Feature-Based Multi-Object Tracking With Maximally One Object per Class
    Krejci, Jan
    Straka, Ondrej
    Vyskocil, Jiri
    Jirik, Miroslav
    Dahmen, Uta
    2022 25TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2022), 2022,
  • [25] LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
    Yan, Bin
    Peng, Houwen
    Wu, Kan
    Wang, Dong
    Fu, Jianlong
    Lu, Huchuan
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15175 - 15184
  • [26] An Efficient One-Shot Network and Robust Data Associations in Multi-pedestrian Tracking
    He, Fuxiao
    Xiao, Guoqiang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT II, KSEM 2023, 2023, 14118 : 104 - 120
  • [27] Real-Time Human Group Detection and Clustering in Crowded Environments Using Enhanced Multi-Object Tracking
    Lee, Hyunmin
    Kang, Donggoo
    Park, Hasil
    Park, Sangwoo
    Jeong, Dasol
    Paik, Joonki
    IEEE Access, 2024, 12 : 184028 - 184039
  • [28] TMTB: Transformer based multi-task branching multi-object tracking algorithm for wide-view scenes
    Hongjun Li
    Jiaxin Li
    Multimedia Tools and Applications, 2024, 83 : 41015 - 41032
  • [29] TMTB: Transformer based multi-task branching multi-object tracking algorithm for wide-view scenes
    Li, Hongjun
    Li, Jiaxin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 41015 - 41032
  • [30] Vehicular/Non-Vehicular Multi-Class Multi-Object Tracking in Drone-Based Aerial Scenes
    Bisio, Igor
    Garibotto, Chiara
    Haleem, Halar
    Lavagetto, Fabio
    Sciarrone, Andrea
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (04) : 4961 - 4977