MODELING HUMAN MEMORY IN MULTI-OBJECT TRACKING WITH TRANSFORMERS

被引:2
|
作者
Li, Yizhuo [1 ]
Lu, Cewu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
关键词
Multi-object tracking; Transformer; Human memory modeling; Deep learning;
D O I
10.1109/ICASSP43922.2022.9747572
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
When tracking objects, humans rely on a memory mechanism, memorize the track of an object then look for it in the current scene. In this paper, we propose Memory-based Multi-object Tracking with Transformers (MMTT) to mimic human behavior in multi-object tracking. Unlike Re-ID-based methods, MMTT solves multi-object tracking in an explicit way, with a Track Encoder to extract track memory, a Detection Encoder to extract detection interactions, and a Memory Decoder to simulate the "look" process. The design of MMTT has the ability to model both spatial and temporal information of a single track. We evaluate on commonly used MOT datasets and the experimental results demonstrate its superior effectiveness. We hope this paper can provide a novel direction for the MOT task. The code and models will be made publicly available upon acceptance.
引用
收藏
页码:2849 / 2853
页数:5
相关论文
共 50 条
  • [1] TrackFormer: Multi-Object Tracking with Transformers
    Meinhardt, Tim
    Kirillov, Alexander
    Leal-Taixe, Laura
    Feichtenhofer, Christoph
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8834 - 8844
  • [2] Transformers for Multi-Object Tracking on Point Clouds
    Ruppel, Felicia
    Faion, Florian
    Glaeser, Claudius
    Dietmayer, Klaus
    [J]. 2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 852 - 859
  • [3] MeMOT: Multi-Object Tracking with Memory
    Cai, Jiarui
    Xu, Mingze
    Li, Wei
    Xiong, Yuanjun
    Xia, Wei
    Tu, Zhuowen
    Soatto, Stefano
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8080 - 8090
  • [4] Multi-object tracking of human spermatozoa
    Sorensen, Lauge
    Ostergaard, Jakob
    Johansen, Peter
    de Bruijne, Marleen
    [J]. MEDICAL IMAGING 2008: IMAGE PROCESSING, PTS 1-3, 2008, 6914
  • [5] TLtrack: Combining Transformers and a Linear Model for Robust Multi-Object Tracking
    He, Zuojie
    Zhao, Kai
    Zeng, Dan
    [J]. AI, 2024, 5 (03) : 938 - 947
  • [6] Multi-object tracking via discriminative appearance modeling
    Huang, Shucheng
    Jiang, Shuai
    Zhu, Xia
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 153 : 77 - 87
  • [7] Multi-object trajectory tracking
    Han, Mei
    Xu, Wei
    Tao, Hai
    Gong, Yihong
    [J]. MACHINE VISION AND APPLICATIONS, 2007, 18 (3-4) : 221 - 232
  • [8] Referring Multi-Object Tracking
    Wu, Dongming
    Han, Wencheng
    Wang, Tiancai
    Dong, Xingping
    Zhang, Xiangyu
    Shen, Jianbing
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14633 - 14642
  • [9] Multi-object tracking in video
    Agbinya, JI
    Rees, D
    [J]. REAL-TIME IMAGING, 1999, 5 (05) : 295 - 304
  • [10] Multi-object trajectory tracking
    Mei Han
    Wei Xu
    Hai Tao
    Yihong Gong
    [J]. Machine Vision and Applications, 2007, 18 : 221 - 232