Boosting End-to-end Multi-Object Tracking and Person Search via Knowledge Distillation

被引:10
|
作者
Zhang, Wei [1 ,3 ]
He, Lingxiao [2 ]
Cheng, Peng [2 ]
Liao, Xingyu [2 ]
Liu, Wu [2 ]
Li, Qi [1 ]
Sun, Zhenan [1 ]
机构
[1] CASIA, CRIPAC & NLPR, Beijing, Peoples R China
[2] JD AI Res, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Multi-object tracking; Person search; End-to-end strategy; Knowledge; distillation; MULTITARGET;
D O I
10.1145/3474085.3481546
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-Object Tracking (MOT) and Person Search both demand to localize and identify specific targets from raw image frames. Existing methods can be classified into two categories, namely twostep strategy and end-to-end strategy. Two-step approaches have high accuracy but suffer from costly computations, while end-toend methods show greater efficiency with limited performance. In this paper, we dissect the gap between two-step and end-to-end strategy and propose a simple yet effective end-to-end framework with knowledge distillation. Our proposed framework is simple in concept and easy to benefit from external datasets. Experimental results demonstrate that our model performs competitively with other sophisticated two-step and end-to-end methods in multiobject tracking and person search.
引用
收藏
页码:1192 / 1201
页数:10
相关论文
共 50 条
  • [21] End-to-End On-Line Multi-object Tracking on Sparse Point Clouds Using Recurrent Convolutional Networks
    Spata, Dominic
    Grumpe, Arne
    Kummert, Anton
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 407 - 419
  • [22] AN END-TO-END ARCHITECTURE FOR CLASS-INCREMENTAL OBJECT DETECTION WITH KNOWLEDGE DISTILLATION
    Hao, Yu
    Fu, Yanwei
    Jiang, Yu-Gang
    Tian, Qi
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1 - 6
  • [23] End-to-end Multi-Object Tracking Algorithm Integrating Global Local Feature Interaction and Angular Momentum Mechanism
    Ji, Zhongping
    Wang, Xiangwei
    He, Zhiwei
    Du, Chenjie
    Jin, Ran
    Chai, Bencheng
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (09): : 3703 - 3712
  • [24] End-to-End Speech Translation with Knowledge Distillation
    Liu, Yuchen
    Xiong, Hao
    Zhang, Jiajun
    He, Zhongjun
    Wu, Hua
    Wang, Haifeng
    Zong, Chengqing
    INTERSPEECH 2019, 2019, : 1128 - 1132
  • [25] Sequential Transformer for End-to-End Person Search
    Chen, Long
    Xu, Jinhua
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT IV, 2024, 14450 : 226 - 238
  • [26] Cascade Transformers for End-to-End Person Search
    Yu, Rui
    Du, Dawei
    LaLonde, Rodney
    Davila, Daniel
    Funk, Christopher
    Hoogs, Anthony
    Clipp, Brian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7257 - 7266
  • [27] Multi-domain Knowledge Distillation via Uncertainty-Matching for End-to-End ASR Models
    Kim, Ho-Gyeong
    Lee, Min-Joong
    Lee, Hoshik
    Kang, Tae Gyoon
    Lee, Jihyun
    Yang, Eunho
    Hwang, Sung Ju
    INTERSPEECH 2021, 2021, : 2531 - 2535
  • [28] ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association
    Ding, Shuxiao
    Schneider, Lukas
    Cordts, Marius
    Gall, Juergen
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15184 - 15194
  • [29] Knowledge Distillation for End-to-End Monaural Multi-talker ASR System
    Zhang, Wangyou
    Chang, Xuankai
    Qian, Yanmin
    INTERSPEECH 2019, 2019, : 2633 - 2637
  • [30] Multi-Attention-Guided Cascading Network for End-to-End Person Search
    Yang, Jianxi
    Wang, Xiaoyong
    APPLIED SCIENCES-BASEL, 2023, 13 (09):