End-to-End Learning Deep CRF Models for Multi-Object Tracking Deep CRF Models

被引:49
|
作者
Xiang, Jun [1 ,2 ]
Xu, Guohan [1 ]
Ma, Chao [3 ]
Hou, Jianhua [1 ]
机构
[1] South Cent Univ Nationalities, Hubei Key Lab Intelligent Wireless Commun, Wuhan 430074, Peoples R China
[2] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518172, Peoples R China
[3] Shanghai Jiao Tong Univ, AI Inst, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Target tracking; Machine learning; Recurrent neural networks; Optimization; Task analysis; Standards; Inference algorithms; Multi-object tracking; end-to-end deep learning; conditional random field; data association; MULTITARGET TRACKING; APPROXIMATION;
D O I
10.1109/TCSVT.2020.2975842
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
By bundling multiple complex sub-problems into a unified framework, end-to-end deep learning frameworks reduce the need for hand engineering or tuning of parameters for each component, and optimize different modules jointly to ensure the generalization of the whole deep architecture. Despite tremendous success in numerous computer vision tasks, end-to-end learnings for multi-object tracking (MOT), especially for the assignment problem in data association, have been surprisingly less investigated mainly due to the lack of available training data. Furthermore, it is challenging to discriminate target objects under mutual occlusions or to reduce identity switches in crowded scenes. To tackle these challenges, this paper proposes learning deep conditional random field (CRF) networks, aiming to model the assignment costs as unary potentials and the long-term dependencies among detection results as pairwise potentials. Specifically, we use a bidirectional long short-term memory (LSTM) network to encode the long-term dependencies. We pose the CRF inference as a recurrent neural network learning process using the standard gradient descent algorithm, where unary and pairwise potentials are jointly optimized in an end-to-end manner. Extensive experiments are conducted on the challenging MOT datasets including MOT15, MOT16 and MOT17, and the results show that the proposed algorithm performs favorably against the state-of-the-art methods.
引用
收藏
页码:275 / 288
页数:14
相关论文
共 50 条
  • [21] Online Multi-object Tracking Based on Deep Learning
    Sun, Zheming
    Bo, Chunjuan
    Wang, Dong
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, VOL. 1, 2022, 878 : 33 - 40
  • [22] Deep learning in video multi-object tracking: A survey
    Ciaparrone, Gioele
    Luque Sanchez, Francisco
    Tabik, Siham
    Troiano, Luigi
    Tagliaferri, Roberto
    Herrera, Francisco
    NEUROCOMPUTING, 2020, 381 : 61 - 88
  • [23] Progress Estimation for End-to-End Training of Deep Learning Models With Online Data Preprocessing
    Dong, Qifei
    Luo, Gang
    IEEE ACCESS, 2024, 12 : 18658 - 18684
  • [24] Fusion of End-to-End Deep Learning Models for Sequence-to-Sequence Sleep Staging
    Huy Phan
    Chen, Oliver Y.
    Koch, Philipp
    Mertins, Alfred
    De Vos, Maarten
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 1829 - 1833
  • [25] AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection
    Thanh-Toan Do
    Anh Nguyen
    Reid, Ian
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 5882 - 5889
  • [26] End to End Multi-object Tracking Algorithm Applied to Vehicle Tracking
    Qin, Wenyuan
    Du, Hong
    Zhang, Xiaozheng
    Ren, Xuebing
    2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 367 - 372
  • [27] Parameter Estimation and Contextual Adaptation for a Multi-Object Tracking CRF Model
    Heili, Alexandre
    Odobez, Jean-Marc
    2013 IEEE INTERNATIONAL WORKSHOP ON PERFORMANCE EVALUATION OF TRACKING AND SURVEILLANCE (PETS), 2013, : 14 - 21
  • [28] ZipML: Training Linear Models with End-to-End Low Precision, and a Little Bit of Deep Learning
    Zhang, Hantian
    Li, Jerry
    Kara, Kaan
    Alistarh, Dan
    Liu, Ji
    Zhang, Ce
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [29] Recent progress in deep end-to-end models for spoken language processing
    Audhkhasi, K.
    Rosenberg, A.
    Saon, G.
    Sethy, A.
    Ramabhadran, B.
    Chen, S.
    Picheny, M.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2017, 61 (4-5)
  • [30] Deep learning in multi-object detection and tracking: state of the art
    Pal, Sankar K.
    Pramanik, Anima
    Maiti, J.
    Mitra, Pabitra
    APPLIED INTELLIGENCE, 2021, 51 (09) : 6400 - 6429