End-to-End Learning Deep CRF Models for Multi-Object Tracking Deep CRF Models

被引:49
|
作者
Xiang, Jun [1 ,2 ]
Xu, Guohan [1 ]
Ma, Chao [3 ]
Hou, Jianhua [1 ]
机构
[1] South Cent Univ Nationalities, Hubei Key Lab Intelligent Wireless Commun, Wuhan 430074, Peoples R China
[2] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518172, Peoples R China
[3] Shanghai Jiao Tong Univ, AI Inst, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Target tracking; Machine learning; Recurrent neural networks; Optimization; Task analysis; Standards; Inference algorithms; Multi-object tracking; end-to-end deep learning; conditional random field; data association; MULTITARGET TRACKING; APPROXIMATION;
D O I
10.1109/TCSVT.2020.2975842
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
By bundling multiple complex sub-problems into a unified framework, end-to-end deep learning frameworks reduce the need for hand engineering or tuning of parameters for each component, and optimize different modules jointly to ensure the generalization of the whole deep architecture. Despite tremendous success in numerous computer vision tasks, end-to-end learnings for multi-object tracking (MOT), especially for the assignment problem in data association, have been surprisingly less investigated mainly due to the lack of available training data. Furthermore, it is challenging to discriminate target objects under mutual occlusions or to reduce identity switches in crowded scenes. To tackle these challenges, this paper proposes learning deep conditional random field (CRF) networks, aiming to model the assignment costs as unary potentials and the long-term dependencies among detection results as pairwise potentials. Specifically, we use a bidirectional long short-term memory (LSTM) network to encode the long-term dependencies. We pose the CRF inference as a recurrent neural network learning process using the standard gradient descent algorithm, where unary and pairwise potentials are jointly optimized in an end-to-end manner. Extensive experiments are conducted on the challenging MOT datasets including MOT15, MOT16 and MOT17, and the results show that the proposed algorithm performs favorably against the state-of-the-art methods.
引用
收藏
页码:275 / 288
页数:14
相关论文
共 50 条
  • [1] End-to-End Learning Deep CRF Models for Multi-Object Tracking Deep CRF Models (vol 31, pg 275, 2021)
    Xiang, Jun
    Xu, Guohan
    Ma, Chao
    Hou, Jianhua
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (02) : 828 - 828
  • [2] End-to-End Training of Hybrid CNN-CRF Models for Stereo
    Knoebelreiter, Patrick
    Reinbacher, Christian
    Shekhovtsov, Alexander
    Pock, Thomas
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1456 - 1465
  • [3] Joint Detection and Association for End-to-End Multi-object Tracking
    Li, Ye
    Luo, Xiaoyu
    Shi, Junyu
    Wang, Xinzhong
    Yin, Guangqiang
    Wang, Zhiguo
    NEURAL PROCESSING LETTERS, 2023, 55 (09) : 11823 - 11844
  • [4] Joint Detection and Association for End-to-End Multi-object Tracking
    Ye Li
    Xiaoyu Luo
    Junyu Shi
    Xinzhong Wang
    Guangqiang Yin
    Zhiguo Wang
    Neural Processing Letters, 2023, 55 : 11823 - 11844
  • [5] Tracking Beyond Detection: Learning a Global Response Map for End-to-End Multi-Object Tracking
    Wan, Xingyu
    Cao, Jiakai
    Zhou, Sanping
    Wang, Jinjun
    Zheng, Nanning
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 8222 - 8235
  • [6] Protecting the Ownership of Deep Learning Models with An End-to-End Watermarking Framework
    Zhang, Wei
    Cui, Wenxue
    Jiang, Feng
    Yang, Chifu
    Li, Ran
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 76 - 82
  • [7] Learning Diverse Models for End-to-End Ensemble Tracking
    Wang, Ning
    Zhou, Wengang
    Li, Houqiang
    IEEE Transactions on Image Processing, 2021, 30 : 2220 - 2231
  • [8] End-to-End Deep Structured Models for Drawing Crosswalks
    Liang, Justin
    Urtasun, Raquel
    COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 407 - 423
  • [9] Learning Diverse Models for End-to-End Ensemble Tracking
    Wang, Ning
    Zhou, Wengang
    Li, Houqiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2220 - 2231
  • [10] End-to-End Joint Multi-Object Detection and Tracking for Intelligent Transportation Systems
    Xu, Qing
    Lin, Xuewu
    Cai, Mengchi
    Guo, Yu-ang
    Zhang, Chuang
    Li, Kai
    Li, Keqiang
    Wang, Jianqiang
    Cao, Dongpu
    CHINESE JOURNAL OF MECHANICAL ENGINEERING, 2023, 36 (01)