Entangled appearance and motion structures network for multi-object tracking and segmentation

Cited by: 0
Authors
Aryanfar, Ehsan [1]
Shoorehdeli, Mahdi Aliyari [2]
Seydi, Vahid [1, 3]
Affiliations
[1] Islamic Azad Univ, Dept Comp Engn, South Tehran Branch, Artificial Intelligence, Tehran, Iran
[2] KN Toosi Univ Technol, Fac Elect Engn, Tehran, Iran
[3] Bangor Univ, Ctr Appl Marine Sci, Sch Ocean Sci, Bangor, Wales
Keywords
ConvLSTM; Multi-object tracking and segmentation; Tracking-by-detection; Variational auto-encoder; ONLINE;
DOI
10.1007/s00138-024-01634-z
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The observation sequence of an object's segmentation masks reveals how its observable geometric form changes over time, and predicting those changes can help resolve several difficulties in multi-object tracking and segmentation (MOTS). To this end, we propose the entangled appearance and motion structures network (EAMSN), which predicts the object segmentation mask at the pixel level by integrating a VAE and an LSTM. Regardless of the surroundings, each EAMSN retains complete knowledge of the sequence of probable changes in the visible map of the object and of its related dynamics. This suggests that EAMSN models the object meaningfully and does not rely on training examples. Building on this, we propose a novel MOTS algorithm. By employing a separate EAMSN for each object class and training it offline, ambiguities in the segmentation mask detected for an object can be resolved and the real boundaries of the object can be estimated precisely at each step. To illustrate the usefulness of the proposed technique, we evaluate our tracker on the KITTI MOTS and MOTSChallenge datasets, which contain car and pedestrian objects. Accordingly, we developed distinct EAMSNs for cars and pedestrians, trained on the ModelNet40 and Human3.6M datasets, respectively. The discrepancy between training and testing data indicates that EAMSN is not dependent on its training data. Finally, we compared our method with a variety of other methods; relative to the published results, our technique achieves the best overall performance.
Pages: 16