Entangled appearance and motion structures network for multi-object tracking and segmentation

被引:0
|
作者
Aryanfar, Ehsan [1 ]
Shoorehdeli, Mahdi Aliyari [2 ]
Seydi, Vahid [1 ,3 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, South Tehran Branch, Artificial Intelligence, Tehran, Iran
[2] KN Toosi Univ Technol, Fac Elect Engn, Tehran, Iran
[3] Bangor Univ, Ctr Appl Marine Sci, Sch Ocean Sci, Bangor, Wales
关键词
ConvLSTM; Multi-object tracking and segmentation; Tracking-by-detection; Variational auto-encoder; ONLINE;
D O I
10.1007/s00138-024-01634-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The object segmentation mask's observation sequence shows the trend of changes in the object's observable geometric form, and predicting them may assist in solving various difficulties in multi-object tracking and segmentation (MOTS). With this aim, we propose the entangled appearance and motion structures network (EAMSN), which can predict the object segmentation mask at the pixel level by integrating VAE and LSTM. Regardless of the surroundings, each EAMSN keeps complete knowledge about the sequence of probable changes in the seen map of the object and its related dynamics. It suggests that EAMSN understands the item meaningfully and is not reliant on instructive examples. As a result, we propose a novel MOTS algorithm. By employing different EAMSNs for each kind of item and training them offline, ambiguities in the segmentation mask discovered for that object may be recovered, and precise estimation of the real boundaries of the object at each step. We analyze our tracker using the KITTI MOTS and MOTS challenges datasets, which comprise car and pedestrian objects, to illustrate the usefulness of the suggested technique. As a result, we developed distinct EAMSNs for cars and pedestrians, trained using the MODELNET40 and Human3.6 M datasets, respectively. The discrepancy between training and testing data demonstrates that EAMSN is not dependent on training data. Finally, we compared our strategy to a variety of other ways. Compared to the published findings, our technique gets the best overall performance.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Assignment-Space-based Multi-Object Tracking and Segmentation
    Choudhuri, Anwesa
    Chowdhary, Girish
    Schwing, Alexander G.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13578 - 13587
  • [32] Multi-Object Tracking and Segmentation Via Neural Message Passing
    Braso, Guillem
    Cetintas, Orcun
    Leal-Taixe, Laura
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (12) : 3035 - 3053
  • [33] Learning Discriminative Appearance Models for Online Multi-Object Tracking With Appearance Discriminability Measures
    Lee, Seong-Ho
    Kim, Myung-Yun
    Bae, Seung-Hwan
    IEEE ACCESS, 2018, 6 : 67316 - 67328
  • [34] Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns
    Karazija, Laurynas
    Choudhury, Subhabrata
    Laina, Iro
    Rupprecht, Christian
    Vedaldi, Andrea
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [35] Data Association with Graph Network for Multi-Object Tracking
    Wu, Yubin
    Sheng, Hao
    Wang, Shuai
    Liu, Yang
    Ke, Wei
    Xiong, Zhang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2022, 13368 : 268 - 280
  • [36] On Pairwise Costs for Network Flow Multi-Object Tracking
    Chari, Visesh
    Lacoste-Julien, Simon
    Laptev, Ivan
    Sivic, Josef
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 5537 - 5545
  • [37] Associative affinity network learning for multi-object tracking
    Ma, Liang
    Zhong, Qiaoyong
    Zhang, Yingying
    Xie, Di
    Pu, Shiliang
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2021, 22 (09) : 1194 - 1206
  • [38] Learning Sequential Visual Appearance Transformation for Online Multi-Object Tracking
    Sagastiberri, Itziar
    van de Gevel, Noud
    Garcia, Jorge
    Otaegui, Oihana
    2021 17TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2021), 2021,
  • [39] Online multi-object tracking by detection based on generative appearance models
    Riahi, Dorra
    Bilodeau, Guillaume-Alexandre
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 152 : 88 - 102
  • [40] Discriminative Label Propagation for Multi-Object Tracking with Sporadic Appearance Features
    Kumar, Amit K. C.
    De Vleeschouwer, Christophe
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2000 - 2007