Entangled appearance and motion structures network for multi-object tracking and segmentation

Cited by: 0
Authors
Aryanfar, Ehsan [1]
Shoorehdeli, Mahdi Aliyari [2]
Seydi, Vahid [1, 3]
Affiliations
[1] Islamic Azad Univ, Dept Comp Engn, South Tehran Branch, Artificial Intelligence, Tehran, Iran
[2] KN Toosi Univ Technol, Fac Elect Engn, Tehran, Iran
[3] Bangor Univ, Ctr Appl Marine Sci, Sch Ocean Sci, Bangor, Wales
Keywords
ConvLSTM; Multi-object tracking and segmentation; Tracking-by-detection; Variational auto-encoder; ONLINE;
DOI
10.1007/s00138-024-01634-z
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The sequence of observed segmentation masks for an object shows how its visible geometric form changes over time, and predicting these changes may help resolve various difficulties in multi-object tracking and segmentation (MOTS). To this end, we propose the entangled appearance and motion structures network (EAMSN), which predicts the object segmentation mask at the pixel level by integrating a VAE and an LSTM. Regardless of the surroundings, each EAMSN retains complete knowledge of the sequence of probable changes in the observed map of the object and its related dynamics. This suggests that EAMSN models the object meaningfully and does not rely on instructive examples. Building on this, we propose a novel MOTS algorithm. By employing a separate EAMSN for each object class and training it offline, ambiguities in the segmentation mask detected for an object may be recovered and the real boundaries of the object estimated precisely at each step. To illustrate the usefulness of the proposed technique, we evaluate our tracker on the KITTI MOTS and MOTSChallenge datasets, which contain car and pedestrian objects. Accordingly, we developed distinct EAMSNs for cars and pedestrians, trained on the ModelNet40 and Human3.6M datasets, respectively. The discrepancy between training and testing data demonstrates that EAMSN is not dependent on the training data. Finally, we compared our method with a variety of other approaches. Compared with published results, our technique achieves the best overall performance.
Pages: 16