Entangled appearance and motion structures network for multi-object tracking and segmentation

被引：0

作者：

Aryanfar, Ehsan ^{[1
]}

Shoorehdeli, Mahdi Aliyari ^{[2
]}

Seydi, Vahid ^{[1
,3
]}

机构：

[1] Islamic Azad Univ, Dept Comp Engn, South Tehran Branch, Artificial Intelligence, Tehran, Iran

[2] KN Toosi Univ Technol, Fac Elect Engn, Tehran, Iran

[3] Bangor Univ, Ctr Appl Marine Sci, Sch Ocean Sci, Bangor, Wales

来源：

MACHINE VISION AND APPLICATIONS | 2025年 / 36卷 / 01期

关键词：

ConvLSTM; Multi-object tracking and segmentation; Tracking-by-detection; Variational auto-encoder; ONLINE;

D O I：

10.1007/s00138-024-01634-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The object segmentation mask's observation sequence shows the trend of changes in the object's observable geometric form, and predicting them may assist in solving various difficulties in multi-object tracking and segmentation (MOTS). With this aim, we propose the entangled appearance and motion structures network (EAMSN), which can predict the object segmentation mask at the pixel level by integrating VAE and LSTM. Regardless of the surroundings, each EAMSN keeps complete knowledge about the sequence of probable changes in the seen map of the object and its related dynamics. It suggests that EAMSN understands the item meaningfully and is not reliant on instructive examples. As a result, we propose a novel MOTS algorithm. By employing different EAMSNs for each kind of item and training them offline, ambiguities in the segmentation mask discovered for that object may be recovered, and precise estimation of the real boundaries of the object at each step. We analyze our tracker using the KITTI MOTS and MOTS challenges datasets, which comprise car and pedestrian objects, to illustrate the usefulness of the suggested technique. As a result, we developed distinct EAMSNs for cars and pedestrians, trained using the MODELNET40 and Human3.6 M datasets, respectively. The discrepancy between training and testing data demonstrates that EAMSN is not dependent on training data. Finally, we compared our strategy to a variety of other ways. Compared to the published findings, our technique gets the best overall performance.

引用

页数：16

共 50 条

[31] Assignment-Space-based Multi-Object Tracking and Segmentation
Choudhuri, Anwesa
Chowdhary, Girish
Schwing, Alexander G.
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13578 - 13587
[32] Multi-Object Tracking and Segmentation Via Neural Message Passing
Braso, Guillem
Cetintas, Orcun
Leal-Taixe, Laura
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (12) : 3035 - 3053
[33] Learning Discriminative Appearance Models for Online Multi-Object Tracking With Appearance Discriminability Measures
Lee, Seong-Ho
Kim, Myung-Yun
Bae, Seung-Hwan
IEEE ACCESS, 2018, 6 : 67316 - 67328
[34] Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns
Karazija, Laurynas
Choudhury, Subhabrata
Laina, Iro
Rupprecht, Christian
Vedaldi, Andrea
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[35] Data Association with Graph Network for Multi-Object Tracking
Wu, Yubin
Sheng, Hao
Wang, Shuai
Liu, Yang
Ke, Wei
Xiong, Zhang
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2022, 13368 : 268 - 280
[36] On Pairwise Costs for Network Flow Multi-Object Tracking
Chari, Visesh
Lacoste-Julien, Simon
Laptev, Ivan
Sivic, Josef
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 5537 - 5545
[37] Associative affinity network learning for multi-object tracking
Ma, Liang
Zhong, Qiaoyong
Zhang, Yingying
Xie, Di
Pu, Shiliang
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2021, 22 (09) : 1194 - 1206
[38] Learning Sequential Visual Appearance Transformation for Online Multi-Object Tracking
Sagastiberri, Itziar
van de Gevel, Noud
Garcia, Jorge
Otaegui, Oihana
2021 17TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2021), 2021,
[39] Online multi-object tracking by detection based on generative appearance models
Riahi, Dorra
Bilodeau, Guillaume-Alexandre
COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 152 : 88 - 102
[40] Discriminative Label Propagation for Multi-Object Tracking with Sporadic Appearance Features
Kumar, Amit K. C.
De Vleeschouwer, Christophe
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2000 - 2007

← 1 2 3 4 5 →