Entangled appearance and motion structures network for multi-object tracking and segmentation

Cited by: 0

Authors:

Aryanfar, Ehsan [1]

Shoorehdeli, Mahdi Aliyari [2]

Seydi, Vahid [1,3]

Affiliations:

[1] Islamic Azad Univ, Dept Comp Engn, South Tehran Branch, Artificial Intelligence, Tehran, Iran

[2] KN Toosi Univ Technol, Fac Elect Engn, Tehran, Iran

[3] Bangor Univ, Ctr Appl Marine Sci, Sch Ocean Sci, Bangor, Wales

Source:

MACHINE VISION AND APPLICATIONS | 2025 / Vol. 36 / Issue 01

Keywords:

ConvLSTM; Multi-object tracking and segmentation; Tracking-by-detection; Variational auto-encoder; ONLINE;

DOI:

10.1007/s00138-024-01634-z

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The sequence of observed segmentation masks for an object shows how its visible geometric form changes over time, and predicting these masks may help address several difficulties in multi-object tracking and segmentation (MOTS). To this end, we propose the entangled appearance and motion structures network (EAMSN), which predicts the object segmentation mask at the pixel level by integrating a variational auto-encoder (VAE) with a long short-term memory (LSTM) network. Regardless of the surroundings, each EAMSN retains complete knowledge of the sequence of probable changes in the object's visible map and of the related dynamics. This suggests that EAMSN captures a meaningful representation of the object and does not rely on instructive examples. Building on this, we propose a novel MOTS algorithm. By using a separate EAMSN for each object class and training it offline, ambiguities in the segmentation mask detected for an object can be resolved, and the object's true boundaries can be estimated precisely at each step. To illustrate the usefulness of the proposed technique, we evaluate our tracker on the KITTI MOTS and MOTSChallenge datasets, which contain car and pedestrian objects. Accordingly, we developed distinct EAMSNs for cars and pedestrians, trained on the ModelNet40 and Human3.6M datasets, respectively. The discrepancy between training and testing data indicates that EAMSN does not depend on the specific training data. Finally, we compared our approach with a variety of other methods; relative to the published results, it achieves the best overall performance.
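The abstract says that a predicted mask can help resolve ambiguities in the detected mask, but it does not state how predictions and detections are paired. As a rough illustration only, the sketch below assumes a common tracking-by-detection choice: score each (predicted mask, detected mask) pair by mask IoU and match greedily. The names `mask_iou`, `greedy_match`, and the `min_iou` threshold are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Optional

Mask = List[List[int]]  # binary mask stored as rows of 0/1 values


def mask_iou(a: Mask, b: Mask) -> float:
    """Intersection-over-union of two equally sized binary masks."""
    inter = 0
    union = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            inter += 1 if (pa and pb) else 0
            union += 1 if (pa or pb) else 0
    return inter / union if union else 0.0


def greedy_match(
    predicted: Dict[int, Mask],
    detected: List[Mask],
    min_iou: float = 0.5,  # assumed threshold, not from the paper
) -> Dict[int, Optional[int]]:
    """Pair each track's predicted mask with at most one detected mask.

    Candidate pairs are visited in order of decreasing IoU. Tracks whose
    best remaining pair falls below min_iou stay unmatched (None).
    """
    pairs = sorted(
        (
            (mask_iou(p, d), track_id, det_idx)
            for track_id, p in predicted.items()
            for det_idx, d in enumerate(detected)
        ),
        key=lambda t: t[0],
        reverse=True,
    )
    result: Dict[int, Optional[int]] = {track_id: None for track_id in predicted}
    used = set()
    for iou, track_id, det_idx in pairs:
        if iou < min_iou:
            break  # all remaining pairs are weaker still
        if result[track_id] is None and det_idx not in used:
            result[track_id] = det_idx
            used.add(det_idx)
    return result


# Toy example: two tracks (ids 7 and 9) and two detections on a 2x3 grid.
predicted = {
    7: [[1, 1, 0], [1, 1, 0]],
    9: [[0, 0, 1], [0, 0, 1]],
}
detected = [
    [[0, 0, 1], [0, 0, 0]],
    [[1, 1, 0], [1, 0, 0]],
]
matches = greedy_match(predicted, detected)
```

In this toy case track 7 pairs with detection 1 and track 9 with detection 0. A real tracker would more likely use an optimal assignment and handle track birth and termination, which this sketch omits.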


Pages: 16

50 items in total

[1] DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion
Sun, Peize
Cao, Jinkun
Jiang, Yi
Yuan, Zehuan
Bai, Song
Kitani, Kris
Luo, Ping
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 20961 - 20970
[2] Structure and Appearance Preserving Network Flow for Multi-object Tracking
Pu, Shi
Zhang, Honggang
Zhao, Kaili
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016: 1804 - 1808
[3] Robust Multi-Object Tracking With Local Appearance and Stable Motion Models
Hwang, Jubi
Shim, Kyujin
Ko, Kangwook
Ha, Namkoo
Kim, Changick
IEEE ACCESS, 2023, 11: 77023 - 77033
[4] MOTS: Multi-Object Tracking and Segmentation
Voigtlaender, Paul
Krause, Michael
Osep, Aljosa
Luiten, Jonathon
Sekar, Berin Balachandar Gnana
Geiger, Andreas
Leibe, Bastian
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 7934 - 7943
[5] Appearance Guidance Attention for Multi-Object Tracking
Chen, Yong
Huang, Junjie
Liu, Huanlin
Huang, Meiyong
Zou, Zhibo
IEEE ACCESS, 2021, 9: 103184 - 103193
[6] Multi-object tracking through learning relational appearance features and motion patterns
Gwak, Jeonghwan
COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 162: 103 - 115
[7] Multi-object model-free tracking with joint appearance and motion inference
Liu, Chongyu
Yao, Rui
Rezatofighi, S. Hamid
Reid, Ian
Shi, Qinfeng
2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017: 604 - 611
[8] Multi-Object Tracking and Segmentation with a Space-Time Memory Network
Miah, Mehdi
Bilodeau, Guillaume-Alexandre
Saunier, Nicolas
2023 20TH CONFERENCE ON ROBOTS AND VISION, CRV, 2023: 184 - 193
[9] Weakly Supervised Multi-Object Tracking and Segmentation
Ruiz, Idoia
Porzi, Lorenzo
Bulo, Samuel Rota
Kontschieder, Peter
Serrat, Joan
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2021), 2021: 125 - 133
[10] Enhancing Multi-Object Tracking with Siamese Network-based Appearance Search
Liu, Jinliang
Lv, Zheng
Zhao, Jun
Liu, Shenglan
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2024, 18 (12): 3513 - 3526
