Entangled appearance and motion structures network for multi-object tracking and segmentation

被引：0

作者：

Aryanfar, Ehsan ^{[1
]}

Shoorehdeli, Mahdi Aliyari ^{[2
]}

Seydi, Vahid ^{[1
,3
]}

机构：

[1] Islamic Azad Univ, Dept Comp Engn, South Tehran Branch, Artificial Intelligence, Tehran, Iran

[2] KN Toosi Univ Technol, Fac Elect Engn, Tehran, Iran

[3] Bangor Univ, Ctr Appl Marine Sci, Sch Ocean Sci, Bangor, Wales

来源：

MACHINE VISION AND APPLICATIONS | 2025年 / 36卷 / 01期

关键词：

ConvLSTM; Multi-object tracking and segmentation; Tracking-by-detection; Variational auto-encoder; ONLINE;

D O I：

10.1007/s00138-024-01634-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The object segmentation mask's observation sequence shows the trend of changes in the object's observable geometric form, and predicting them may assist in solving various difficulties in multi-object tracking and segmentation (MOTS). With this aim, we propose the entangled appearance and motion structures network (EAMSN), which can predict the object segmentation mask at the pixel level by integrating VAE and LSTM. Regardless of the surroundings, each EAMSN keeps complete knowledge about the sequence of probable changes in the seen map of the object and its related dynamics. It suggests that EAMSN understands the item meaningfully and is not reliant on instructive examples. As a result, we propose a novel MOTS algorithm. By employing different EAMSNs for each kind of item and training them offline, ambiguities in the segmentation mask discovered for that object may be recovered, and precise estimation of the real boundaries of the object at each step. We analyze our tracker using the KITTI MOTS and MOTS challenges datasets, which comprise car and pedestrian objects, to illustrate the usefulness of the suggested technique. As a result, we developed distinct EAMSNs for cars and pedestrians, trained using the MODELNET40 and Human3.6 M datasets, respectively. The discrepancy between training and testing data demonstrates that EAMSN is not dependent on training data. Finally, we compared our strategy to a variety of other ways. Compared to the published findings, our technique gets the best overall performance.

引用

页数：16

共 50 条

[41] ETTrack: enhanced temporal motion predictor for multi-object tracking
Han, Xudong
Oishi, Nobuyuki
Tian, Yueying
Ucurum, Elif
Young, Rupert
Chatwin, Chris
Birch, Philip
APPLIED INTELLIGENCE, 2025, 55 (01)
[42] TracTrac: A fast multi-object tracking algorithm for motion estimation
Heyman, Joris
COMPUTERS & GEOSCIENCES, 2019, 128 : 11 - 18
[43] UCMCTrack: Multi-Object Tracking with Uniform Camera Motion Compensation
Yi, Kefu
Luo, Kai
Luo, Xiaolei
Huang, Jiangui
Wu, Hao
Hu, Rongdong
Hao, Wei
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6702 - 6710
[44] Robust Multi-object Tracking for Wide Area Motion Imagery
AL-Shakarji, Noor M.
Bunyak, Filiz
Seetharaman, Guna
Palaniappan, Kannappan
2018 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2018,
[45] UMTSS: a unifocal motion tracking surveillance system for multi-object tracking in videos
Soma Hazra
Shaurjya Mandal
Banani Saha
Sunirmal Khatua
Multimedia Tools and Applications, 2023, 82 : 12401 - 12422
[46] UMTSS: a unifocal motion tracking surveillance system for multi-object tracking in videos
Hazra, Soma
Mandal, Shaurjya
Saha, Banani
Khatua, Sunirmal
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (08) : 12401 - 12422
[47] PANet: An End-to-end Network Based on Relative Motion for Online Multi-object Tracking
Li, Rui
Zhang, Baopeng
Liu, Wei
Teng, Zhu
Fan, Jianping
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
[48] Multi-object trajectory tracking
Han, Mei
Xu, Wei
Tao, Hai
Gong, Yihong
MACHINE VISION AND APPLICATIONS, 2007, 18 (3-4) : 221 - 232
[49] Multi-object tracking in video
Agbinya, JI
Rees, D
REAL-TIME IMAGING, 1999, 5 (05) : 295 - 304
[50] Learning interactive multi-object segmentation through appearance embedding and spatial attention
Gui, Yan
Zhou, Bingqiang
Zhang, Jianming
Sun, Cheng
Xiang, Lingyun
Zhang, Jin
IET IMAGE PROCESSING, 2022, 16 (10) : 2722 - 2737

← 1 2 3 4 5 →