Self-Supervised Multi-View Person Association and its Applications

被引：14

作者：

Vo, Minh ^{[1
]}

Yumer, Ersin ^{[2
]}

Sunkavalli, Kalyan ^{[3
]}

Hadap, Sunil ^{[4
]}

Sheikh, Yaser ^{[1
]}

Narasimhan, Srinivasa G. ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA

[2] Uber ATG, San Francisco, CA 94103 USA

[3] Adobe Res, San Jose, CA 95110 USA

[4] Amazon Lab 126, Sunnyvale, CA 94085 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2021年 / 43卷 / 08期

基金：

美国国家科学基金会;

关键词：

Descriptor adaptation; self-supervised; people association; motion tracking; multi-angle video; MOTION CAPTURE; TRACKING; MULTITARGET;

D O I：

10.1109/TPAMI.2020.2974726

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reliable markerless motion tracking of people participating in a complex group activity from multiple moving cameras is challenging due to frequent occlusions, strong viewpoint and appearance variations, and asynchronous video streams. To solve this problem, reliable association of the same person across distant viewpoints and temporal instances is essential. We present a self-supervised framework to adapt a generic person appearance descriptor to the unlabeled videos by exploitingmotion tracking, mutual exclusion constraints, and multi-view geometry. The adapted discriminative descriptor is used in a tracking-by-clustering formulation. We validate the effectiveness of our descriptor learning on WILDTRACK T. Chavdarova et al., "WILDTRACK: Amulti-camera HD dataset for dense unscripted pedestrian detection," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2018, pp. 5030-5039. and three new complex social scenes captured bymultiple cameras with up to 60 people "in the wild". We report significant improvement in association accuracy (up to 18 percent) and stable and coherent 3D human skeleton tracking (5 to 10 times) over the baseline. Using the reconstructed 3D skeletons, we cut the input videos into a multi-angle videowhere the image of a specified person is shown fromthe best visible front-facing camera. Our algorithm detects inter-human occlusion to determine the camera switching moment while still maintaining the flow of the action well. Website: http://www.cs.cmu.edu/similar to ILIM/projects/IM/Association4Tracking

引用

页码：2794 / 2808

页数：15

共 50 条

[1] Self-supervised Multi-view Multi-Human Association and Tracking
Gan, Yiyang
Han, Ruize
Yin, Liqiang
Feng, Wei
Wang, Song
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 282 - 290
[2] Self-supervised learning for multi-view stereo
Ito S.
Kaneko N.
Sumi K.
Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2020, 86 (12): : 1042 - 1050
[3] Multi-view Self-supervised Heterogeneous Graph Embedding
Zhao, Jianan
Wen, Qianlong
Sun, Shiyu
Ye, Yanfang
Zhang, Chuxu
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 319 - 334
[4] Self-Supervised Deep Multi-View Subspace Clustering
Sun, Xiukun
Cheng, Miaomiao
Min, Chen
Jing, Liping
ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 1001 - 1016
[5] Digging into Uncertainty in Self-supervised Multi-view Stereo
Xu, Hongbin
Zhou, Zhipeng
Wang, Yali
Kang, Wenxiong
Sun, Baigui
Li, Hao
Qiao, Yu
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6058 - 6067
[6] Self-Supervised Representations for Multi-View Reinforcement Learning
Yang, Huanhuan
Shi, Dianxi
Xie, Guojun
Peng, Yingxuan
Zhang, Yi
Yang, Yantai
Yang, Shaowu
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 2203 - 2213
[7] Self-supervised Deep Correlational Multi-view Clustering
Xin, Bowen
Zeng, Shan
Wang, Xiuying
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[8] Self-supervised depth completion with multi-view geometric constraints
Xiong, Mingkang
Zhang, Zhenghong
Liu, Jiyuan
Zhang, Tao
Xiong, Huilin
IET IMAGE PROCESSING, 2023, 17 (11) : 3095 - 3105
[9] Multi-view Self-supervised Disentanglement for General Image Denoising
Chen, Hao
Qu, Chenyuan
Zhang, Yu
Chen, Chen
Jiao, Jianbo
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 12247 - 12257
[10] Self-supervised Learning of Depth Inference for Multi-view Stereo
Yang, Jiayu
Alvarez, Jose M.
Liu, Miaomiao
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7522 - 7530

← 1 2 3 4 5 →