Self-Supervised Multi-View Person Association and its Applications

被引:14
|
作者
Vo, Minh [1 ]
Yumer, Ersin [2 ]
Sunkavalli, Kalyan [3 ]
Hadap, Sunil [4 ]
Sheikh, Yaser [1 ]
Narasimhan, Srinivasa G. [1 ]
机构
[1] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
[2] Uber ATG, San Francisco, CA 94103 USA
[3] Adobe Res, San Jose, CA 95110 USA
[4] Amazon Lab 126, Sunnyvale, CA 94085 USA
基金
美国国家科学基金会;
关键词
Descriptor adaptation; self-supervised; people association; motion tracking; multi-angle video; MOTION CAPTURE; TRACKING; MULTITARGET;
D O I
10.1109/TPAMI.2020.2974726
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reliable markerless motion tracking of people participating in a complex group activity from multiple moving cameras is challenging due to frequent occlusions, strong viewpoint and appearance variations, and asynchronous video streams. To solve this problem, reliable association of the same person across distant viewpoints and temporal instances is essential. We present a self-supervised framework to adapt a generic person appearance descriptor to the unlabeled videos by exploitingmotion tracking, mutual exclusion constraints, and multi-view geometry. The adapted discriminative descriptor is used in a tracking-by-clustering formulation. We validate the effectiveness of our descriptor learning on WILDTRACK T. Chavdarova et al., "WILDTRACK: Amulti-camera HD dataset for dense unscripted pedestrian detection," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2018, pp. 5030-5039. and three new complex social scenes captured bymultiple cameras with up to 60 people "in the wild". We report significant improvement in association accuracy (up to 18 percent) and stable and coherent 3D human skeleton tracking (5 to 10 times) over the baseline. Using the reconstructed 3D skeletons, we cut the input videos into a multi-angle videowhere the image of a specified person is shown fromthe best visible front-facing camera. Our algorithm detects inter-human occlusion to determine the camera switching moment while still maintaining the flow of the action well. Website: http://www.cs.cmu.edu/similar to ILIM/projects/IM/Association4Tracking
引用
收藏
页码:2794 / 2808
页数:15
相关论文
共 50 条
  • [1] Self-supervised Multi-view Multi-Human Association and Tracking
    Gan, Yiyang
    Han, Ruize
    Yin, Liqiang
    Feng, Wei
    Wang, Song
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 282 - 290
  • [2] Self-supervised learning for multi-view stereo
    Ito S.
    Kaneko N.
    Sumi K.
    Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2020, 86 (12): : 1042 - 1050
  • [3] Multi-view Self-supervised Heterogeneous Graph Embedding
    Zhao, Jianan
    Wen, Qianlong
    Sun, Shiyu
    Ye, Yanfang
    Zhang, Chuxu
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 319 - 334
  • [4] Self-Supervised Deep Multi-View Subspace Clustering
    Sun, Xiukun
    Cheng, Miaomiao
    Min, Chen
    Jing, Liping
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 1001 - 1016
  • [5] Digging into Uncertainty in Self-supervised Multi-view Stereo
    Xu, Hongbin
    Zhou, Zhipeng
    Wang, Yali
    Kang, Wenxiong
    Sun, Baigui
    Li, Hao
    Qiao, Yu
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6058 - 6067
  • [6] Self-Supervised Representations for Multi-View Reinforcement Learning
    Yang, Huanhuan
    Shi, Dianxi
    Xie, Guojun
    Peng, Yingxuan
    Zhang, Yi
    Yang, Yantai
    Yang, Shaowu
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 2203 - 2213
  • [7] Self-supervised Deep Correlational Multi-view Clustering
    Xin, Bowen
    Zeng, Shan
    Wang, Xiuying
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [8] Self-supervised depth completion with multi-view geometric constraints
    Xiong, Mingkang
    Zhang, Zhenghong
    Liu, Jiyuan
    Zhang, Tao
    Xiong, Huilin
    IET IMAGE PROCESSING, 2023, 17 (11) : 3095 - 3105
  • [9] Multi-view Self-supervised Disentanglement for General Image Denoising
    Chen, Hao
    Qu, Chenyuan
    Zhang, Yu
    Chen, Chen
    Jiao, Jianbo
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 12247 - 12257
  • [10] Self-supervised Learning of Depth Inference for Multi-view Stereo
    Yang, Jiayu
    Alvarez, Jose M.
    Liu, Miaomiao
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7522 - 7530