Integrating Pose and Mask Predictions for Multi-person in Videos

被引:1
|
作者
Heo, Miran [1 ,2 ]
Hwang, Sukjun [1 ]
Oh, Seoung Wug [2 ]
Lee, Joon-Young [2 ]
Kim, Seon Joo [1 ]
机构
[1] Yonsei Univ, Seoul, South Korea
[2] Adobe Res, San Jose, CA USA
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022 | 2022年
关键词
D O I
10.1109/CVPRW56347.2022.00299
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In real-world applications for video editing, humans are arguably the most important objects. When editing videos of humans, the efficient tracking of fine-grained masks and body joints is the fundamental requirement. In this paper, we propose a simple and efficient system for jointly tracking pose and segmenting high-quality masks for all humans in the video. We design a pipeline that globally tracks pose and locally segments fine-grained masks. Specifically, CenterTrack is first employed to track human poses by viewing the whole scene, and then the proposed local segmentation network leverages the pose information as a powerful query to carry out high-quality segmentation. Furthermore, we adopt a highly light-weight MLP-Mixer layer within the segmentation network that can efficiently propagate the query pose throughout the region of interest with minimal overhead. For the evaluation, we collect a new benchmark called KineMask which includes various appearances and actions. The experimental results demonstrate that our method has superior fine-grained segmentation performance. Moreover, it runs at 33 fps, achieving a great balance of speed and accuracy compared to the prevailing online Video Instance Segmentation methods.
引用
收藏
页码:2656 / 2665
页数:10
相关论文
共 50 条
  • [1] Multi-Person Hierarchical 3D Pose Estimation in Natural Videos
    Gu, Renshu
    Wang, Gaoang
    Jiang, Zhongyu
    Hwang, Jenq-Neng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4245 - 4257
  • [2] Predicting Dominance in Multi-person Videos
    Bai, Chongyang
    Bolonkin, Maksim
    Kumar, Srijan
    Leskovec, Jure
    Burgoon, Judee
    Dunbar, Norah
    Subrahmanian, V. S.
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4643 - 4650
  • [3] Pose Knowledge Transfer for multi-person pose estimation
    Buwei Li
    Yi Ji
    Ying Li
    Yunlong Xu
    Chunping Liu
    Signal, Image and Video Processing, 2022, 16 : 321 - 328
  • [4] Pose Knowledge Transfer for multi-person pose estimation
    Li, Buwei
    Ji, Yi
    Li, Ying
    Xu, Yunlong
    Liu, Chunping
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (02) : 321 - 328
  • [5] Pose Partition Networks for Multi-person Pose Estimation
    Nie, Xuecheng
    Feng, Jiashi
    Xing, Junliang
    Yan, Shuicheng
    COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 705 - 720
  • [6] Monocular multi-person pose estimation: A survey
    dos Reis, Eduardo Souza
    Seewald, Lucas Adams
    Antunes, Rodolfo Stoffel
    Rodrigues, Vinicius Facco
    Righi, Rodrigo da Rosa
    da Costa, Cristiano Andre
    da Silveira Jr, Luiz Gonzaga
    Eskofier, Bjoern
    Maier, Andreas
    Horz, Tim
    Fahrig, Rebecca
    PATTERN RECOGNITION, 2021, 118
  • [7] Multi-Person Re-Identification Based on Face, Pose and Texture Analysis in Unconstrained Videos
    Gallego, Jaime
    Slater, Mel
    PROCEEDINGS OF 2020 IEEE 21ST INTERNATIONAL CONFERENCE ON COMPUTATIONAL PROBLEMS OF ELECTRICAL ENGINEERING (CPEE), 2020,
  • [8] RMPE: Regional Multi-Person Pose Estimation
    Fang, Hao-Shu
    Xie, Shuqin
    Tai, Yu-Wing
    Lu, Cewu
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362
  • [9] Multi-Person Pose Estimation on Embedded Device
    Ma, Zhipeng
    Tian, Dawei
    Zhang, Ming
    He, Dingxin
    2020 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND HUMAN-COMPUTER INTERACTION (ICHCI 2020), 2020, : 57 - 61
  • [10] The Overview of Multi-person Pose Estimation Method
    Li, Bingyi
    Zou, Jiaqi
    Wang, Luyao
    Li, Xiangyuan
    Li, Yue
    Lei, Rongjia
    Sun, Songlin
    SIGNAL AND INFORMATION PROCESSING, NETWORKING AND COMPUTERS (ICSINC), 2019, 550 : 600 - 607