Fine-Grained Multi-human Parsing

被引:23
|
作者
Zhao, Jian [1 ,2 ]
Li, Jianshu [1 ]
Liu, Hengzhu [2 ]
Yan, Shuicheng [1 ,3 ]
Feng, Jiashi [1 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] Natl Univ Def Technol, Changsha, Peoples R China
[3] Qihoo 360 AI Inst, Beijing, Peoples R China
关键词
Multi-human parsing; Benchmark dataset; Nested adversarial learning; Generative Adversarial Networks;
D O I
10.1007/s11263-019-01181-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the noticeable progress in perceptual tasks like detection, instance segmentation and human parsing, computers still perform unsatisfactorily on visually understanding humans in crowded scenes, such as group behavior analysis, person re-identification, e-commerce, media editing, video surveillance, autonomous driving and virtual reality, etc. To perform well, models need to comprehensively perceive the semantic information and the differences between instances in a multi-human image, which is recently defined as themulti-human parsingtask. In this paper, we first present a new large-scale database "Multi-human Parsing (MHP v2.0)" for algorithm development and evaluation to advance the research on understanding humans in crowded scenes. MHP v2.0 contains 25,403 elaborately annotated images with 58 fine-grained semantic category labels and 16 dense pose key point labels, involving 2-26 persons per image captured in real-world scenes from various viewpoints, poses, occlusion, interactions and background. We further propose a novel deep Nested Adversarial Network (NAN) model for multi-human parsing. NAN consists of three Generative Adversarial Network-like sub-nets, respectively performing semantic saliency prediction, instance-agnostic parsing and instance-aware clustering. These sub-nets form a nested structure and are carefully designed to learn jointly in an end-to-end way. NAN consistently outperforms existing state-of-the-art solutions on our MHP and several other datasets, including MHP v1.0, PASCAL-Person-Part and Buffy. NAN serves as a strong baseline to shed light on generic instance-level semantic part prediction and drive the future research on multi-human parsing. With the above innovations and contributions, we have organized the CVPR 2018 Workshop on Visual Understanding of Humans in Crowd Scene (VUHCS 2018) and the Fine-Grained Multi-human Parsing and Pose Estimation Challenge. These contributions together significantly benefit the community. Code and pre-trained models are available at.
引用
收藏
页码:2185 / 2203
页数:19
相关论文
共 50 条
  • [1] Fine-Grained Multi-human Parsing
    Jian Zhao
    Jianshu Li
    Hengzhu Liu
    Shuicheng Yan
    Jiashi Feng
    [J]. International Journal of Computer Vision, 2020, 128 : 2185 - 2203
  • [2] Multi-Human Parsing Machines
    Li, Jianshu
    Zhao, Jian
    Chen, Yunpeng
    Roy, Sujoy
    Yan, Shuicheng
    Feng, Jiashi
    Sim, Terence
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 45 - 53
  • [3] Hand Parsing for Fine-Grained Recognition of Human Grasps in Monocular Images
    Saran, Akanksha
    Teney, Damien
    Kitani, Kris M.
    [J]. 2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 5052 - 5058
  • [4] Multi-human Parsing Based on Dynamic Convolution
    Yan, Min
    Zhang, Guoshan
    Zhang, Tong
    Zhang, Yueming
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 7185 - 7190
  • [5] Great Service! Fine-grained Parsing of Implicit Arguments
    Cui, Ruixiang
    Hershcovich, Daniel
    [J]. IWPT 2021: THE 17TH INTERNATIONAL CONFERENCE ON PARSING TECHNOLOGIES: PROCEEDINGS OF THE CONFERENCE (INCLUDING THE IWPT 2021 SHARED TASK), 2021, : 65 - 77
  • [6] FINE-GRAINED GARMENT PARSING: A BODY GENERATION APPROACH
    Zhang, Peng
    Zhang, Yuwei
    Huang, Shan
    Wang, Zhi
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [7] Nondiscriminatory treatment: A straightforward framework for multi-human parsing
    Yan, Min
    Zhang, Guoshan
    Zhang, Tong
    Zhang, Yueming
    [J]. NEUROCOMPUTING, 2021, 460 : 126 - 138
  • [8] Fine-grained parallelism in probabilistic parsing with Habanero Java']Java
    Francis-Landau, Matthew
    Xue, Bing
    Eisner, Jason
    Sarkar, Vivek
    [J]. PROCEEDINGS OF 2016 6TH WORKSHOP ON IRREGULAR APPLICATIONS: ARCHITECTURE AND ALGORITHMS (IA3), 2016, : 78 - 81
  • [9] Fine-Grained Text Sentiment Transfer via Dependency Parsing
    Xiao, Lulu
    Qu, Xiaoye
    Li, Ruixuan
    Wang, Jun
    Zhou, Pan
    Li, Yuhua
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2228 - 2235
  • [10] Local Temporal Bilinear Pooling for Fine-Grained Action Parsing
    Zhang, Yan
    Tang, Siyu
    Muandet, Krikamol
    Jarvers, Christian
    Neumann, Heiko
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11997 - 12007