Fine-Grained Multi-human Parsing

被引:23
|
作者
Zhao, Jian [1 ,2 ]
Li, Jianshu [1 ]
Liu, Hengzhu [2 ]
Yan, Shuicheng [1 ,3 ]
Feng, Jiashi [1 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] Natl Univ Def Technol, Changsha, Peoples R China
[3] Qihoo 360 AI Inst, Beijing, Peoples R China
关键词
Multi-human parsing; Benchmark dataset; Nested adversarial learning; Generative Adversarial Networks;
D O I
10.1007/s11263-019-01181-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the noticeable progress in perceptual tasks like detection, instance segmentation and human parsing, computers still perform unsatisfactorily on visually understanding humans in crowded scenes, such as group behavior analysis, person re-identification, e-commerce, media editing, video surveillance, autonomous driving and virtual reality, etc. To perform well, models need to comprehensively perceive the semantic information and the differences between instances in a multi-human image, which is recently defined as themulti-human parsingtask. In this paper, we first present a new large-scale database "Multi-human Parsing (MHP v2.0)" for algorithm development and evaluation to advance the research on understanding humans in crowded scenes. MHP v2.0 contains 25,403 elaborately annotated images with 58 fine-grained semantic category labels and 16 dense pose key point labels, involving 2-26 persons per image captured in real-world scenes from various viewpoints, poses, occlusion, interactions and background. We further propose a novel deep Nested Adversarial Network (NAN) model for multi-human parsing. NAN consists of three Generative Adversarial Network-like sub-nets, respectively performing semantic saliency prediction, instance-agnostic parsing and instance-aware clustering. These sub-nets form a nested structure and are carefully designed to learn jointly in an end-to-end way. NAN consistently outperforms existing state-of-the-art solutions on our MHP and several other datasets, including MHP v1.0, PASCAL-Person-Part and Buffy. NAN serves as a strong baseline to shed light on generic instance-level semantic part prediction and drive the future research on multi-human parsing. With the above innovations and contributions, we have organized the CVPR 2018 Workshop on Visual Understanding of Humans in Crowd Scene (VUHCS 2018) and the Fine-Grained Multi-human Parsing and Pose Estimation Challenge. These contributions together significantly benefit the community. Code and pre-trained models are available at.
引用
收藏
页码:2185 / 2203
页数:19
相关论文
共 50 条
  • [41] Fine-grained searchable encryption in multi-user setting
    Jun Ye
    Jianfeng Wang
    Jiaolian Zhao
    Jian Shen
    Kuan-Ching Li
    [J]. Soft Computing, 2017, 21 : 6201 - 6212
  • [42] Fine-Grained Multi-Resource Scheduling in Cloud Datacenters
    Zhang, Yuan
    Fu, Xiaoming
    Ramakrishnan, K. K.
    [J]. 2014 IEEE 20TH INTERNATIONAL WORKSHOP ON LOCAL & METROPOLITAN AREA NETWORKS (LANMAN), 2014,
  • [43] ENERGAT: Fine-Grained Energy Attribution for Multi-Tenancy
    He, Hongyu
    Friedman, Michal
    Rekatsinas, Theodoros
    [J]. PROCEEDINGS OF THE 2ND ACM WORKSHOP ON SUSTAINABLE COMPUTER SYSTEMS, HOTCARBON 2023, 2023,
  • [44] The fine-grained complexity of multi-dimensional ordering properties
    An, Haozhe
    Gurumukhani, Mohit
    Impagliazzo, Russell
    Jaber, Michael
    Künnemann, Marvin
    Nina, Maria Paula Parga
    [J]. Leibniz International Proceedings in Informatics, LIPIcs, 2021, 214
  • [45] Towards Fine-Grained Recognition: Joint Learning for Object Detection and Fine-Grained Classification
    Wang, Qiaosong
    Rasmussen, Christopher
    [J]. ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT II, 2019, 11845 : 332 - 344
  • [46] ADAPTIVE MULTI-TASK LEARNING FOR FINE-GRAINED CATEGORIZATION
    Sun, Gang
    Chen, Yanyun
    Liu, Xuehui
    Wu, Enhua
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 996 - 1000
  • [47] The Fine-Grained Complexity of Multi-Dimensional Ordering Properties
    An, Haozhe
    Gurumukhani, Mohit
    Impagliazzo, Russell
    Jaber, Michael
    Kuennemann, Marvin
    Nina, Maria Paula Parga
    [J]. ALGORITHMICA, 2022, 84 (11) : 3156 - 3191
  • [48] Multi-Scale CNN for Fine-Grained Image Recognition
    Won, Chee Sun
    [J]. IEEE ACCESS, 2020, 8 : 116663 - 116674
  • [49] The Fine-Grained Complexity of Multi-Dimensional Ordering Properties
    Haozhe An
    Mohit Gurumukhani
    Russell Impagliazzo
    Michael Jaber
    Marvin Künnemann
    Maria Paula Parga Nina
    [J]. Algorithmica, 2022, 84 : 3156 - 3191
  • [50] Attention-Guided Hierarchical Parsing for Fine-Grained Person-Centric Image Captioning
    Gu, Zhengcheng
    Jin, Jing
    [J]. IEEE ACCESS, 2024, 12 : 86293 - 86301