Human Parsing with Contextualized Convolutional Neural Network

Cited by: 128
Authors
Liang, Xiaodan [1, 2]
Xu, Chunyan [2]
Shen, Xiaohui [3]
Yang, Jianchao [5]
Liu, Si [6]
Tang, Jinhui [4]
Lin, Liang [1]
Yan, Shuicheng [2]
Affiliations
[1] Sun Yat Sen Univ, Guangzhou, Guangdong, Peoples R China
[2] Natl Univ Singapore, Singapore 117548, Singapore
[3] Adobe Res, San Jose, CA USA
[4] Nanjing Univ Sci & Technol, Nanjing, Jiangsu, Peoples R China
[5] Snapchat Res, Venice, CA USA
[6] Chinese Acad Sci, Inst Informat Engn, State Key Lab Informat Secur, Beijing 100864, Peoples R China
DOI
10.1109/ICCV.2015.163
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we address the human parsing task with a novel Contextualized Convolutional Neural Network (Co-CNN) architecture, which integrates the cross-layer context, global image-level context, within-super-pixel context and cross-super-pixel neighborhood context into a unified network. Given an input human image, Co-CNN produces the pixel-wise categorization in an end-to-end way. First, the cross-layer context is captured by our basic local-to-global-to-local structure, which hierarchically combines the global semantic information and the local fine details across different convolutional layers. Second, the global image-level label prediction is used as an auxiliary objective in the intermediate layer of the Co-CNN, and its outputs are further used to guide the feature learning in subsequent convolutional layers, thereby leveraging the global image-level context. Finally, to further utilize the local super-pixel contexts, the within-super-pixel smoothing and cross-super-pixel neighborhood voting are formulated as natural sub-components of the Co-CNN to achieve local label consistency in both training and testing. Comprehensive evaluations on two public datasets demonstrate the significant superiority of our Co-CNN over other state-of-the-art methods for human parsing. In particular, the F-1 score on the large dataset [15] reaches 76.95% by Co-CNN, significantly higher than the 62.81% and 64.38% achieved by the state-of-the-art algorithms M-CNN [21] and ATR [15], respectively.
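As a rough illustration of the within-super-pixel smoothing idea mentioned in the abstract, the sketch below averages per-pixel class scores over all pixels that share a super-pixel id. The abstract does not specify the exact formulation, so the plain averaging, the function name `smooth_within_superpixels`, and the nested-list data layout are all assumptions made for illustration, not the authors' implementation.

```python
from collections import defaultdict


def smooth_within_superpixels(scores, superpixels):
    """Replace each pixel's class scores with the mean over its super-pixel.

    scores:      H x W x C nested lists of per-pixel class scores (assumed layout).
    superpixels: H x W nested lists of integer super-pixel ids (assumed layout).
    """
    sums = {}                  # super-pixel id -> per-class score totals
    counts = defaultdict(int)  # super-pixel id -> number of pixels

    # Accumulate class-score totals and pixel counts per super-pixel.
    for score_row, id_row in zip(scores, superpixels):
        for pixel_scores, sp_id in zip(score_row, id_row):
            if sp_id not in sums:
                sums[sp_id] = [0.0] * len(pixel_scores)
            for c, value in enumerate(pixel_scores):
                sums[sp_id][c] += value
            counts[sp_id] += 1

    # Mean score vector of each super-pixel.
    means = {sp: [s / counts[sp] for s in total] for sp, total in sums.items()}

    # Every pixel receives a copy of the mean score vector of its super-pixel.
    return [[list(means[sp_id]) for sp_id in id_row] for id_row in superpixels]


# Toy 2 x 2 image with two classes and two super-pixels (one per row).
toy_scores = [[[1.0, 0.0], [0.0, 1.0]],
              [[1.0, 0.0], [1.0, 0.0]]]
toy_superpixels = [[0, 0],
                   [1, 1]]
toy_smoothed = smooth_within_superpixels(toy_scores, toy_superpixels)
```

In the toy input, the two pixels of super-pixel 0 disagree, so both end up with the same averaged score vector, while super-pixel 1 is already consistent and is left unchanged.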
Pages: 1386-1394
Page count: 9
Related papers
50 in total
  • [1] Human Parsing with Contextualized Convolutional Neural Network
    Liang, Xiaodan
    Xu, Chunyan
    Shen, Xiaohui
    Yang, Jianchao
    Tang, Jinhui
    Lin, Liang
    Yan, Shuicheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (01) : 115 - 127
  • [2] Convolutional Neural Network with Contextualized Word Embedding for Text Classification
    Fan, Gaoyang
    Zhu, Cui
    Zhu, Wenjun
    2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2019, 11321
  • [3] Character-Based Parsing with Convolutional Neural Network
    Zheng, Xiaoqing
    Peng, Haoyuan
    Chen, Yi
    Zhang, Pengjing
    Zhang, Wenqiang
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1054 - 1060
  • [4] Probabilistic Graph-based Dependency Parsing with Convolutional Neural Network
    Zhang, Zhisong
    Zhao, Hai
    Qin, Lianhui
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 1382 - 1392
  • [5] Exploring Contextualized Neural Language Models for Temporal Dependency Parsing
    Ross, Hayley
    Cai, Jonathon
    Min, Bonan
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8548 - 8553
  • [6] Fully Convolutional Neural Network with Relation Aware Context Information for Image Parsing
    Azam, Basim
    Mandal, Ranju
    Verma, Brijesh
    2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 127 - 132
  • [7] Multilabel learning based adaptive graph convolutional network for human parsing
    Hao, Huaqing
    Liu, Weibin
    Xing, Weiwei
    Zhang, Shunli
    PATTERN RECOGNITION, 2022, 127
  • [8] Convolutional Network for Plantar Pressure Parsing
    Bai, Tau
    Yang, Xianjun
    Yao, Zhiming
    Zhou, Xu
    Wu, Haixiao
    Li, Bochen
    Wang, Jianguo
    Sun, Yining
    2021 14TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2021), 2021
  • [9] Interlinked Convolutional Neural Networks for Face Parsing
    Zhou, Yisu
    Hu, Xiaolin
    Zhang, Bo
    ADVANCES IN NEURAL NETWORKS - ISNN 2015, 2015, 9377 : 222 - 231
  • [10] Relation Parsing Neural Network for Human-Object Interaction Detection
    Zhou, Penghao
    Chi, Mingmin
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 843 - 851