Human Parsing with Contextualized Convolutional Neural Network

Cited by: 128
Authors
Liang, Xiaodan [1, 2]
Xu, Chunyan [2]
Shen, Xiaohui [3]
Yang, Jianchao [5]
Liu, Si [6]
Tang, Jinhui [4]
Lin, Liang [1]
Yan, Shuicheng [2]
Affiliations
[1] Sun Yat Sen Univ, Guangzhou, Guangdong, Peoples R China
[2] Natl Univ Singapore, Singapore 117548, Singapore
[3] Adobe Res, San Jose, CA USA
[4] Nanjing Univ Sci & Technol, Nanjing, Jiangsu, Peoples R China
[5] Snapchat Res, Venice, CA USA
[6] Chinese Acad Sci, Inst Informat Engn, State Key Lab Informat Secur, Beijing 100864, Peoples R China
DOI
10.1109/ICCV.2015.163
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work, we address the human parsing task with a novel Contextualized Convolutional Neural Network (Co-CNN) architecture, which integrates cross-layer context, global image-level context, within-super-pixel context and cross-super-pixel neighborhood context into a unified network. Given an input human image, Co-CNN produces a pixel-wise categorization in an end-to-end way. First, the cross-layer context is captured by our basic local-to-global-to-local structure, which hierarchically combines the global semantic information and the local fine details across different convolutional layers. Second, the global image-level label prediction is used as an auxiliary objective in an intermediate layer of the Co-CNN, and its outputs are further used to guide feature learning in subsequent convolutional layers, leveraging the global image-level context. Finally, to further utilize the local super-pixel contexts, within-super-pixel smoothing and cross-super-pixel neighborhood voting are formulated as natural sub-components of the Co-CNN to achieve local label consistency during both training and testing. Comprehensive evaluations on two public datasets demonstrate that Co-CNN significantly outperforms other state-of-the-art methods for human parsing. In particular, the F-1 score on the large dataset [15] reaches 76.95% with Co-CNN, significantly higher than the 62.81% and 64.38% achieved by the state-of-the-art algorithms M-CNN [21] and ATR [15], respectively.
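The abstract does not spell out how within-super-pixel smoothing is computed. As a minimal sketch, assuming it amounts to replacing each pixel's class confidences with the mean over all pixels in the same super-pixel, the idea might look like the following. The function name `smooth_within_superpixels` and the nested-list data layout are hypothetical choices for illustration, not taken from the paper.

```python
from collections import defaultdict


def smooth_within_superpixels(confidences, superpixels):
    """Average per-pixel class confidences inside each super-pixel.

    Assumption (not from the abstract): smoothing is a plain mean.
    confidences: H x W x C nested lists of per-class scores.
    superpixels: H x W nested lists of integer super-pixel ids.
    Returns a new H x W x C nested list.
    """
    sums = {}
    counts = defaultdict(int)

    # Accumulate per-class sums and pixel counts for every super-pixel.
    for conf_row, sp_row in zip(confidences, superpixels):
        for conf, sp in zip(conf_row, sp_row):
            if sp not in sums:
                sums[sp] = [0.0] * len(conf)
            for c, value in enumerate(conf):
                sums[sp][c] += value
            counts[sp] += 1

    # Mean confidence vector per super-pixel.
    means = {sp: [s / counts[sp] for s in total] for sp, total in sums.items()}

    # Every pixel receives the mean vector of its own super-pixel.
    return [[list(means[sp]) for sp in sp_row] for sp_row in superpixels]


# Toy 2 x 2 image with two classes and two super-pixels (one per row).
toy_confidences = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.2, 0.8], [0.4, 0.6]],
]
toy_superpixels = [
    [0, 0],
    [1, 1],
]
smoothed = smooth_within_superpixels(toy_confidences, toy_superpixels)
```

After smoothing, all pixels in a super-pixel share one confidence vector, so taking the arg-max per pixel yields a label map that is constant inside each super-pixel. That is the local label consistency the abstract refers to, under the stated averaging assumption.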
Pages: 1386-1394
Page count: 9
Related papers
50 in total
  • [31] Human action recognition using a modified convolutional neural network
    Kim, Ho-Joon
    Lee, Joseph S.
    Yang, Hyun-Seung
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 2, PROCEEDINGS, 2007, 4492 : 715 - +
  • [32] Human Activity Recognition Using Multichannel Convolutional Neural Network
    Sikder, Niloy
    Chowdhury, Md Sanaullah
    Arif, Abu Shamim Mohammad
    Nahid, Abdullah-Al
    2019 5TH INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2019, : 560 - 565
  • [33] Scene Parsing Using Fully Convolutional Network for Semantic Segmentation
    Ali, Nisar
    Ijaz, Ali Zeeshan
    Ali, Raja Hashim
    Ul Abideen, Zain
    Bais, Abdul
    2023 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE, 2023,
  • [34] Occlusion-aware deep convolutional neural network via homogeneous Tanh-transforms for face parsing
    Qiu, Jianhua
    Liu, Weihua
    Lin, Chaochao
    Li, Jiaojiao
    Yu, Haoping
    Boumaraf, Said
    IMAGE AND VISION COMPUTING, 2024, 148
  • [35] Deep Convolutional Neural Network
    Zhou, Yu
    Fang, Rui
    Liu, Peng
    Liu, Kai
    2019 PROCEEDINGS OF THE CONFERENCE ON CONTROL AND ITS APPLICATIONS, CT, 2019, : 46 - 51
  • [36] Understanding of a Convolutional Neural Network
    Albawi, Saad
    Mohammed, Tareq Abed
    Al-Zawi, Saad
    2017 INTERNATIONAL CONFERENCE ON ENGINEERING AND TECHNOLOGY (ICET), 2017,
  • [37] Dendritic convolutional neural network
    Wang, Rong-Long
    Lei, Zhenyu
    Zhang, Zhiming
    Gao, Shangce
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2022, 17 (02) : 302 - 304
  • [38] Clickbait Convolutional Neural Network
    Zheng, Hai-Tao
    Chen, Jin-Yuan
    Yao, Xin
    Sangaiah, Arun Kumar
    Jiang, Yong
    Zhao, Cong-Zhi
    SYMMETRY-BASEL, 2018, 10 (05):
  • [39] Denoising Convolutional Neural Network
    Xu, Qingyang
    Zhang, Chengjin
    Zhang, Li
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 1184 - 1187
  • [40] FocusedDropout for Convolutional Neural Network
    Liu, Minghui
    Xie, Tianshu
    Cheng, Xuan
    Deng, Jiali
    Yang, Meiyi
    Wang, Xiaomin
    Liu, Ming
    APPLIED SCIENCES-BASEL, 2022, 12 (15):