Human Parsing with Contextualized Convolutional Neural Network

被引:128
|
作者
Liang, Xiaodan [1 ,2 ]
Xu, Chunyan [2 ]
Shen, Xiaohui [3 ]
Yang, Jianchao [5 ]
Liu, Si [6 ]
Tang, Jinhui [4 ]
Lin, Liang [1 ]
Yan, Shuicheng [2 ]
机构
[1] Sun Yat Sen Univ, Guangzhou, Guangdong, Peoples R China
[2] Natl Univ Singapore, Singapore 117548, Singapore
[3] Adobe Res, San Jose, CA USA
[4] Nanjing Univ Sci & Technol, Nanjing, Jiangsu, Peoples R China
[5] Snapchat Res, Venice, CA USA
[6] Chinese Acad Sci, Inst Informat Engn, State Key Lab Informat Secur, Beijing 100864, Peoples R China
关键词
D O I
10.1109/ICCV.2015.163
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we address the human parsing task with a novel Contextualized Convolutional Neural Network (Co-CNN) architecture, which well integrates the cross-layer context, global image-level context, within-super-pixel context and cross-super-pixel neighborhood context into a unified network. Given an input human image, Co-CNN produces the pixel-wise categorization in an end-to-end way. First, the cross-layer context is captured by our basic local-to-global-to-local structure, which hierarchically combines the global semantic information and the local fine details across different convolutional layers. Second, the global image-level label prediction is used as an auxiliary objective in the intermediate layer of the Co-CNN, and its outputs are further used for guiding the feature learning in subsequent convolutional layers to leverage the global image-level context. Finally, to further utilize the local super-pixel contexts, the within-super-pixel smoothing and cross-super-pixel neighbourhood voting are formulated as natural sub-components of the Co-CNN to achieve the local label consistency in both training and testing process. Comprehensive evaluations on two public datasets well demonstrate the significant superiority of our Co-CNN over other state-of-the-arts for human parsing. In particular, the F-1 score on the large dataset [15] reaches 76.95% by Co-CNN, significantly higher than 62.81% and 64.38% by the state-of-theart algorithms, M-CNN [21] and ATR [15], respectively.
引用
收藏
页码:1386 / 1394
页数:9
相关论文
共 50 条
  • [41] Differential convolutional neural network
    Sarigul, M.
    Ozyildirim, B. M.
    Avci, M.
    NEURAL NETWORKS, 2019, 116 : 279 - 287
  • [42] Biomedical Event Extraction Using Convolutional Neural Networks and Dependency Parsing
    Bjorne, Jari
    Salakoski, Tapio
    SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2018), 2018, : 98 - 108
  • [43] Very High Resolution Image Semantic Segmentation with Contextualized Convolutional Neural Network Coupled with Higher Order Conditional Random Fields
    Mei, Tiancan
    Ji, Hong
    Zheng, Wenyuan
    He, Saixian
    2019 IEEE 15TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2019, : 1506 - 1510
  • [44] Research on Human Face Location Based on Adaboost and Convolutional Neural Network
    Wang Wenzhi
    Wang Lin
    Zhang Ruliang
    Huang Chengquan
    Rao Yan
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2017), 2017, : 311 - 315
  • [45] Human and object detection using Hybrid Deep Convolutional Neural Network
    Mukilan, P.
    Semunigus, Wogderess
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (07) : 1913 - 1923
  • [46] Digital twin for human-machine interaction with convolutional neural network
    Wang, Tian
    Li, Jiakun
    Deng, Yingjun
    Wang, Chuang
    Snoussi, Hichem
    Tao, Fei
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2021, 34 (7-8) : 888 - 897
  • [47] Human activity recognition using temporal convolutional neural network architecture
    Andrade-Ambriz, Yair A.
    Ledesma, Sergio
    Ibarra-Manzano, Mario-Alberto
    Oros-Flores, Marvella, I
    Almanza-Ojeda, Dora-Luz
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
  • [48] Dynamic emotion recognition of human face based on convolutional neural network
    Xu, Lanbo
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2024, 16 (05) : 533 - 551
  • [49] An improved human activity recognition technique based on convolutional neural network
    Raj, Ravi
    Kos, Andrzej
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [50] Video spatiotemporal mapping for human action recognition by convolutional neural network
    Zare, Amin
    Abrishami Moghaddam, Hamid
    Sharifi, Arash
    PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (01) : 265 - 279