FCGNet: Foreground and Class Guided Network for human parsing

被引:0
|
作者
Jang, Jaehyuk [1 ]
Wang, Yooseung [1 ]
Kim, Changick [1 ]
机构
[1] Korea Adv Inst Sci & Technol KAIST, Sch Elect Engn, Daejeon 34141, South Korea
关键词
Human parsing; Semantic segmentation; Graph convolutional network;
D O I
10.1016/j.patcog.2024.110879
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding the inherent hierarchical human structure is key to human parsing. To capture the human- specific characteristic, it is necessary to focus on the spatial and class information corresponding to the foreground (i.e., human) in an image. Inspired by these insights, we introduce two supervision signals, spatial foreground information and existent class information in the image. By utilizing foreground information as guidance, the network is guided to generate a human-focused feature map and capture the pixel-wise hierarchical characteristics by computing correlations between pixels. Furthermore, we guide the network to consider class information in the image at the feature level and capture the class-wise relationship by calculating correlations between channels. Moreover, during the training phase, we prevent the network from misclassifying pixels into confusing classes by providing the existent class information in the image to the network at the prediction level. Our model achieves state-of-the-art performance with significantly reduced parameters and Multiply-Accumulate Operations (MACs) in three public benchmarks.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] CDGNet: Class Distribution Guided Network for Human Parsing
    Liu, Kunliang
    Choi, Ouk
    Wang, Jianming
    Hwang, Wonjun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4463 - 4472
  • [2] FEANet: Foreground-edge-aware network with DenseASPOC for human parsing
    Yu, Wing-Yin
    Po, Lai-Man
    Zhao, Yuzhi
    Zhang, Yujia
    Lau, Kin-Wai
    IMAGE AND VISION COMPUTING, 2021, 109
  • [3] Mask-Guided Deformation Adaptive Network for Human Parsing
    Mao, Aihua
    Liang, Yuan
    Jiao, Jianbo
    Liu, Yongtuo
    He, Shengfeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)
  • [4] Boundary-guided part reasoning network for human parsing
    Su, Zhuo
    Guan, Huiqiang
    Lai, Yuntian
    Zhou, Fan
    Liang, Yun
    NEUROCOMPUTING, 2023, 561
  • [5] Attention-guided Progressive Partition Network for Human Parsing
    Huang, Xi
    He, Chengkun
    Shao, Jie
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [6] Foreground Mask Guided Network for Crowd Counting
    Li, Chun
    Shang, Lin
    Xu, Suping
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 322 - 334
  • [7] Multi-class Human Body Parsing with Edge-Enhancement Network
    Huang, Xi
    Wu, Keyu
    Hu, Gang
    Shao, Jie
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT IV, 2019, 1142 : 466 - 477
  • [8] Optical Flow Estimation with Foreground Attention Guided Network
    Hou, Dongdong
    Sun, Gan
    2021 5TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE (ICIAI 2021), 2021, : 42 - 48
  • [9] GCAENet: global-class context with advanced edge network for single human parsing
    Zhang, Xiukun
    Liu, Weibin
    Xing, Weiwei
    Wei, Xiang
    VISUAL COMPUTER, 2023, 39 (12): : 6379 - 6394
  • [10] GCAENet: global-class context with advanced edge network for single human parsing
    Xiukun Zhang
    Weibin Liu
    Weiwei Xing
    Xiang Wei
    The Visual Computer, 2023, 39 : 6379 - 6394