LEVERAGING MID-LEVEL DEEP REPRESENTATIONS FOR PREDICTING FACE ATTRIBUTES IN THE WILD

被引:0
|
作者
Zhong, Yang [1 ]
Sullivan, Josephine [1 ]
Li, Haibo [1 ]
机构
[1] KTH Royal Inst Technol, Stockholm, Sweden
关键词
deep learning; mid-level deep representation; face attribute prediction; face recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Predicting facial attributes from faces in the wild is very challenging due to pose and lighting variations in the real world. The key to this problem is to build proper feature representations to cope with these unfavourable conditions. Given the success of Convolutional Neural Network (CNN) in image classification, the high-level CNN feature, as an intuitive and reasonable choice, has been widely utilized for this problem. In this paper, however, we consider the mid-level CNN features as an alternative to the high-level ones for attribute prediction. This is based on the observation that face attributes are different: some of them are locally oriented while others are globally defined. Our investigations reveal that the mid-level deep representations outperform the prediction accuracy achieved by the (fine-tuned) high-level abstractions. We empirically demonstrate that the mid-level representations achieve state-of-the-art prediction performance on CelebA and LFWA datasets. Our investigations also show that by utilizing the mid-level representations one can employ a single deep network to achieve both face recognition and attribute prediction.
引用
收藏
页码:3239 / 3243
页数:5
相关论文
共 50 条
  • [1] EGOCENTRIC ACTIVITY RECOGNITION BY LEVERAGING MULTIPLE MID-LEVEL REPRESENTATIONS
    Hsieh, Peng-Ju
    Tin, Yen-Hang
    Chen, Yu-Hsiu
    Hsu, Winston
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [2] LET THEM CHOOSE WHAT THEY WANT: A MULTI-TASK CNN ARCHITECTURE LEVERAGING MID-LEVEL DEEP REPRESENTATIONS FOR FACE ATTRIBUTE CLASSIFICATION
    Chen, Zhenduo
    Liu, Feng
    Zhao, Zhenglai
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 879 - 883
  • [3] Unsupervised learning of mid-level visual representations
    Matteucci, Giulio
    Piasini, Eugenio
    Zoccolan, Davide
    CURRENT OPINION IN NEUROBIOLOGY, 2024, 84
  • [4] Understanding mid-level representations in visual processing
    Peirce, Jonathan W.
    JOURNAL OF VISION, 2015, 15 (07):
  • [5] Mid-level Deep Pattern Mining
    Li, Yao
    Liu, Lingqiao
    Shen, Chunhua
    van den Hengel, Anton
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 971 - 980
  • [6] Investigation of Factorized Optical Flows as Mid-Level Representations
    Yang, Hsuan-Kung
    Hsiao, Tsu-Ching
    Liao, Ting-Hsuan
    Liu, Hsu-Shen
    Tsao, Li-Yuan
    Wang, Tzu-Wen
    Yang, Shan-Ya
    Chen, Yu-Wen
    Liao, Huang-Ru
    Lee, Chun-Yi
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 746 - 753
  • [7] Deep Learning Face Attributes in the Wild
    Liu, Ziwei
    Luo, Ping
    Wang, Xiaogang
    Tang, Xiaoou
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3730 - 3738
  • [8] Image Super-resolution Using Mid-level Representations
    Yang, Li
    Wang, Yaxing
    Mu, Xiaomin
    Wang, Yaping
    2016 INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING AND COMMUNICATIONS TECHNOLOGY (IECT 2016), 2016, : 291 - 296
  • [9] Object Pose Estimation using Mid-level Visual Representations
    Nejatishahidin, Negar
    Fayyazsanavi, Pooya
    Kosecka, Jana
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 13105 - 13111
  • [10] Scene Text Identification by Leveraging Mid-level Patches and Context Information
    Wang, Runmin
    Sang, Nong
    Gao, Changxin
    IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (07) : 963 - 967