LEVERAGING MID-LEVEL DEEP REPRESENTATIONS FOR PREDICTING FACE ATTRIBUTES IN THE WILD

Authors
Zhong, Yang [1]
Sullivan, Josephine [1]
Li, Haibo [1]
Affiliations
[1] KTH Royal Inst Technol, Stockholm, Sweden
Keywords
deep learning; mid-level deep representation; face attribute prediction; face recognition
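DOI
Not available
Chinese Library Classification (CLC) codes
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract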
Predicting facial attributes from faces in the wild is very challenging due to pose and lighting variations in the real world. The key to this problem is to build feature representations that cope with these unfavourable conditions. Given the success of Convolutional Neural Networks (CNNs) in image classification, high-level CNN features, as an intuitive and reasonable choice, have been widely used for this problem. In this paper, however, we consider mid-level CNN features as an alternative to the high-level ones for attribute prediction. This is based on the observation that face attributes differ in nature: some are locally oriented while others are globally defined. Our investigations reveal that the mid-level deep representations achieve higher prediction accuracy than the (fine-tuned) high-level abstractions. We empirically demonstrate that the mid-level representations achieve state-of-the-art prediction performance on the CelebA and LFWA datasets. Our investigations also show that, by utilizing the mid-level representations, one can employ a single deep network for both face recognition and attribute prediction.
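
The idea of reading features from an intermediate layer rather than the final one can be illustrated with a minimal sketch. This is not the authors' network: the layer sizes, the random weights, and the names `make_layer` and `forward_with_taps` are hypothetical, and small dense layers stand in for the convolutional layers of a real CNN.

```python
import random


def relu(values):
    """Element-wise ReLU on a list of floats."""
    return [max(0.0, v) for v in values]


def dense(values, weights):
    """Multiply a weight matrix (list of rows) by an input vector."""
    return [sum(w * v for w, v in zip(row, values)) for row in weights]


def make_layer(n_in, n_out, rng):
    """Hypothetical layer: random weights stand in for trained ones."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]


def forward_with_taps(x, layers):
    """Run the stack and keep the activation after every layer,
    so that any depth can be used as a feature representation."""
    activations = []
    for weights in layers:
        x = relu(dense(x, weights))
        activations.append(x)
    return activations


rng = random.Random(0)
sizes = [16, 12, 8, 4]  # toy sizes, assumed for illustration only
layers = [make_layer(a, b, rng) for a, b in zip(sizes, sizes[1:])]
image = [rng.random() for _ in range(sizes[0])]  # stand-in for a face image

acts = forward_with_taps(image, layers)
mid_level = acts[len(acts) // 2]  # features tapped from a middle layer
high_level = acts[-1]             # features from the last layer
print(len(mid_level), len(high_level))
```

In a setup like the one the abstract describes, either feature vector would then be passed to per-attribute classifiers, and the two choices compared on prediction accuracy.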
Pages: 3239-3243
Number of pages: 5