LEVERAGING MID-LEVEL DEEP REPRESENTATIONS FOR PREDICTING FACE ATTRIBUTES IN THE WILD

被引：0

作者：

Zhong, Yang ^{[1
]}

Sullivan, Josephine ^{[1
]}

Li, Haibo ^{[1
]}

机构：

[1] KTH Royal Inst Technol, Stockholm, Sweden

来源：

2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2016年

关键词：

deep learning; mid-level deep representation; face attribute prediction; face recognition;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Predicting facial attributes from faces in the wild is very challenging due to pose and lighting variations in the real world. The key to this problem is to build proper feature representations to cope with these unfavourable conditions. Given the success of Convolutional Neural Network (CNN) in image classification, the high-level CNN feature, as an intuitive and reasonable choice, has been widely utilized for this problem. In this paper, however, we consider the mid-level CNN features as an alternative to the high-level ones for attribute prediction. This is based on the observation that face attributes are different: some of them are locally oriented while others are globally defined. Our investigations reveal that the mid-level deep representations outperform the prediction accuracy achieved by the (fine-tuned) high-level abstractions. We empirically demonstrate that the mid-level representations achieve state-of-the-art prediction performance on CelebA and LFWA datasets. Our investigations also show that by utilizing the mid-level representations one can employ a single deep network to achieve both face recognition and attribute prediction.

引用

页码：3239 / 3243

页数：5

共 50 条

[1] EGOCENTRIC ACTIVITY RECOGNITION BY LEVERAGING MULTIPLE MID-LEVEL REPRESENTATIONS
Hsieh, Peng-Ju
Tin, Yen-Hang
Chen, Yu-Hsiu
Hsu, Winston
2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
[2] LET THEM CHOOSE WHAT THEY WANT: A MULTI-TASK CNN ARCHITECTURE LEVERAGING MID-LEVEL DEEP REPRESENTATIONS FOR FACE ATTRIBUTE CLASSIFICATION
Chen, Zhenduo
Liu, Feng
Zhao, Zhenglai
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 879 - 883
[3] Unsupervised learning of mid-level visual representations
Matteucci, Giulio
Piasini, Eugenio
Zoccolan, Davide
CURRENT OPINION IN NEUROBIOLOGY, 2024, 84
[4] Understanding mid-level representations in visual processing
Peirce, Jonathan W.
JOURNAL OF VISION, 2015, 15 (07):
[5] Mid-level Deep Pattern Mining
Li, Yao
Liu, Lingqiao
Shen, Chunhua
van den Hengel, Anton
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 971 - 980
[6] Investigation of Factorized Optical Flows as Mid-Level Representations
Yang, Hsuan-Kung
Hsiao, Tsu-Ching
Liao, Ting-Hsuan
Liu, Hsu-Shen
Tsao, Li-Yuan
Wang, Tzu-Wen
Yang, Shan-Ya
Chen, Yu-Wen
Liao, Huang-Ru
Lee, Chun-Yi
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 746 - 753
[7] Deep Learning Face Attributes in the Wild
Liu, Ziwei
Luo, Ping
Wang, Xiaogang
Tang, Xiaoou
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3730 - 3738
[8] Image Super-resolution Using Mid-level Representations
Yang, Li
Wang, Yaxing
Mu, Xiaomin
Wang, Yaping
2016 INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING AND COMMUNICATIONS TECHNOLOGY (IECT 2016), 2016, : 291 - 296
[9] Object Pose Estimation using Mid-level Visual Representations
Nejatishahidin, Negar
Fayyazsanavi, Pooya
Kosecka, Jana
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 13105 - 13111
[10] Scene Text Identification by Leveraging Mid-level Patches and Context Information
Wang, Runmin
Sang, Nong
Gao, Changxin
IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (07) : 963 - 967

← 1 2 3 4 5 →