SwinFace: A Multi-Task Transformer for Face Recognition, Expression Recognition, Age Estimation and Attribute Estimation

Cited by: 4
Authors
Qin, Lixiong [1]
Wang, Mei [1]
Deng, Chao [2]
Wang, Ke [2]
Chen, Xi [2]
Hu, Jiani [1]
Deng, Weihong [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] China Mobile Res Inst, Beijing 100053, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Multi-task learning; Swin Transformer; face recognition; facial expression recognition; age estimation; face attribute estimation; REPRESENTATION; IMAGE;
DOI
10.1109/TCSVT.2023.3304724
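Chinese Library Classification
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification codes
0808; 0809
Abstract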
In recent years, vision transformers have been introduced into face recognition and analysis and have achieved performance breakthroughs. However, most previous methods train a single model or an ensemble of models for each desired task, which ignores the synergy among tasks and forgoes the improved prediction accuracy, increased data efficiency, and reduced training time that this synergy can bring. This paper presents a multi-purpose algorithm for simultaneous face recognition, facial expression recognition, age estimation, and face attribute estimation (40 attributes, including gender) based on a single Swin Transformer. The design, SwinFace, consists of a single shared backbone together with a subnet for each set of related tasks. To address conflicts among tasks and meet their different demands, a Multi-Level Channel Attention (MLCA) module is integrated into each task-specific analysis subnet; it adaptively selects features from the most suitable levels and channels for the task at hand. Extensive experiments show that the proposed model has a better understanding of the face and achieves excellent performance on all tasks. In particular, it achieves 90.97% accuracy on RAF-DB and an $\epsilon$-error of 0.22 on CLAP2015, which are state-of-the-art results for facial expression recognition and age estimation, respectively.
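The shared-backbone design described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: `toy_backbone`, `ChannelAttentionHead`, the stage shapes, the reduction ratio, and the squeeze-and-excitation style gate are all assumptions chosen for illustration, and the face recognition branch is omitted. The sketch only shows the general pattern of one feature extractor feeding several task-specific subnets, each of which reweights the channels of features pooled from multiple levels.

```python
import numpy as np

rng = np.random.default_rng(0)


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def toy_backbone(images):
    """Hypothetical stand-in for the shared Swin Transformer.

    Returns one random feature map per stage, shaped (batch, tokens, channels).
    The (tokens, channels) pairs are toy values, not the paper's configuration.
    """
    batch = images.shape[0]
    stage_shapes = [(64, 8), (16, 16), (4, 32), (4, 64)]
    return [rng.normal(size=(batch, t, c)) for t, c in stage_shapes]


class ChannelAttentionHead:
    """Hypothetical task subnet: fuse multi-level features, gate channels, predict."""

    def __init__(self, level_dims, num_outputs, reduction=4):
        channels = sum(level_dims)
        hidden = max(channels // reduction, 1)
        self.w1 = rng.normal(0.0, 0.1, size=(channels, hidden))
        self.w2 = rng.normal(0.0, 0.1, size=(hidden, channels))
        self.w_out = rng.normal(0.0, 0.1, size=(channels, num_outputs))

    def __call__(self, level_feats):
        # Average over tokens so every level contributes a (batch, channels_l) vector.
        pooled = [f.mean(axis=1) for f in level_feats]
        fused = np.concatenate(pooled, axis=1)  # (batch, C)
        # Bottleneck MLP with ReLU, then sigmoid: one weight in (0, 1) per channel.
        gate = sigmoid(np.maximum(fused @ self.w1, 0.0) @ self.w2)
        return (fused * gate) @ self.w_out, gate


images = np.zeros((2, 3, 112, 112))  # dummy batch of two face crops
features = toy_backbone(images)      # computed once, shared by all heads
level_dims = [f.shape[-1] for f in features]

# Output sizes: 40 attributes comes from the abstract; the others are assumptions.
heads = {
    "expression": ChannelAttentionHead(level_dims, 7),
    "age": ChannelAttentionHead(level_dims, 1),
    "attributes": ChannelAttentionHead(level_dims, 40),
}

outputs, gates = {}, {}
for name, head in heads.items():
    outputs[name], gates[name] = head(features)
    print(name, outputs[name].shape)
```

Because each head owns its gate weights, two tasks can emphasize different levels and channels of the same shared features, which is the intuition behind placing an attention module in every task-specific subnet.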
Pages: 2223-2234
Number of pages: 12