Facial Emotion Recognition Using Transfer Learning in the Deep CNN

被引:87
|
作者
Akhand, M. A. H. [1 ]
Roy, Shuvendu [1 ]
Siddique, Nazmul [2 ]
Kamal, Md Abdus Samad [3 ]
Shimamura, Tetsuya [4 ]
机构
[1] Khulna Univ Engn & Technol, Dept Comp Sci & Engn, Khulna 9203, Bangladesh
[2] Ulster Univ, Sch Comp Engn & Intelligent Syst, Coleraine BT48 7JL, Londonderry, North Ireland
[3] Gunma Univ, Grad Sch Sci & Technol, Kiryu, Gumma 3768515, Japan
[4] Saitama Univ, Grad Sch Sci & Engn, Saitama 3388570, Japan
关键词
convolutional neural network (CNN); deep CNN; emotion recognition; transfer learning; CONVOLUTIONAL NEURAL-NETWORKS; EXPRESSION RECOGNITION; ARCHITECTURES; FACE;
D O I
10.3390/electronics10091036
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human facial emotion recognition (FER) has attracted the attention of the research community for its promising applications. Mapping different facial expressions to the respective emotional states are the main task in FER. The classical FER consists of two major steps: feature extraction and emotion recognition. Currently, the Deep Neural Networks, especially the Convolutional Neural Network (CNN), is widely used in FER by virtue of its inherent feature extraction mechanism from images. Several works have been reported on CNN with only a few layers to resolve FER problems. However, standard shallow CNNs with straightforward learning schemes have limited feature extraction capability to capture emotion information from high-resolution images. A notable drawback of the most existing methods is that they consider only the frontal images (i.e., ignore profile views for convenience), although the profile views taken from different angles are important for a practical FER system. For developing a highly accurate FER system, this study proposes a very Deep CNN (DCNN) modeling through Transfer Learning (TL) technique where a pre-trained DCNN model is adopted by replacing its dense upper layer(s) compatible with FER, and the model is fine-tuned with facial emotion data. A novel pipeline strategy is introduced, where the training of the dense layer(s) is followed by tuning each of the pre-trained DCNN blocks successively that has led to gradual improvement of the accuracy of FER to a higher level. The proposed FER system is verified on eight different pre-trained DCNN models (VGG-16, VGG-19, ResNet-18, ResNet-34, ResNet-50, ResNet-152, Inception-v3 and DenseNet-161) and well-known KDEF and JAFFE facial image datasets. FER is very challenging even for frontal views alone. FER on the KDEF dataset poses further challenges due to the diversity of images with different profile views together with frontal views. The proposed method achieved remarkable accuracy on both datasets with pre-trained models. On a 10-fold cross-validation way, the best achieved FER accuracies with DenseNet-161 on test sets of KDEF and JAFFE are 96.51% and 99.52%, respectively. The evaluation results reveal the superiority of the proposed FER system over the existing ones regarding emotion detection accuracy. Moreover, the achieved performance on the KDEF dataset with profile views is promising as it clearly demonstrates the required proficiency for real-life applications.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Facial emotion recognition using deep quantum and advanced transfer learning mechanism
    Alsubai, Shtwai
    Alqahtani, Abdullah
    Alanazi, Abed
    Sha, Mohemmed
    Gumaei, Abdu
    [J]. Frontiers in Computational Neuroscience, 2024, 18
  • [2] Facial emotion recognition based on deep transfer learning approach
    Aziza Sultana
    Samrat Kumar Dey
    Md. Armanur Rahman
    [J]. Multimedia Tools and Applications, 2023, 82 : 44175 - 44189
  • [3] Facial emotion recognition based on deep transfer learning approach
    Sultana, Aziza
    Dey, Samrat Kumar
    Rahman, Md. Armanur
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (28) : 44175 - 44189
  • [4] Facial emotion recognition and music recommendation system using CNN-based deep learning techniques
    Brijesh Bakariya
    Arshdeep Singh
    Harmanpreet Singh
    Pankaj Raju
    Rohit Rajpoot
    Krishna Kumar Mohbey
    [J]. Evolving Systems, 2024, 15 : 641 - 658
  • [5] Facial emotion recognition and music recommendation system using CNN-based deep learning techniques
    Bakariya, Brijesh
    Singh, Arshdeep
    Singh, Harmanpreet
    Raju, Pankaj
    Rajpoot, Rohit
    Mohbey, Krishna Kumar
    [J]. EVOLVING SYSTEMS, 2024, 15 (02) : 641 - 658
  • [6] Deep Learning Model for Facial Emotion Recognition
    Pathak, Ajeet Ram
    Bhalsing, Somesh
    Desai, Shivani
    Gandhi, Monica
    Patwardhan, Pranathi
    [J]. PROCEEDINGS OF ICETIT 2019: EMERGING TRENDS IN INFORMATION TECHNOLOGY, 2020, 605 : 543 - 558
  • [7] Deep Residual Learning for Facial Emotion Recognition
    Mishra, Sagar
    Joshi, Basanta
    Paudyal, Rajendra
    Chaulagain, Duryodhan
    Shakya, Subarna
    [J]. MOBILE COMPUTING AND SUSTAINABLE INFORMATICS, 2022, 68 : 301 - 313
  • [8] Virtual facial expression recognition using deep CNN with ensemble learning
    Chirra, Venkata Rami Reddy
    Uyyala, Srinivasulu Reddy
    Kolli, Venkata Krishna Kishore
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (12) : 10581 - 10599
  • [9] Virtual facial expression recognition using deep CNN with ensemble learning
    Venkata Rami Reddy Chirra
    Srinivasulu Reddy Uyyala
    Venkata Krishna Kishore Kolli
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 10581 - 10599
  • [10] Facial emotion recognition using Handcrafted features and CNN
    Gautam, Chahak
    Seeja, K.R.
    [J]. Procedia Computer Science, 2022, 218 : 1295 - 1303