Multi-stream CNN for facial expression recognition in limited training data

Cited by: 23
Authors
Aghamaleki, Javad Abbasi [1]
Chenarlogh, Vahid Ashkani [2]
Affiliations
[1] Damghan Univ, Fac Engn Dept, Damghan, Iran
[2] Islamic Azad Univ, Sci & Res Branch, ECE Dept, Tehran, Iran
Keywords
Facial expression recognition; Convolutional neural network; Limited data; Multi-stream structure; FACE;
DOI
10.1007/s11042-019-7530-7
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Limited training data makes it difficult to train convolutional neural networks (CNNs), and acquiring a database at the required scale is not a straightforward task. This paper proposes handcrafted features combined with a multi-stream structure to improve CNN performance when data is limited. Three handcrafted features are extracted, using a local binary pattern (LBP) code extractor and the Sobel edge detection operator in the horizontal and vertical directions of the images, and are fed to the multi-stream CNN model. The model is based on two distinct structures: a three-stream structure and a single-stream structure. The three-stream structure can improve the recognition rate of facial expression classifiers when the training data is limited; in it, each information channel is fed to its own separate stream. Transfer learning is also applied, and the behaviour of the VGG16 architecture trained with limited data is studied for comparison with the proposed method. In addition, the input data is expanded by rotation, cropping, and flipping, and the three-stream and single-stream structures are examined with both the limited and the expanded training data. The system is evaluated against state-of-the-art methods on the CK+ and MUG databases in both the limited-data and expanded-data settings. The results indicate that, with limited data, the proposed strategy improves recognition accuracy (92.19 versus 88.95 on the CK+ database and 85.4 versus 82.5 on the MUG database). Performance also improved in comparison with benchmark methods.
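As an illustration of the three handcrafted inputs named in the abstract, the following is a minimal NumPy sketch, not the authors' implementation. The abstract does not state the LBP variant, the Sobel kernel size, or any preprocessing, so the basic 8-neighbour LBP, the 3x3 Sobel kernels, the edge padding, and the function names `filter3x3`, `lbp8`, and `handcrafted_streams` are all assumptions made for this example.

```python
import numpy as np

# Standard 3x3 Sobel kernels (assumed; the abstract gives no kernel size).
SOBEL_X = np.array([[-1, 0, 1],
                    [-2, 0, 2],
                    [-1, 0, 1]], dtype=np.float64)  # responds to horizontal intensity change
SOBEL_Y = SOBEL_X.T                                  # responds to vertical intensity change


def filter3x3(img, kernel):
    """Cross-correlate a 2-D image with a 3x3 kernel, edge-padded to keep the size."""
    h, w = img.shape
    padded = np.pad(img.astype(np.float64), 1, mode="edge")
    out = np.zeros((h, w), dtype=np.float64)
    for dy in range(3):
        for dx in range(3):
            out += kernel[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def lbp8(img):
    """Basic 8-neighbour, radius-1 LBP code: one bit per neighbour >= centre."""
    h, w = img.shape
    padded = np.pad(img.astype(np.float64), 1, mode="edge")
    center = padded[1:1 + h, 1:1 + w]
    # Neighbour offsets into the padded image, clockwise from top-left.
    offsets = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]
    code = np.zeros((h, w), dtype=np.uint8)
    for bit, (dy, dx) in enumerate(offsets):
        neighbour = padded[dy:dy + h, dx:dx + w]
        code |= (neighbour >= center).astype(np.uint8) * np.uint8(1 << bit)
    return code


def handcrafted_streams(img):
    """Return the three feature maps (LBP, Sobel-x, Sobel-y), one per hypothetical stream."""
    return lbp8(img), filter3x3(img, SOBEL_X), filter3x3(img, SOBEL_Y)


if __name__ == "__main__":
    # Toy 5x5 image with a vertical step edge, standing in for a grayscale face crop.
    toy = np.zeros((5, 5))
    toy[:, 3:] = 10.0
    lbp_map, gx, gy = handcrafted_streams(toy)
    print(lbp_map.shape, gx.shape, gy.shape)
```

In a three-stream setup of the kind the abstract describes, each of the three returned maps would be passed to its own CNN stream; how the streams are built and fused is not specified in the abstract and is not sketched here.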
Pages: 22861-22882
Page count: 22
Related papers
50 in total
  • [1] Multi-stream CNN for facial expression recognition in limited training data
    Javad Abbasi Aghamaleki
    Vahid Ashkani Chenarlogh
    Multimedia Tools and Applications, 2019, 78 : 22861 - 22882
  • [2] A Multi-View Human Action recognition System in Limited Data case using multi-stream CNN
    Chenarlogh, Vahid Ashkani
    Razzazi, Farbod
    Mohammadyahya, Najmeh
    2019 5TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS 2019), 2019,
  • [3] Multi-stream 3D CNN structure for human action recognition trained by limited data
    Chenarlogh, Vahid Ashkani
    Razzazi, Farbod
    IET COMPUTER VISION, 2019, 13 (03) : 338 - 344
  • [4] Fusing multi-stream deep neural networks for facial expression recognition
    Fatima Zahra Salmam
    Abdellah Madani
    Mohamed Kissi
    Signal, Image and Video Processing, 2019, 13 : 609 - 616
  • [5] Fusing multi-stream deep neural networks for facial expression recognition
    Zahra Salmam, Fatima
    Madani, Abdellah
    Kissi, Mohamed
    SIGNAL IMAGE AND VIDEO PROCESSING, 2019, 13 (03) : 609 - 616
  • [6] Multimodal Egocentric Activity Recognition Using Multi-stream CNN
    Imran, Javed
    Raman, Balasubramanian
    ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018), 2018,
  • [7] End-to-End Speech Recognition Technology Based on Multi-Stream CNN
    Xiao, Hao
    Qiu, Yuan
    Fei, Rong
    Chen, Xiongbo
    Liu, Zuo
    Wu, Zongling
    2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1310 - 1315
  • [8] Multi-Stream Deep Convolution Neural Network With Ensemble Learning for Facial Micro-Expression Recognition
    Perveen, Gulnaz
    Ali, Syed Farooq
    Ahmad, Jameel
    Shahab, Sana
    Adnan, Muhammad
    Anjum, Mohd
    Khosa, Ikramullah
    IEEE ACCESS, 2023, 11 : 118474 - 118489
  • [9] Driving behaviour recognition from still images by using multi-stream fusion CNN
    Yaocong Hu
    Mingqi Lu
    Xiaobo Lu
    Machine Vision and Applications, 2019, 30 : 851 - 865
  • [10] Driving behaviour recognition from still images by using multi-stream fusion CNN
    Hu, Yaocong
    Lu, Mingqi
    Lu, Xiaobo
    MACHINE VISION AND APPLICATIONS, 2019, 30 (05) : 851 - 865