Audiovisual Facial Action Unit Recognition using Feature Level Fusion

被引:4
|
作者
Meng, Zibo [1 ]
Han, Shizhong [1 ]
Chen, Min [2 ]
Tong, Yan [1 ]
机构
[1] Univ South Carolina, Columbia, SC 29208 USA
[2] Univ Washington, Bothell, WA USA
基金
美国国家科学基金会;
关键词
Action Units; Convolutional Neural Network; Facial Action Unit Recognition; Facial Activity; Feature-Level Information Fusion;
D O I
10.4018/IJMDEM.2016010104
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recognizing facial actions is challenging, especially when they are accompanied with speech. Instead of employing information solely from the visual channel, this work aims to exploit information from both visual and audio channels in recognizing speech-related facial action units (AUs). In this work, two feature-level fusion methods are proposed. The first method is based on a kind of human-crafted visual feature. The other method utilizes visual features learned by a deep convolutional neural network (CNN). For both methods, features are independently extracted from visual and audio channels and aligned to handle the difference in time scales and the time shift between the two signals. These temporally aligned features are integrated via feature-level fusion for AU recognition. Experimental results on a new audiovisual AU-coded dataset have demonstrated that both fusion methods outperform their visual counterparts in recognizing speech-related AUs. The improvement is more impressive with occlusions on the facial images, which would not affect the audio channel.
引用
收藏
页码:60 / 76
页数:17
相关论文
共 50 条
  • [1] Feature Level Fusion for Bimodal Facial Action Unit Recognition
    Meng, Zibo
    Han, Shizhong
    Chen, Min
    Tong, Yan
    [J]. 2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 471 - 476
  • [2] Improving Speech Related Facial Action Unit Recognition by Audiovisual Information Fusion
    Meng, Zibo
    Han, Shizhong
    Liu, Ping
    Tong, Yan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (09) : 3293 - 3306
  • [3] Facial expression recognition using feature level fusion
    Jain, Vanita
    Lamba, Puneet Singh
    Singh, Bhanu
    Namboothiri, Narayanan
    Dhall, Shafali
    [J]. JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2019, 22 (02): : 337 - 350
  • [4] Feature and Decision Level Fusion for Action Recognition
    Abouelenien, Mohamed
    Wan, Yiwen
    Saudagar, Abdullah
    [J]. 2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION & NETWORKING TECHNOLOGIES (ICCCNT), 2012,
  • [5] Feature-level and Model-level Audiovisual Fusion for Emotion Recognition in the Wild
    Cai, Jie
    Meng, Zibo
    Khan, Ahmed Shehab
    Li, Zhiyuan
    O'Reilly, James
    Han, Shizhong
    Liu, Ping
    Chen, Min
    Tong, Yan
    [J]. 2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 443 - 448
  • [6] Action Recognition Based on Feature-level Fusion
    Cheng, Wanli
    Chen, Enqing
    [J]. TENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2018), 2018, 10806
  • [7] Texture and shape information fusion for facial expression and facial action unit recognition
    Kotsia, Irene
    Zafeiriou, Stefanos
    Pitas, Loannis
    [J]. PATTERN RECOGNITION, 2008, 41 (03) : 833 - 851
  • [8] Multi-level Feature Fusion Facial Expression Recognition Network
    Hu, Qian
    Wu, Chengdong
    Chi, Jianning
    Yu, Xiaosheng
    Wang, Huan
    [J]. PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 5267 - 5272
  • [9] Facial action unit recognition using temporal templates
    Valstar, M
    Patras, I
    Pantic, M
    [J]. RO-MAN 2004: 13TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, PROCEEDINGS, 2004, : 253 - 258
  • [10] Decision Level Fusion of Domain Specific Regions for Facial Action Recognition
    Jiang, Bihan
    Martinez, Brais
    Valstar, Michel F.
    Pantic, Maja
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1776 - 1781