Understanding Naturalistic Facial Expressions with Deep Learning and Multimodal Large Language Models

被引:4
|
作者
Bian, Yifan [1 ]
Kuester, Dennis [2 ]
Liu, Hui [2 ]
Krumhuber, Eva G. [1 ]
机构
[1] UCL, Dept Expt Psychol, London WC1H 0AP, England
[2] Univ Bremen, Dept Math & Comp Sci, D-28359 Bremen, Germany
关键词
automatic facial expression recognition; naturalistic context; deep learning; multimodal large language model; EMOTION; CONTEXT; FACE; RECOGNITION; DATABASE;
D O I
10.3390/s24010126
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper provides a comprehensive overview of affective computing systems for facial expression recognition (FER) research in naturalistic contexts. The first section presents an updated account of user-friendly FER toolboxes incorporating state-of-the-art deep learning models and elaborates on their neural architectures, datasets, and performances across domains. These sophisticated FER toolboxes can robustly address a variety of challenges encountered in the wild such as variations in illumination and head pose, which may otherwise impact recognition accuracy. The second section of this paper discusses multimodal large language models (MLLMs) and their potential applications in affective science. MLLMs exhibit human-level capabilities for FER and enable the quantification of various contextual variables to provide context-aware emotion inferences. These advancements have the potential to revolutionize current methodological approaches for studying the contextual influences on emotions, leading to the development of contextualized emotion models.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Large Language Models Are Zero-Shot Fuzzers: Fuzzing Deep-Learning Libraries via Large Language Models
    Deng, Yinlin
    Xia, Chunqiu Steven
    Peng, Haoran
    Yang, Chenyuan
    Zhan, Lingming
    PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 423 - 435
  • [22] Identifying Human Emotions from Facial Expressions with Deep Learning
    Babajee, Phavish
    Suddul, Geerish
    Armoogum, Sandhya
    Foogooa, Ravi
    2020 ZOOMING INNOVATION IN CONSUMER TECHNOLOGIES CONFERENCE (ZINC), 2020, : 36 - 39
  • [23] Assessing Deep Learning Approaches in Detecting Masked Facial Expressions
    Wang, Yepu
    Wang, Yiting
    Zhong, Chuyue
    Zhao, Yijun
    2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 994 - 997
  • [24] Prototypical Contrastive Transfer Learning for Multimodal Language Understanding
    Otsuki, Seitaro
    Ishikawa, Shintaro
    Sugiura, Komei
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 25 - 32
  • [25] Understanding Telecom Language Through Large Language Models
    Bariah, Lina
    Zou, Hang
    Zhao, Qiyang
    Mouhouche, Belkacem
    Bader, Faouzi
    Debbah, Merouane
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6542 - 6547
  • [26] Interpreting Deep Learning Models for Multimodal Neuroimaging
    Mueller, K. R.
    Hofmann, S. M.
    2023 11TH INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE, BCI, 2023,
  • [27] MultiModal Language Modelling on Knowledge Graphs for Deep Video Understanding
    Anand, Vishal
    Ramesh, Raksha
    Jin, Boshen
    Wang, Ziyin
    Lei, Xiaoxiao
    Lin, Ching-Yung
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4868 - 4872
  • [28] Multimodal Analysis for Deep Video Understanding with Video Language Transformer
    Zhang, Beibei
    Fang, Yaqun
    Ren, Tongwei
    Wu, Gangshan
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 7165 - 7169
  • [29] Enhancing masked facial expression recognition with multimodal deep learning
    Shahzad, H. M.
    Bhatti, Sohail Masood
    Jaffar, Arfan
    Akram, Sheeraz
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 73911 - 73921
  • [30] Implications of Multimodal Learning Models for foreign language teaching and learning
    Farias, Miguel
    Obilinovic, Katica
    Orrego, Roxana
    COLOMBIAN APPLIED LINGUISTICS JOURNAL, 2007, 9 : 174 - 199