Understanding Naturalistic Facial Expressions with Deep Learning and Multimodal Large Language Models

被引:4
|
作者
Bian, Yifan [1 ]
Kuester, Dennis [2 ]
Liu, Hui [2 ]
Krumhuber, Eva G. [1 ]
机构
[1] UCL, Dept Expt Psychol, London WC1H 0AP, England
[2] Univ Bremen, Dept Math & Comp Sci, D-28359 Bremen, Germany
关键词
automatic facial expression recognition; naturalistic context; deep learning; multimodal large language model; EMOTION; CONTEXT; FACE; RECOGNITION; DATABASE;
D O I
10.3390/s24010126
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper provides a comprehensive overview of affective computing systems for facial expression recognition (FER) research in naturalistic contexts. The first section presents an updated account of user-friendly FER toolboxes incorporating state-of-the-art deep learning models and elaborates on their neural architectures, datasets, and performances across domains. These sophisticated FER toolboxes can robustly address a variety of challenges encountered in the wild such as variations in illumination and head pose, which may otherwise impact recognition accuracy. The second section of this paper discusses multimodal large language models (MLLMs) and their potential applications in affective science. MLLMs exhibit human-level capabilities for FER and enable the quantification of various contextual variables to provide context-aware emotion inferences. These advancements have the potential to revolutionize current methodological approaches for studying the contextual influences on emotions, leading to the development of contextualized emotion models.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Harnessing multimodal approaches for depression detection using large language models and facial expressions
    Misha Sadeghi
    Robert Richer
    Bernhard Egger
    Lena Schindler-Gmelch
    Lydia Helene Rupp
    Farnaz Rahimi
    Matthias Berking
    Bjoern M. Eskofier
    npj Mental Health Research, 3 (1):
  • [2] Naturalistic multimodal emotion data with deep learning can advance the theoretical understanding of emotion
    Thanakorn Angkasirisan
    Psychological Research, 2025, 89 (1)
  • [3] Shortcut Learning of Large Language Models in Natural Language Understanding
    Du, Mengnan
    He, Fengxiang
    Zou, Na
    Tao, Dacheng
    Hu, Xia
    COMMUNICATIONS OF THE ACM, 2024, 67 (01) : 110 - 120
  • [4] Multimodal large language models for inclusive collaboration learning tasks
    Lewis, Armanda
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2022, : 202 - 210
  • [5] A survey on multimodal large language models
    Shukang Yin
    Chaoyou Fu
    Sirui Zhao
    Ke Li
    Xing Sun
    Tong Xu
    Enhong Chen
    National Science Review, 2024, 11 (12) : 277 - 296
  • [6] Understanding Art Deeply: Sentiment Analysis of Facial Expressions of Graphic Arts Using Deep Learning
    Wang, Fei
    International Journal of Advanced Computer Science and Applications, 2025, 16 (01) : 525 - 534
  • [7] Understanding Deep Learning Techniques for Recognition of Human Emotions Using Facial Expressions: A Comprehensive Survey
    Karnati, Mohan
    Seal, Ayan
    Bhattacharjee, Debotosh
    Yazidi, Anis
    Krejcar, Ondrej
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [8] From Large Language Models to Large Multimodal Models: A Literature Review
    Huang, Dawei
    Yan, Chuan
    Li, Qing
    Peng, Xiaojiang
    APPLIED SCIENCES-BASEL, 2024, 14 (12):
  • [9] The Importance of Understanding Language in Large Language Models
    Youssef, Alaa
    Stein, Samantha
    Clapp, Justin
    Magnus, David
    AMERICAN JOURNAL OF BIOETHICS, 2023, 23 (10): : 6 - 7
  • [10] Multimodal deep learning for multimedia understanding and reasoning
    Multimedia Tools and Applications, 2021, 80 : 17167 - 17167