The Emotional Intelligence of the GPT-4 Large Language Model

Cited by: 1
Authors
Vzorin, Gleb D. [1, 2]
Bukinich, Alexey M. [1, 3]
Sedykh, Anna V. [1]
Vetrova, Irina I. [2]
Sergienko, Elena A. [2]
Affiliations
[1] Lomonosov Moscow State Univ, Moscow, Russia
[2] Russian Acad Sci, Inst Psychol, Moscow, Russia
[3] Fed Sci Ctr Psychol & Interdisciplinary Res, Moscow, Russia
Source
PSYCHOLOGY IN RUSSIA-STATE OF THE ART | 2024 / Vol. 17 / No. 02
Keywords
artificial empathy; artificial psychology; ChatGPT; emotional intelligence (EI); emotional quotient (EQ); GPT-4; machine behavior
DOI
10.11621/pir.2024.0206
Chinese Library Classification
B84 [Psychology]
Discipline classification code
04; 0402
Abstract
Background. Advanced AI models such as the large language model GPT-4 demonstrate sophisticated intellectual capabilities, sometimes exceeding human intellectual performance. However, the emotional competency of these models, along with their underlying mechanisms, has not been sufficiently evaluated. Objective. Our research aimed to explore different emotional intelligence domains in GPT-4 according to the Mayer-Salovey-Caruso model. We also tried to find out whether GPT-4's answer accuracy is consistent with its explanation of the answer. Design. Sections of the Russian version of the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT) were used in this research, with questions asked as text prompts in separate, independent ChatGPT chats three times each. Results. High scores were achieved by the GPT-4 Large Language Model on the Understanding Emotions scale (with scores of 117, 124, and 128 across the three runs) and the Strategic Emotional Intelligence scale (with scores of 118, 121, and 122). Average scores were obtained on the Managing Emotions scale (103, 108, and 110 points). However, the Using Emotions to Facilitate Thought scale yielded low and less reliable scores (85, 86, and 88 points). Four types of explanations for the answer choices were identified: Meaningless sentences; Relation declaration; Implicit logic; and Explicit logic. Correct answers were accompanied by all types of explanations, whereas incorrect answers were only followed by Meaningless sentences or Explicit logic. This distribution aligns with observed patterns in children when they explore and elucidate mental states. Conclusion. GPT-4 is capable of emotion identification and managing emotions, but it lacks deep reflexive analysis of emotional experience and the motivational aspect of emotions.
Pages: 85-99
Page count: 15