Leveraging Large Language Models for Automated Dialogue Analysis

Cited by: 0
Authors
Finch, Sarah E. [1]
Paek, Ellie S. [1]
Choi, Jinho D. [1]
Affiliation
[1] Emory Univ, Dept Comp Sci, Atlanta, GA 30322 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Developing high-performing dialogue systems benefits from the automatic identification of undesirable behaviors in system responses. However, detecting such behaviors remains challenging, as it draws on a breadth of general knowledge and understanding of conversational practices. Although recent research has focused on building specialized classifiers for detecting specific dialogue behaviors, the behavior coverage is still incomplete and there is a lack of testing on real-world human-bot interactions. This paper investigates the ability of a state-of-the-art large language model (LLM), ChatGPT-3.5, to perform dialogue behavior detection for nine categories in real human-bot dialogues. We aim to assess whether ChatGPT can match specialized models and approximate human performance, thereby reducing the cost of behavior detection tasks. Our findings reveal that neither specialized models nor ChatGPT have yet achieved satisfactory results for this task, falling short of human performance. Nevertheless, ChatGPT shows promising potential and often outperforms specialized detection models. We conclude with an in-depth examination of the prevalent shortcomings of ChatGPT, offering guidance for future research to enhance LLM capabilities.
Pages: 202-215
Page count: 14