Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT

Authors
Song, Xiaoshuai [1]
He, Keqing [2]
Wang, Pei [1]
Dong, Guanting [1]
Mou, Yutao [1]
Wang, Jingang [2]
Xiang, Yunsen [2]
Cai, Xunliang [2]
Xu, Weiran [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Meituan, Beijing, Peoples R China
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The tasks of out-of-domain (OOD) intent discovery and generalized intent discovery (GID) aim to extend a closed intent classifier to open-world intent sets, which is crucial to task-oriented dialogue (TOD) systems. Previous methods address them by fine-tuning discriminative models. Although recent studies have explored applying large language models (LLMs), represented by ChatGPT, to various downstream tasks, ChatGPT's ability to discover and incrementally extend OOD intents remains unclear. In this paper, we comprehensively evaluate ChatGPT on OOD intent discovery and GID, and then outline its strengths and weaknesses. Overall, ChatGPT exhibits consistent advantages under zero-shot settings, but is still at a disadvantage compared to fine-tuned models. Going further, through a series of analytical experiments, we summarize and discuss the challenges faced by LLMs, including clustering, domain-specific understanding, and cross-domain in-context learning scenarios. Finally, we provide empirical guidance for future directions to address these challenges.
Pages: 10291-10304
Number of pages: 14