Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT

被引:0
|
作者
Song, Xiaoshuai [1 ]
He, Keqing [2 ]
Wang, Pei [1 ]
Dong, Guanting [1 ]
Mou, Yutao [1 ]
Wang, Jingang [2 ]
Xiang, Yunsen [2 ]
Cai, Xunliang [2 ]
Xu, Weiran [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Meituan, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The tasks of out-of-domain (OOD) intent discovery and generalized intent discovery (GID) aim to extend a closed intent classifier to open-world intent sets, which is crucial to task-oriented dialogue (TOD) systems. Previous methods address them by fine-tuning discriminative models. Recently, although some studies have been exploring the application of large language models (LLMs) represented by ChatGPT to various downstream tasks, it is still unclear for the ability of ChatGPT to discover and incrementally extent OOD intents. In this paper, we comprehensively evaluate ChatGPT on OOD intent discovery and GID, and then outline the strengths and weaknesses of ChatGPT. Overall, ChatGPT exhibits consistent advantages under zero-shot settings, but is still at a disadvantage compared to fine-tuned models. More deeply, through a series of analytical experiments, we summarize and discuss the challenges faced by LLMs including clustering, domain-specific understanding, and cross-domain in-context learning scenarios. Finally, we provide empirical guidance for future directions to address these challenges.
引用
收藏
页码:10291 / 10304
页数:14
相关论文
共 50 条
  • [31] Evaluation of large language models for discovery of gene set function
    Hu, Mengzhou
    Alkhairy, Sahar
    Lee, Ingoo
    Pillich, Rudolf T.
    Fong, Dylan
    Smith, Kevin
    Bachelder, Robin
    Ideker, Trey
    Pratt, Dexter
    NATURE METHODS, 2025, 22 (01) : 82 - 91
  • [32] Empowering Molecule Discovery for Molecule-Caption Translation With Large Language Models: A ChatGPT Perspective
    Li, Jiatong
    Liu, Yunqing
    Fan, Wenqi
    Wei, Xiao-Yong
    Liu, Hui
    Tang, Jiliang
    Li, Qing
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 6071 - 6083
  • [33] Rethinking Knowledge Graph Evaluation Under the Open-World Assumption
    Yang, Haotong
    Lin, Zhouchen
    Zhang, Muhan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [34] HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
    Chen, Chen
    Hu, Yuchen
    Yang, Chao-Han Huck
    Siniscalchi, Sabato Marco
    Chen, Pin-Yu
    Chng, Eng Siong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [35] Leveraging Open Large Language Models for Historical Named Entity Recognition
    Gonzalez-Gallardo, Carlos-Emiliano
    Hanh Thi Hong Tran
    Hamdi, Ahmed
    Doucet, Antoine
    LINKING THEORY AND PRACTICE OF DIGITAL LIBRARIES, PT I, TPDL 2024, 2024, 15177 : 379 - 395
  • [36] CTR: Contrastive Training Recognition Classifier for Few-Shot Open-World Recognition
    Dionelis, Nikolaos
    Tsaftaris, Sotirios A.
    Yaghoobi, Mehrdad
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1792 - 1799
  • [37] Quo Vadis ChatGPT? From large language models to Large Knowledge Models
    Venkatasubramanian, Venkat
    Chakraborty, Arijit
    COMPUTERS & CHEMICAL ENGINEERING, 2025, 192
  • [38] How does ChatGPT 'think'? Psychology and neuroscience crack open AI large language models
    Hutson, Matthew
    NATURE, 2024, 629 (8014) : 986 - 988
  • [39] Large Language Models for Automated Open-domain Scientific Hypotheses Discovery
    Yang, Zonglin
    Du, Xinya
    Li, Junxian
    Zheng, Jie
    Poria, Soujanya
    Cambria, Erik
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13545 - 13565
  • [40] ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning
    Lai, Viet Dac
    Nguyen, Nghia Trung
    Ben Veyseh, Amir Pouran
    Man, Hieu
    Dernoncourt, Franck
    Bu, Trung
    Nguyen, Thien Huu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 13171 - 13189