Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT

被引:0
|
作者
Song, Xiaoshuai [1 ]
He, Keqing [2 ]
Wang, Pei [1 ]
Dong, Guanting [1 ]
Mou, Yutao [1 ]
Wang, Jingang [2 ]
Xiang, Yunsen [2 ]
Cai, Xunliang [2 ]
Xu, Weiran [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Meituan, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The tasks of out-of-domain (OOD) intent discovery and generalized intent discovery (GID) aim to extend a closed intent classifier to open-world intent sets, which is crucial to task-oriented dialogue (TOD) systems. Previous methods address them by fine-tuning discriminative models. Recently, although some studies have been exploring the application of large language models (LLMs) represented by ChatGPT to various downstream tasks, it is still unclear for the ability of ChatGPT to discover and incrementally extent OOD intents. In this paper, we comprehensively evaluate ChatGPT on OOD intent discovery and GID, and then outline the strengths and weaknesses of ChatGPT. Overall, ChatGPT exhibits consistent advantages under zero-shot settings, but is still at a disadvantage compared to fine-tuned models. More deeply, through a series of analytical experiments, we summarize and discuss the challenges faced by LLMs including clustering, domain-specific understanding, and cross-domain in-context learning scenarios. Finally, we provide empirical guidance for future directions to address these challenges.
引用
收藏
页码:10291 / 10304
页数:14
相关论文
共 50 条
  • [41] ChatGPT and large language models in academia: opportunities and challenges
    Jesse G. Meyer
    Ryan J. Urbanowicz
    Patrick C. N. Martin
    Karen O’Connor
    Ruowang Li
    Pei-Chen Peng
    Tiffani J. Bright
    Nicholas Tatonetti
    Kyoung Jae Won
    Graciela Gonzalez-Hernandez
    Jason H. Moore
    BioData Mining, 16
  • [42] Modular Behavior Trees: Language for Fast AI in Open-World Video Games
    Plch, Tomas
    Marko, Matej
    Ondracek, Petr
    Cerny, Martin
    Gemrot, Jakub
    Brom, Cyril
    21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 1209 - +
  • [43] ChatGPT and large language models in academia: opportunities and challenges
    Meyer, Jesse G.
    Urbanowicz, Ryan J.
    Martin, Patrick C. N.
    O'Connor, Karen
    Li, Ruowang
    Peng, Pei-Chen
    Bright, Tiffani J.
    Tatonetti, Nicholas
    Won, Kyoung Jae
    Gonzalez-Hernandez, Graciela
    Moore, Jason H.
    BIODATA MINING, 2023, 16 (01)
  • [44] Robust Visual Recognition with Class-Imbalanced Open-World Noisy Data
    Zhao, Na
    Lee, Gim Hee
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16989 - 16997
  • [45] Can ChatGPT Support Developers? An Empirical Evaluation of Large Language Models for Code Generation
    Jin, Kailun
    Wang, Chung-Yu
    Hung Viet Pham
    Hemmati, Hadi
    2024 IEEE/ACM 21ST INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR, 2024, : 167 - 171
  • [46] Implications of large language models such as ChatGPT for dental medicine
    Eggmann, Florin
    Weiger, Roland
    Zitzmann, Nicola U.
    Blatz, Markus B.
    JOURNAL OF ESTHETIC AND RESTORATIVE DENTISTRY, 2023, 35 (07) : 1098 - 1102
  • [47] Exploring Large Language Models in Intent Acquisition and Translation
    Fontana, Mattia
    Martini, Barbara
    Sciarrone, Filippo
    2024 IEEE 10TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION, NETSOFT 2024, 2024, : 231 - 234
  • [48] Exploring Capabilities of Large Language Models such as ChatGPT in Radiation
    Dennstadt, Fabio
    Hastings, Janna
    Putora, Paul Martin
    Vu, Erwin
    Fischer, Galina F.
    Suveg, Krisztian
    Glatzer, Markus
    Riggenbach, Elena
    Ha, Hong-Linh
    Cihoric, Nikola
    ADVANCES IN RADIATION ONCOLOGY, 2024, 9 (03)
  • [49] SAR Target Recognition via Random Sampling Combination in Open-World Environments
    Geng, Xiaojing
    Dong, Ganggang
    Xia, Ziheng
    Liu, Hongwei
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 (331-343) : 331 - 343
  • [50] JARVIS-1: Open-World Multi-Task Agents With Memory-Augmented Multimodal Language Models
    Wang, Zihao
    Cai, Shaofei
    Liu, Anji
    Jin, Yonggang
    Hou, Jinbing
    Zhang, Bowei
    Lin, Haowei
    He, Zhaofeng
    Zheng, Zilong
    Yang, Yaodong
    Ma, Xiaojian
    Liang, Yitao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (03) : 1894 - 1907