Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT

Citations: 0

Authors:

Song, Xiaoshuai [1]

He, Keqing [2]

Wang, Pei [1]

Dong, Guanting [1]

Mou, Yutao [1]

Wang, Jingang [2]

Xiang, Yunsen [2]

Cai, Xunliang [2]

Xu, Weiran [1]

Affiliations:

[1] Beijing University of Posts and Telecommunications, Beijing, People's Republic of China

[2] Meituan, Beijing, People's Republic of China

Source:

2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023) | 2023

Keywords:

DOI: Not available

Chinese Library Classification number:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The tasks of out-of-domain (OOD) intent discovery and generalized intent discovery (GID) aim to extend a closed intent classifier to open-world intent sets, which is crucial to task-oriented dialogue (TOD) systems. Previous methods address them by fine-tuning discriminative models. Although recent studies have explored applying large language models (LLMs), represented by ChatGPT, to various downstream tasks, it remains unclear how well ChatGPT can discover and incrementally extend OOD intents. In this paper, we comprehensively evaluate ChatGPT on OOD intent discovery and GID, and then outline its strengths and weaknesses. Overall, ChatGPT exhibits consistent advantages under zero-shot settings, but is still at a disadvantage compared to fine-tuned models. Going further, through a series of analytical experiments, we summarize and discuss the challenges LLMs face, including clustering, domain-specific understanding, and cross-domain in-context learning scenarios. Finally, we provide empirical guidance for future directions to address these challenges.

Pages: 10291-10304

Number of pages: 14
