Multi-party Goal Tracking with LLMs: Comparing Pre-training, Fine-tuning, and Prompt Engineering

Cited by: 0
Authors
Addlesee, Angus [1]
Sieinska, Weronika [1]
Gunson, Nancie [1]
Garcia, Daniel Hernandez [1]
Dondrup, Christian [1]
Lemon, Oliver [1,2,3]
Affiliations
[1] Heriot Watt Univ, Edinburgh, Midlothian, Scotland
[2] Alana AI, London, England
[3] Edinburgh Ctr Robot, Edinburgh, Midlothian, Scotland
Keywords: none listed
DOI: not available
Chinese Library Classification (CLC): TP18 [Artificial Intelligence Theory]
Discipline codes: 081104; 0812; 0835; 1405
Abstract
This paper evaluates the extent to which current Large Language Models (LLMs) can capture task-oriented multi-party conversations (MPCs). We recorded and transcribed 29 MPCs between patients, their companions, and a social robot in a hospital, and then annotated this corpus for multi-party goal tracking and intent-slot recognition. In MPCs, people share goals, answer each other's goals, and provide other people's goals, none of which occurs in dyadic interaction. To understand user goals in MPCs, we compared three methods in zero-shot and few-shot settings, to determine which approach can complete this novel task with limited data: we fine-tuned T5, created pre-training tasks to train DialogLM using LED, and applied prompt engineering techniques with GPT-3.5-turbo. GPT-3.5-turbo significantly outperformed the others in the few-shot setting. The 'reasoning' style prompt, given 7% of the corpus as annotated example conversations, was the best-performing method: it correctly annotated 62.32% of the goal-tracking MPCs and 69.57% of the intent-slot recognition MPCs. A 'story' style prompt increased model hallucination, which could be detrimental if deployed in safety-critical settings. We conclude that multi-party conversations still challenge state-of-the-art LLMs.
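As a concrete illustration of the best-performing condition above (few-shot 'reasoning' style prompting of GPT-3.5-turbo), the Python sketch below shows how such an annotation request might be issued through the OpenAI chat API. It is a minimal sketch only: the prompt wording, the goal(...) annotation schema, the FEW_SHOT_EXAMPLE text, and the annotate_goals helper are all illustrative assumptions, not the authors' actual prompts or label set.

# Minimal sketch of few-shot "reasoning" style prompting for multi-party
# goal tracking. Prompt text, label schema, and helper names below are
# illustrative assumptions, NOT the prompts or annotations used in the paper.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# One hypothetical annotated conversation; the paper's few-shot condition
# supplied about 7% of the corpus as examples.
FEW_SHOT_EXAMPLE = """Conversation:
PATIENT: Where is the X-ray department?
COMPANION: She also needs the pharmacy afterwards.
Reasoning: The patient states their own goal (directions to X-ray). The
companion provides the patient's second goal (pharmacy), a behaviour that
only occurs in multi-party conversation.
Annotation:
goal(PATIENT, request_directions, department=x-ray)
goal(PATIENT, request_directions, department=pharmacy, provided_by=COMPANION)
"""

def annotate_goals(transcript: str) -> str:
    """Ask GPT-3.5-turbo to reason about, then annotate, a new MPC turn."""
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        temperature=0,  # deterministic output is preferable for annotation
        messages=[
            {"role": "system",
             "content": ("You annotate multi-party conversations for goal "
                         "tracking and intent-slot recognition. Write your "
                         "reasoning first, then the annotation.")},
            {"role": "user", "content": FEW_SHOT_EXAMPLE},
            {"role": "user", "content": "Conversation:\n" + transcript},
        ],
    )
    return response.choices[0].message.content

print(annotate_goals("PATIENT: Can you tell my husband where the cafe is?"))

Asking the model to write its reasoning before the annotation, with temperature set to 0, mirrors the abstract's finding that the 'reasoning' style prompt was the most reliable of the prompting variants tested.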
Pages: 229-241
Page count: 13
Related Papers (50 in total)
  • [1] Robust Face Tracking Using Siamese-VGG with Pre-training and Fine-tuning
    Yuan, Shuo
    Yu, Xinguo
    Majid, Abdul
2019 4TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS ENGINEERING (ICCRE), 2019: 170-174
  • [2] SAR-HUB: Pre-Training, Fine-Tuning, and Explaining
    Yang, Haodong
    Kang, Xinyue
    Liu, Long
    Liu, Yujiang
    Huang, Zhongling
    REMOTE SENSING, 2023, 15 (23)
  • [3] AlignDet: Aligning Pre-training and Fine-tuning in Object Detection
    Li, Ming
    Wu, Jie
    Wang, Xionghui
    Chen, Chen
    Qin, Jie
    Xiao, Xuefeng
    Wang, Rui
    Zheng, Min
    Pan, Xin
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 6843-6853
  • [4] Improved Fine-Tuning by Better Leveraging Pre-Training Data
    Liu, Ziquan
    Xu, Yi
    Xu, Yuanhong
    Qian, Qi
    Li, Hao
    Ji, Xiangyang
    Chan, Antoni B.
    Jin, Rong
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [5] EM Pre-training for Multi-party Dialogue Response Generation
    Li, Yiyang
    Zhao, Hai
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023: 92-103
  • [6] Comparing Fine-Tuning and Prompt Engineering for Multi-Class Classification in Hospitality Review Analysis
    Botunac, Ive
    Bakaric, Marija Brkic
    Matetic, Maja
APPLIED SCIENCES-BASEL, 2024, 14 (14)
  • [7] Tri-Train: Automatic Pre-Fine Tuning between Pre-Training and Fine-Tuning for SciNER
    Zeng, Qingkai
    Yu, Wenhao
    Yu, Mengxia
    Jiang, Tianwen
    Weninger, Tim
    Jiang, Meng
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020: 4778-4787
  • [8] Bridging the Gap between Pre-Training and Fine-Tuning for Commonsense Generation
    Yang, Haoran
    Wang, Yan
    Li, Piji
    Bi, Wei
    Lam, Wai
    Xu, Chen
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023: 376-383
  • [9] On the Connection between Pre-training Data Diversity and Fine-tuning Robustness
    Ramanujan, Vivek
    Nguyen, Thao
    Oh, Sewoong
    Schmidt, Ludwig
    Farhadi, Ali
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [10] Pre-training Multi-party Dialogue Models with Latent Discourse Inference
    Li, Yiyang
    Huang, Xinting
    Bi, Wei
    Zhao, Hai
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023: 9584-9599