Advancing Faithfulness of Large Language Models in Goal-Oriented Dialogue Question Answering

被引:0
|
作者
Sticha, Abigail [1 ]
Braunschweiler, Norbert [2 ]
Doddipatla, Rama [2 ]
Knill, Kate [3 ]
机构
[1] Univ Cambridge, Cambridge, Cambs, England
[2] Toshiba Europe Ltd, Cambridge, Cambs, England
[3] Univ Cambridge, Dept Engn, Cambridge, Cambs, England
关键词
Dialogue system; Question-answering; Large Language Models; Knowledge retrieval; BENCHMARK;
D O I
10.1145/3640794.3665573
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Goal-oriented dialogue systems, such as assistant chatbots and conversational AI systems, have gained prominence for their question-answering capabilities, often utilizing large language models (LLMs) as knowledge bases. However, these systems face limitations when knowledge outside their intrinsic scope is required. In this paper we address these limitations by designing more faithful and useful systems that can accurately respond to users based on external information. Guided by reference-free evaluation metrics instead of traditional word-overlap metrics, we present two novel methods to prompt LLMs which surpass the baselines in accuracy, linguistic quality, and faithfulness. The first method employs a reranking technique using LLMs to rank document relevance without the need for fine-tuning. The second system builds upon the ReAct framework by incorporating a self-reflection mechanism, ensuring answers are grounded in retrieved content. Overall, our methods advance few-shot prompting as a way to learn to condition on external evidence, and significantly reduce hallucinations.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Evaluating Goal Models within the Goal-Oriented Requirement Language
    Amyot, Daniel
    Ghanavati, Sepideh
    Horkoff, Jennifer
    Mussbacher, Gunter
    Peyton, Liam
    Yu, Eric
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2010, 25 (08) : 841 - 877
  • [2] Sasha: Creative Goal-Oriented Reasoning in Smart Homes with Large Language Models
    King, Evan
    Yu, Haoxiang
    Lee, Sangsu
    Julien, Christine
    [J]. PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2024, 8 (01):
  • [3] Analyzing Semantic Faithfulness of Language Models via Input Intervention on Question Answering
    Chaturvedi, Akshay
    Bhar, Swarnadeep
    Saha, Soumadeep
    Garain, Utpal
    Asher, Nicholas
    [J]. COMPUTATIONAL LINGUISTICS, 2023, 50 (01) : 119 - 155
  • [4] Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue
    Matsumori, Shoya
    Shingyouchi, Kosuke
    Abe, Yuki
    Fukuchi, Yosuke
    Sugiura, Komei
    Imai, Michita
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1878 - 1887
  • [5] The GOAL Approach A Goal-Oriented Algebraic Language
    Dantan, Jerome
    Pollet, Yann
    Taibi, Salima
    [J]. ENASE: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING, 2013, : 173 - 180
  • [6] AirDialogue: An Environment for Goal-Oriented Dialogue Research
    Wei, Wei
    Le, Quoc, V
    Dai, Andrew M.
    Li, Li-Jia
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3844 - 3854
  • [7] Reasoning with large language models for medical question answering
    Lucas, Mary M.
    Yang, Justin
    Pomeroy, Jon K.
    Yang, Christopher C.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (09)
  • [8] Enhancing Biomedical Question Answering with Large Language Models
    Yang, Hua
    Li, Shilong
    Goncalves, Teresa
    [J]. INFORMATION, 2024, 15 (08)
  • [9] Towards a Unified Goal-Oriented Language
    Patricio, Pedro
    Amaral, Vasco
    Araujo, Joao
    Monteiro, Rui
    [J]. 2011 35TH IEEE ANNUAL INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2011, : 596 - 601
  • [10] Goal-Oriented Dialogue Policy Learning from Failures
    Lu, Keting
    Zhang, Shiqi
    Chen, Xiaoping
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 2596 - 2603