Adversarial Learning of Task-Oriented Neural Dialog Models

被引:0
|
作者
Liu, Bing [1 ]
Lane, Ian [2 ]
机构
[1] Carnegie Mellon Univ, Elect & Comp Engn, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Elect & Comp Engn, Language Technol Inst, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose an adversarial learning method for reward estimation in reinforcement learning (RL) based task-oriented dialog models. Most of the current RL based task-oriented dialog systems require the access to a reward signal from either user feedback or user ratings. Such user ratings, however, may not always be consistent or available in practice. Furthermore, online dialog policy learning with RL typically requires a large number of queries to users, suffering from sample efficiency problem. To address these challenges, we propose an adversarial learning method to learn dialog rewards directly from dialog samples. Such rewards are further used to optimize the dialog policy with policy gradient based RL. In the evaluation in a restaurant search domain, we show that the proposed adversarial dialog learning method achieves advanced dialog success rate comparing to strong baseline methods. We further discuss the covariate shift problem in online adversarial dialog learning and show how we can address that with partial access to user feedback.
引用
收藏
页码:350 / 359
页数:10
相关论文
共 50 条
  • [41] Span-based Hierarchical Semantic Parsing for Task-Oriented Dialog
    Pasupae, Panupong
    Gupta, Sonal
    Mandyam, Karishma
    Shah, Rushin
    Lewis, Mike
    Zettlemoyer, Luke
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1520 - 1526
  • [42] SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
    Kottur, Satwik
    Moon, Seungwhan
    Geramifard, Alborz
    Damavandi, Babak
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 4903 - 4912
  • [43] Joint Reasoning on Hybrid-knowledge sources for Task-Oriented Dialog
    Mishra, Mayank
    Contractor, Danish
    Raghu, Dinesh
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1778 - 1787
  • [44] Contextual Dynamic Prompting for Response Generation in Task-oriented Dialog Systems
    Swamy, Sandesh
    Tabari, Narges
    Chen, Chacha
    Gangadharaiah, Rashmi
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 3102 - 3111
  • [45] Combining Open Domain Question Answering with a Task-Oriented Dialog System
    Nehring, Jan
    Feldhus, Nils
    Ahmed, Akhyar
    Kaur, Harleen
    1ST WORKSHOP ON DOCUMENT-GROUNDED DIALOGUE AND CONVERSATIONAL QUESTION ANSWERING (DIALDOC 2021), 2021, : 38 - 45
  • [46] An Incremental Turn-Taking Model For Task-Oriented Dialog Systems
    Coman, Andrei C.
    Yoshino, Koichiro
    Murase, Yukitoshi
    Nakamura, Satoshi
    Riccardi, Giuseppe
    INTERSPEECH 2019, 2019, : 4155 - 4159
  • [47] Actionable conversational quality indicators for improving task-oriented dialog systems
    Higgins, Michael
    Widdows, Dominic
    Hockey, Beth Ann
    Hazare, Akshay
    Howell, Kristen
    Christian, Gwen
    Mathi, Sujit
    Brew, Chris
    Maurer, Andrew
    Bonev, George
    Dunn, Matthew
    Bradley, Joseph
    NATURAL LANGUAGE ENGINEERING, 2024, 30 (06) : 1229 - 1254
  • [48] Few-shot Natural Language Generation for Task-Oriented Dialog
    Peng, Baolin
    Zhu, Chenguang
    Li, Chunyuan
    Li, Xiujun
    Li, Jinchao
    Zeng, Michael
    Gao, Jianfeng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 172 - 182
  • [49] A code-mixed task-oriented dialog dataset for medical domain
    Dowlagar, Suman
    Mamidi, Radhika
    COMPUTER SPEECH AND LANGUAGE, 2023, 78
  • [50] DS-TOD: Efficient Domain Specialization for Task-Oriented Dialog
    Hung, Chia-Chien
    Lauscher, Anne
    Ponzetto, Simone Paolo
    Glavas, Goran
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 891 - 904