Goal-Oriented Dialogue Policy Learning from Failures

被引:0
|
作者
Lu, Keting [1 ]
Zhang, Shiqi [2 ]
Chen, Xiaoping [1 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci, Hefei, Anhui, Peoples R China
[2] SUNY Binghamton, Dept Comp Sci, Binghamton, NY USA
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning methods have been used for learning dialogue policies. However, learning an effective dialogue policy frequently requires prohibitively many conversations. This is partly because of the sparse rewards in dialogues, and the very few successful dialogues in early learning phase. Hindsight experience replay (HER) enables learning from failures, but the vanilla HER is inapplicable to dialogue learning due to the implicit goals. In this work, we develop two complex HER methods providing different tradeoffs between complexity and performance, and, for the first time, enabled HER-based dialogue policy learning. Experiments using a realistic user simulator show that our HER methods perform better than existing experience replay methods (as applied to deep Q-networks) in learning rate.
引用
收藏
页码:2596 / 2603
页数:8
相关论文
共 50 条
  • [1] Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness
    Zhang, Zheng
    Liao, Lizi
    Zhu, Xiaoyan
    Chua, Tat-Seng
    Liu, Zitao
    Huang, Yan
    Huang, Minlie
    [J]. 1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 122 - 132
  • [2] AirDialogue: An Environment for Goal-Oriented Dialogue Research
    Wei, Wei
    Le, Quoc, V
    Dai, Andrew M.
    Li, Li-Jia
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3844 - 3854
  • [3] Domain Expert Platform for Goal-Oriented Dialogue Collection
    Gosko, Didzis
    Znotins, Arturs
    Skadina, Inguna
    Gruzitis, Normunds
    Nespore-Berzkalne, Gunta
    [J]. EACL 2021: THE 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: PROCEEDINGS OF THE SYSTEM DEMONSTRATIONS, 2021, : 295 - 301
  • [4] LEARNING GOAL-ORIENTED VISUAL DIALOG VIA TEMPERED POLICY GRADIENT
    Zhao, Rui
    Tresp, Volker
    [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 868 - 875
  • [5] A functional solution for goal-oriented policy refinement
    Rubio-Loyola, Javier
    Serrat, Joan
    Charalambides, Marinos
    Flegkas, Paris
    Pavlou, George
    [J]. SEVENTH IEEE INTERNATIONAL WORKSHOP ON POLICIES FOR DISTRIBUTED SYSTEMS AND NETWORKS, PROCEEDINGS, 2006, : 133 - +
  • [6] Multi-Task Learning of System Dialogue Act Selection for Supervised Pretraining of Goal-Oriented Dialogue Policies
    McLeod, Sarah
    Kruijff-Korbayova, Ivana
    Kiefer, Bernd
    [J]. 20TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2019), 2019, : 411 - 417
  • [7] Building Goal-oriented Document-grounded Dialogue Systems
    Chen, Xi
    Lin, Faner
    Zhou, Yeju
    Ma, Kaixin
    Francis, Jonathan
    Nyberg, Eric
    Oltramari, Alessandro
    [J]. 1ST WORKSHOP ON DOCUMENT-GROUNDED DIALOGUE AND CONVERSATIONAL QUESTION ANSWERING (DIALDOC 2021), 2021, : 109 - 112
  • [8] Building Goal-Oriented Dialogue Systems with Situated Visual Context
    Agarwal, Sanchit
    Jezabek, Jan
    Biswas, Arijit
    Barut, Emre
    Gao, Bill
    Chung, Tagyoung
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 13149 - 13151
  • [9] Frames: A Corpus for Adding Memory to Goal-Oriented Dialogue Systems
    El Asri, Layla
    Schulz, Hannes
    Sharma, Shikhar
    Zumer, Jeremie
    Harris, Justin
    Fine, Emery
    Mehrotra, Rahul
    Suleman, Kaheer
    [J]. 18TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2017), 2017,
  • [10] Goal-Oriented Learning on the Web: Practical approach
    Maurer, H
    Scherbakov, N
    [J]. ADVANCED RESEARCH IN COMPUTERS AND COMMUNICATIONS IN EDUCATION, VOL 2: NEW HUMAN ABILITIES FOR THE NETWORKED SOCIETY, 1999, 55 : 736 - 743