Goal-Oriented Dialogue Policy Learning from Failures

被引：0

作者：

Lu, Keting ^{[1
]}

Zhang, Shiqi ^{[2
]}

Chen, Xiaoping ^{[1
]}

机构：

[1] Univ Sci & Technol China, Sch Comp Sci, Hefei, Anhui, Peoples R China

[2] SUNY Binghamton, Dept Comp Sci, Binghamton, NY USA

来源：

THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2019年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning methods have been used for learning dialogue policies. However, learning an effective dialogue policy frequently requires prohibitively many conversations. This is partly because of the sparse rewards in dialogues, and the very few successful dialogues in early learning phase. Hindsight experience replay (HER) enables learning from failures, but the vanilla HER is inapplicable to dialogue learning due to the implicit goals. In this work, we develop two complex HER methods providing different tradeoffs between complexity and performance, and, for the first time, enabled HER-based dialogue policy learning. Experiments using a realistic user simulator show that our HER methods perform better than existing experience replay methods (as applied to deep Q-networks) in learning rate.

引用

页码：2596 / 2603

页数：8

共 50 条

[1] Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness
Zhang, Zheng
Liao, Lizi
Zhu, Xiaoyan
Chua, Tat-Seng
Liu, Zitao
Huang, Yan
Huang, Minlie
[J]. 1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 122 - 132
[2] AirDialogue: An Environment for Goal-Oriented Dialogue Research
Wei, Wei
Le, Quoc, V
Dai, Andrew M.
Li, Li-Jia
[J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3844 - 3854
[3] Domain Expert Platform for Goal-Oriented Dialogue Collection
Gosko, Didzis
Znotins, Arturs
Skadina, Inguna
Gruzitis, Normunds
Nespore-Berzkalne, Gunta
[J]. EACL 2021: THE 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: PROCEEDINGS OF THE SYSTEM DEMONSTRATIONS, 2021, : 295 - 301
[4] LEARNING GOAL-ORIENTED VISUAL DIALOG VIA TEMPERED POLICY GRADIENT
Zhao, Rui
Tresp, Volker
[J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 868 - 875
[5] A functional solution for goal-oriented policy refinement
Rubio-Loyola, Javier
Serrat, Joan
Charalambides, Marinos
Flegkas, Paris
Pavlou, George
[J]. SEVENTH IEEE INTERNATIONAL WORKSHOP ON POLICIES FOR DISTRIBUTED SYSTEMS AND NETWORKS, PROCEEDINGS, 2006, : 133 - +
[6] Multi-Task Learning of System Dialogue Act Selection for Supervised Pretraining of Goal-Oriented Dialogue Policies
McLeod, Sarah
Kruijff-Korbayova, Ivana
Kiefer, Bernd
[J]. 20TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2019), 2019, : 411 - 417
[7] Building Goal-oriented Document-grounded Dialogue Systems
Chen, Xi
Lin, Faner
Zhou, Yeju
Ma, Kaixin
Francis, Jonathan
Nyberg, Eric
Oltramari, Alessandro
[J]. 1ST WORKSHOP ON DOCUMENT-GROUNDED DIALOGUE AND CONVERSATIONAL QUESTION ANSWERING (DIALDOC 2021), 2021, : 109 - 112
[8] Building Goal-Oriented Dialogue Systems with Situated Visual Context
Agarwal, Sanchit
Jezabek, Jan
Biswas, Arijit
Barut, Emre
Gao, Bill
Chung, Tagyoung
[J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 13149 - 13151
[9] Frames: A Corpus for Adding Memory to Goal-Oriented Dialogue Systems
El Asri, Layla
Schulz, Hannes
Sharma, Shikhar
Zumer, Jeremie
Harris, Justin
Fine, Emery
Mehrotra, Rahul
Suleman, Kaheer
[J]. 18TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2017), 2017,
[10] Goal-Oriented Learning on the Web: Practical approach
Maurer, H
Scherbakov, N
[J]. ADVANCED RESEARCH IN COMPUTERS AND COMMUNICATIONS IN EDUCATION, VOL 2: NEW HUMAN ABILITIES FOR THE NETWORKED SOCIETY, 1999, 55 : 736 - 743

← 1 2 3 4 5 →