Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness

Cited by: 0

Authors:

Zhang, Zheng [1]

Liao, Lizi [2]

Zhu, Xiaoyan [1]

Chua, Tat-Seng [2]

Liu, Zitao [3]

Huang, Yan [3]

Huang, Minlie [1]

Affiliations:

[1] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Inst Artificial Intelligence, Beijing Natl Res Ctr, Beijing, Peoples R China

[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore

[3] TAL Educ Grp, Beijing, Peoples R China

Source:

1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020) | 2020

Keywords:

REINFORCEMENT; NETWORKS;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most existing approaches to goal-oriented dialogue policy learning use reinforcement learning, which focuses on the target agent's policy and simply treats the opposite agent's policy as part of the environment. In real-world scenarios, however, the behavior of an opposite agent often exhibits certain patterns or follows hidden policies, which the target agent can infer and exploit to facilitate its own decision making. This strategy is common in human mental simulation: one first imagines a specific action and its probable results before actually performing it. We therefore propose an opposite-behavior-aware framework for policy learning in goal-oriented dialogues. We estimate the opposite agent's policy from its behavior and use this estimate to improve the target agent by regarding it as part of the target policy. We evaluate our model on both cooperative and competitive dialogue tasks, showing superior performance over state-of-the-art baselines.

Pages: 122-132

Page count: 11
