Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness

被引:0
|
作者
Zhang, Zheng [1 ]
Liao, Lizi [2 ]
Zhu, Xiaoyan [1 ]
Chua, Tat-Seng [2 ]
Liu, Zitao [3 ]
Huang, Yan [3 ]
Huang, Minlie [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Inst Artificial Intelligence,Beijing Natl Res Ctr, Beijing, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[3] TAL Educ Grp, Beijing, Peoples R China
关键词
REINFORCEMENT; NETWORKS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing approaches for goal-oriented dialogue policy learning used reinforcement learning, which focuses on the target agent policy and simply treats the opposite agent policy as part of the environment. While in real-world scenarios, the behavior of an opposite agent often exhibits certain patterns or underlies hidden policies, which can be inferred and utilized by the target agent to facilitate its own decision making. This strategy is common in human mental simulation by first imaging a specific action and the probable results before really acting it. We therefore propose an opposite behavior aware framework for policy learning in goal-oriented dialogues. We estimate the opposite agent's policy from its behavior and use this estimation to improve the target agent by regarding it as part of the target policy. We evaluate our model on both cooperative and competitive dialogue tasks, showing superior performance over state-of-the-art baselines.
引用
收藏
页码:122 / 132
页数:11
相关论文
共 50 条
  • [41] Diplomat: A Conversational Agent Framework for Goal-Oriented Group Discussion
    Hogan, Kevin
    Baer, Annabelle
    Purtilo, James
    [J]. CONTEMPORARY ISSUES IN GROUP DECISION AND NEGOTIATION, GDN 2021, 2021, 420 : 143 - 154
  • [42] Agent-based tactics for goal-oriented requirements elaboration
    Letier, E
    van Lamsweerde, A
    [J]. ICSE 2002: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, 2002, : 83 - 93
  • [43] ScenEdit: a goal-oriented tool to design learning scenarios
    Emin, Valerie
    Pernin, Jean-Philippe
    [J]. ICALT: 2009 IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES, 2009, : 736 - 737
  • [44] Goal-Oriented Sensitivity Analysis of Hyperparameters in Deep Learning
    Paul Novello
    Gaël Poëtte
    David Lugato
    Pietro Marco Congedo
    [J]. Journal of Scientific Computing, 2023, 94
  • [45] Goal-Oriented Sensitivity Analysis of Hyperparameters in Deep Learning
    Novello, Paul
    Poette, Gael
    Lugato, David
    Congedo, Pietro Marco
    [J]. JOURNAL OF SCIENTIFIC COMPUTING, 2023, 94 (03)
  • [46] Data-Efficient Goal-Oriented Conversation with Dialogue Knowledge Transfer Networks
    Shalyminov, Igor
    Lee, Sungjin
    Eshghi, Arash
    Lemon, Oliver
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1741 - 1751
  • [47] Advancing Faithfulness of Large Language Models in Goal-Oriented Dialogue Question Answering
    Sticha, Abigail
    Braunschweiler, Norbert
    Doddipatla, Rama
    Knill, Kate
    [J]. PROCEEDINGS OF THE 6TH CONFERENCE ON ACM CONVERSATIONAL USER INTERFACES, CUI 2024, 2024,
  • [48] Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue
    Xu, Zipeng
    Feng, Fangxiang
    Wang, Xiaojie
    Yang, Yushu
    Jiang, Huixing
    Wang, Zhongyuan
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4271 - 4279
  • [49] Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue
    Matsumori, Shoya
    Shingyouchi, Kosuke
    Abe, Yuki
    Fukuchi, Yosuke
    Sugiura, Komei
    Imai, Michita
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1878 - 1887
  • [50] UniGDD: A Unified Generative Framework for Goal-Oriented Document-Grounded Dialogue
    Gao, Chang
    Zhang, Wenxuan
    Lam, Wai
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 599 - 605