Deep Reinforcement Learning for On-line Dialogue State Tracking

被引:2
|
作者
Chen, Zhi [1 ]
Chen, Lu [1 ]
Zhou, Xiang [1 ]
Yu, Kai [1 ]
机构
[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, X LANCE Lab,Dept Comp Sci & Engn, Shanghai, Peoples R China
关键词
Task-oriented Dialogue System; Joint Training; Reinforcement Learning;
D O I
10.1007/978-981-99-2401-1_25
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Dialogue state tracking (DST) is a crucial module in dialogue management. It is usually cast as a supervised training problem, which is not convenient for on-line optimization. In this paper, a novel companion teaching based deep reinforcement learning (DRL) framework for on-line DST optimization is proposed. To the best of our knowledge, this is the first effort to optimize the DST module within DRL framework for on-line task-oriented spoken dialogue systems. In addition, dialogue policy can be further jointly updated. Experiments show that on-line DST optimization can effectively improve the dialogue manager performance while keeping the flexibility of using predefined policy. Joint training of both DST and policy can further improve the performance.
引用
收藏
页码:278 / 292
页数:15
相关论文
共 50 条
  • [1] On-Line Building Energy Optimization Using Deep Reinforcement Learning
    Mocanu, Elena
    Mocanu, Decebal Constantin
    Nguyen, Phuong H.
    Liotta, Antonio
    Webber, Michael E.
    Gibescu, Madeleine
    Slootweg, J. G.
    IEEE TRANSACTIONS ON SMART GRID, 2019, 10 (04) : 3698 - 3708
  • [2] On-line EM reinforcement learning
    Yoshimoto, J
    Ishii, S
    Sato, M
    IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL III, 2000, : 163 - 168
  • [3] Exploratory Policy Generation Methods in On-line Deep Reinforcement Learning: A Survey
    Li, Shilei
    Ye, Qing
    Yuan, Zhimin
    Chen, Yun
    He, Tao
    Fu, Yu
    Jiqiren/Robot, 2024, 46 (06): : 753 - 768
  • [4] JOINT ON-LINE LEARNING OF A ZERO-SHOT SPOKEN SEMANTIC PARSER AND A REINFORCEMENT LEARNING DIALOGUE MANAGER
    Riou, Matthieu
    Jabaian, Bassam
    Huet, Stephane
    Lefevre, Fabrice
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3072 - 3076
  • [5] On-line learning control by association and reinforcement
    Si, J
    Wang, YT
    IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL III, 2000, : 221 - 226
  • [6] On-line learning control by association and reinforcement
    Si, J
    Wang, YT
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2001, 12 (02): : 264 - 276
  • [7] Reinforcement Learning for on-line Sequence Transformation
    Rypesc, Grzegorz
    Lepak, Lukasz
    Wawrzynski, Pawel
    PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 133 - 139
  • [8] Dual Learning for Dialogue State Tracking
    Chen, Zhi
    Chen, Lu
    Zhao, Yanbin
    Zhu, Su
    Yu, Kai
    MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2022, 2023, 1765 : 293 - 305
  • [9] Representing the Reinforcement Learning State in a Negotiation Dialogue
    Heeman, Peter A.
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 450 - 455
  • [10] Tree-Based On-Line Reinforcement Learning
    Salles Barreto, Andre da Motta
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2417 - 2423