Deep Reinforcement Learning for On-line Dialogue State Tracking

Cited by: 2
Authors
Chen, Zhi [1]
Chen, Lu [1]
Zhou, Xiang [1]
Yu, Kai [1]
Affiliations
[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, X LANCE Lab, Dept Comp Sci & Engn, Shanghai, Peoples R China
Keywords
Task-oriented Dialogue System; Joint Training; Reinforcement Learning
DOI
10.1007/978-981-99-2401-1_25
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Dialogue state tracking (DST) is a crucial module in dialogue management. It is usually cast as a supervised training problem, which is not convenient for on-line optimization. In this paper, a novel companion-teaching-based deep reinforcement learning (DRL) framework for on-line DST optimization is proposed. To the best of our knowledge, this is the first effort to optimize the DST module within a DRL framework for on-line task-oriented spoken dialogue systems. In addition, the dialogue policy can be jointly updated. Experiments show that on-line DST optimization can effectively improve dialogue manager performance while keeping the flexibility of using a predefined policy. Joint training of both DST and policy can further improve performance.
Pages: 278 - 292
Page count: 15
Related papers
50 in total
  • [21] Reinforcement learning: An on-line framework using support vectors
    Mansouri, Hicham
    Trafalis, Theodore B.
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1278 - 1282
  • [22] On-line evolutionary reinforcement learning in computation for stochastic domains
    Whiteson, Shimon
    Stone, Peter
    GECCO 2006: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, 2006, : 1577 - +
  • [23] Path Tracking Control and Identification of Tire Parameters using On-line Model-based Reinforcement Learning
    Kim, Taewan
    Kim, H. Jin
    2016 16TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2016, : 215 - 219
  • [24] Tracking the Race Between Deep Reinforcement Learning and Imitation Learning
    Gros, Timo P.
    Hoeller, Daniel
    Hoffmann, Joerg
    Wolf, Verena
    QUANTITATIVE EVALUATION OF SYSTEMS (QEST 2020), 2020, 12289 : 11 - 17
  • [25] Dialogue State Distillation Network with Inter-slot Contrastive Learning for Dialogue State Tracking
    Xu, Jing
    Song, Dandan
    Liu, Chong
    Hui, Siu Cheung
    Li, Fei
    Ju, Qiang
    He, Xiaonan
    Xie, Jian
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13834 - 13842
  • [26] On-line deep learning method for action recognition
    Konstantinos Charalampous
    Antonios Gasteratos
    Pattern Analysis and Applications, 2016, 19 : 337 - 354
  • [27] Parameter-Free On-line Deep Learning
    Wawrzynski, Pawel
    AUTOMATION 2017: INNOVATIONS IN AUTOMATION, ROBOTICS AND MEASUREMENT TECHNIQUES, 2017, 550 : 543 - 553
  • [28] On-line deep learning method for action recognition
    Charalampous, Konstantinos
    Gasteratos, Antonios
    PATTERN ANALYSIS AND APPLICATIONS, 2016, 19 (02) : 337 - 354
  • [29] Learning Markov State Abstractions for Deep Reinforcement Learning
    Allen, Cameron
    Parikh, Neev
    Gottesman, Omer
    Konidaris, George
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [30] STATE REPRESENTATION LEARNING FOR EFFECTIVE DEEP REINFORCEMENT LEARNING
    Zhao, Jian
    Zhou, Wengang
    Zhao, Tianyu
    Zhou, Yun
    Li, Houqiang
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,