Scaling Up Deep Reinforcement Learning for Multi-Domain Dialogue Systems

被引:0
|
作者
Cuayahuitl, Heriberto [1 ]
Yu, Seunghak [2 ]
Williamson, Ashley [1 ]
Carse, Jacob [1 ]
机构
[1] Univ Lincoln, Sch Comp Sci, Lincoln, England
[2] Samsung Elect Co Ltd, Artificial Intelligence Team, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domains) face scalability problems due to large search spaces. This paper proposes a three-stage method for multi-domain dialogue policy learning-termed NDQN, and applies it to an information-seeking spoken dialogue system in the domains of restaurants and hotels. In this method, the first stage does multi-policy learning via a network of DQN agents; the second makes use of compact state representations by compressing raw inputs; and the third stage applies a pre-training phase for bootstraping the behaviour of agents in the network. Experimental results comparing DQN (baseline) versus NDQN (proposed) using simulations report that the proposed method exhibits better scalability and is promising for optimising the behaviour of multi-domain dialogue systems. An additional evaluation reports that the NDQN agents outperformed a K-Nearest Neighbour baseline in task success and dialogue length, yielding more efficient and successful dialogues.
引用
收藏
页码:3339 / 3346
页数:8
相关论文
共 50 条
  • [21] DISTRIBUTED DIALOGUE POLICIES FOR MULTI-DOMAIN STATISTICAL DIALOGUE MANAGEMENT
    Gasic, M.
    Kim, D.
    Tsiakoulis, P.
    Young, S.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5371 - 5375
  • [22] Multi-Agent and Cooperative Deep Reinforcement Learning for Scalable Network Automation in Multi-Domain SD-EONs
    Li, Baojia
    Zhang, Ruyun
    Tian, Xiaojian
    Zhu, Zuqing
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (04): : 4801 - 4813
  • [23] Dialogue summarization enhanced response generation for multi-domain task-oriented dialogue systems
    Wang, Lifang
    Zhao, Meng
    Ji, Hongru
    Jiang, Zejun
    Li, Ronghan
    Hu, Zhongtian
    Lu, Xinyu
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (03)
  • [24] Reinforcement Learning-based Multi-domain Network Slice Provisioning
    Wu, Zhouxiang
    Ishigaki, Genya
    Gour, Riti
    Li, Congzhou
    Mi, Feng
    Talluri, Subhash
    Jue, Jason P.
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 1899 - 1904
  • [25] Multi-domain Causal Structure Learning in Linear Systems
    Ghassami, AmirEmad
    Kiyavash, Negar
    Huang, Biwei
    Zhang, Kun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [26] SCALABLE MULTI-DOMAIN DIALOGUE STATE TRACKING
    Rastogi, Abhinav
    Hakkani-Tur, Dilek
    Heck, Larry
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 561 - 568
  • [27] Multi-domain Aspect Extraction Based on Deep and Lifelong Learning
    Lopez, Dionis
    Arco, Leticia
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS (CIARP 2019), 2019, 11896 : 556 - 565
  • [28] Efficient slot correlation learning network for multi-domain dialogue state tracking
    Qianyu Li
    Wensheng Zhang
    Mengxing Huang
    The Journal of Supercomputing, 2023, 79 : 18547 - 18568
  • [29] Efficient slot correlation learning network for multi-domain dialogue state tracking
    Li, Qianyu
    Zhang, Wensheng
    Huang, Mengxing
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (16): : 18547 - 18568
  • [30] Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems
    Wu, Chien-Sheng
    Madotto, Andrea
    Hosseini-Asl, Ehsan
    Xiong, Caiming
    Socher, Richard
    Fung, Pascale
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 808 - 819