Scaling Up Deep Reinforcement Learning for Multi-Domain Dialogue Systems

Cited by: 0
Authors
Cuayahuitl, Heriberto [1]
Yu, Seunghak [2]
Williamson, Ashley [1]
Carse, Jacob [1]
Affiliations
[1] Univ Lincoln, Sch Comp Sci, Lincoln, England
[2] Samsung Elect Co Ltd, Artificial Intelligence Team, Seoul, South Korea
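Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract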
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) face scalability problems when applied to multiple tasks (domains) because of the large search spaces involved. This paper proposes a three-stage method for multi-domain dialogue policy learning, termed NDQN, and applies it to an information-seeking spoken dialogue system in the domains of restaurants and hotels. In this method, the first stage performs multi-policy learning via a network of DQN agents; the second stage uses compact state representations obtained by compressing raw inputs; and the third stage applies a pre-training phase for bootstrapping the behaviour of the agents in the network. Experimental results from simulations comparing DQN (baseline) with NDQN (proposed) show that the proposed method exhibits better scalability and is promising for optimising the behaviour of multi-domain dialogue systems. An additional evaluation reports that the NDQN agents outperformed a K-Nearest Neighbour baseline in task success and dialogue length, yielding more efficient and successful dialogues.
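The three stages named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: each DQN agent is replaced here by a tabular Q-learner so the example runs with the standard library only, and every name in it (TabularQAgent, AgentNetwork, compress, pretrain, the slot and action names) is a hypothetical chosen for illustration. The parent-picks-domain routing and the replay of demonstration transitions are assumptions about how such a network and pre-training phase might look, not details given in the abstract.

```python
import random
from collections import defaultdict


class TabularQAgent:
    """Stand-in for one DQN agent: a Q-table replaces the neural network."""

    def __init__(self, actions, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)  # (state, action) -> estimated value
        self.rng = random.Random(seed)

    def act(self, state, greedy=False):
        # Epsilon-greedy action selection; greedy=True disables exploration.
        if not greedy and self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, next_state, done):
        # One-step Q-learning update towards reward + gamma * max_a' Q(s', a').
        best_next = 0.0 if done else max(
            self.q[(next_state, a)] for a in self.actions)
        target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])


def compress(raw_slots):
    """Stage 2 (assumed form): one flag per slot, 1 if filled, 0 if not."""
    return tuple(int(value is not None) for value in raw_slots.values())


def pretrain(agent, demonstrations):
    """Stage 3 (assumed form): replay demonstration transitions before learning."""
    for state, action, reward, next_state, done in demonstrations:
        agent.update(state, action, reward, next_state, done)


class AgentNetwork:
    """Stage 1 (assumed form): a parent picks a domain, its child picks the action."""

    def __init__(self, domain_actions):
        self.parent = TabularQAgent(domain_actions.keys())
        self.children = {
            domain: TabularQAgent(actions, seed=i + 1)
            for i, (domain, actions) in enumerate(domain_actions.items())
        }

    def act(self, state, greedy=False):
        domain = self.parent.act(state, greedy)
        return domain, self.children[domain].act(state, greedy)


# Usage: two domains, one compressed state, one demonstration step per agent.
net = AgentNetwork({
    "restaurants": ["ask_food", "inform"],
    "hotels": ["ask_area", "inform"],
})
state = compress({"food": None, "area": "north"})
pretrain(net.parent, [(state, "hotels", 1.0, state, True)])
pretrain(net.children["hotels"], [(state, "ask_area", 1.0, state, True)])
choice = net.act(state, greedy=True)
```

Splitting the action space across per-domain agents keeps each agent's table (or network output layer) small, which is the scalability argument the abstract makes; the sketch only shows the structure and makes no claim about the reported results.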
Pages: 3339-3346
Number of pages: 8