Scaling Up Deep Reinforcement Learning for Multi-Domain Dialogue Systems

被引:0
|
作者
Cuayahuitl, Heriberto [1 ]
Yu, Seunghak [2 ]
Williamson, Ashley [1 ]
Carse, Jacob [1 ]
机构
[1] Univ Lincoln, Sch Comp Sci, Lincoln, England
[2] Samsung Elect Co Ltd, Artificial Intelligence Team, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domains) face scalability problems due to large search spaces. This paper proposes a three-stage method for multi-domain dialogue policy learning-termed NDQN, and applies it to an information-seeking spoken dialogue system in the domains of restaurants and hotels. In this method, the first stage does multi-policy learning via a network of DQN agents; the second makes use of compact state representations by compressing raw inputs; and the third stage applies a pre-training phase for bootstraping the behaviour of agents in the network. Experimental results comparing DQN (baseline) versus NDQN (proposed) using simulations report that the proposed method exhibits better scalability and is promising for optimising the behaviour of multi-domain dialogue systems. An additional evaluation reports that the NDQN agents outperformed a K-Nearest Neighbour baseline in task success and dialogue length, yielding more efficient and successful dialogues.
引用
收藏
页码:3339 / 3346
页数:8
相关论文
共 50 条
  • [1] Hierarchical Reinforcement Learning With Guidance for Multi-Domain Dialogue Policy
    Rohmatillah, Mahdin
    Chien, Jen-Tzung
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 748 - 761
  • [2] On using Deep Reinforcement Learning for Multi-Domain SFC placement
    Toumi, Nassima
    Bagaa, Miloud
    Ksentini, Adlen
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [3] Single-Model Multi-domain Dialogue Management with Deep Learning
    Papangelis, Alexandros
    Stylianou, Yannis
    ADVANCED SOCIAL INTERACTION WITH AGENTS, 2019, 510 : 71 - 77
  • [4] PROGRESSIVE DIALOGUE STATE TRACKING FOR MULTI-DOMAIN DIALOGUE SYSTEMS
    Wang, Jiahao
    Liu, Minqian
    Quan, Xiaojun
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7668 - 7672
  • [5] Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue Systems
    Balaraman, Vevake
    Magnini, Bernardo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 866 - 873
  • [6] ClippyScript: A Programming Language for Multi-Domain Dialogue Systems
    Seide, Frank
    McDirmid, Sean
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 242 - 245
  • [7] Scaling Multi-Domain Dialogue State Tracking via Query Reformulation
    Rastogi, Pushpendre
    Gupta, Arpit
    Chen, Tongfei
    Mathias, Lambert
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES(NAACL HLT 2019), VOL. 2 (INDUSTRY PAPERS), 2019, : 97 - 105
  • [8] Integrating topic estimation and dialogue history for domain selection in multi-domain spoken dialogue systems
    Ikeda, Satoshi
    Komatani, Kazunori
    Ogata, Tetsuya
    Okuno, Hiroshi G.
    NEW FRONTIERS IN APPLIED ARTIFICIAL INTELLIGENCE, 2008, 5027 : 294 - 304
  • [9] Scaling Up Multi-domain Semantic Segmentation with Sentence Embeddings
    Yin, Wei
    Liu, Yifan
    Shen, Chunhua
    Sun, Baichuan
    van den Hengel, Anton
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 4036 - 4051
  • [10] Robust Multi-Domain Multi-Turn Dialogue Policy via Student-Teacher Offline Reinforcement Learning
    Rohmatillah, Mahdin
    Chien, Jen-Tzung
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2024, 13 (01)