Scaling Up Deep Reinforcement Learning for Multi-Domain Dialogue Systems

被引:0
|
作者
Cuayahuitl, Heriberto [1 ]
Yu, Seunghak [2 ]
Williamson, Ashley [1 ]
Carse, Jacob [1 ]
机构
[1] Univ Lincoln, Sch Comp Sci, Lincoln, England
[2] Samsung Elect Co Ltd, Artificial Intelligence Team, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domains) face scalability problems due to large search spaces. This paper proposes a three-stage method for multi-domain dialogue policy learning-termed NDQN, and applies it to an information-seeking spoken dialogue system in the domains of restaurants and hotels. In this method, the first stage does multi-policy learning via a network of DQN agents; the second makes use of compact state representations by compressing raw inputs; and the third stage applies a pre-training phase for bootstraping the behaviour of agents in the network. Experimental results comparing DQN (baseline) versus NDQN (proposed) using simulations report that the proposed method exhibits better scalability and is promising for optimising the behaviour of multi-domain dialogue systems. An additional evaluation reports that the NDQN agents outperformed a K-Nearest Neighbour baseline in task success and dialogue length, yielding more efficient and successful dialogues.
引用
收藏
页码:3339 / 3346
页数:8
相关论文
共 50 条
  • [31] Mutually improved response generation and dialogue summarization for multi-domain task-oriented dialogue systems
    Zhao, Meng
    Wang, Lifang
    Ji, Hongru
    Jiang, Zejun
    Li, Ronghan
    Lu, Xinyu
    Hu, Zhongtian
    KNOWLEDGE-BASED SYSTEMS, 2023, 279
  • [32] Robust Calibration with Multi-domain Temperature Scaling
    Yu, Yaodong
    Bates, Stephen
    Ma, Yi
    Jordan, Michael I.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [33] Multi-domain Network Service Placement Optimization Using Curriculum Reinforcement Learning
    Shahbazi, Arzhang
    Cherrared, Sihem
    Guillemin, Fabrice
    2023 IEEE CONFERENCE ON NETWORK FUNCTION VIRTUALIZATION AND SOFTWARE DEFINED NETWORKS, NFV-SDN, 2023, : 21 - 26
  • [34] Scaling up Deep Reinforcement Learning for Intelligent Video Game Agents
    Debner, Anton
    2022 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2022), 2022, : 192 - 193
  • [35] Domains as Objectives: Multi-Domain Reinforcement Learning with Convex-Coverage Set Learning for Domain Uncertainty Awareness
    Ilboudo, Wendyam Eric Lionel
    Kobayashi, Taisuke
    Matsubara, Takamitsu
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 5622 - 5629
  • [36] Active Learning in Multi-Domain Collaborative Filtering Recommender Systems
    Guan, Xin
    Li, Chang-Tsun
    Guan, Yu
    33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2018, : 1351 - 1357
  • [37] MULTI-DOMAIN DIALOGUE SUCCESS CLASSIFIERS FOR POLICY TRAINING
    Vandyke, David
    Su, Pei-Hao
    Gasic, Milica
    Mrksic, Nikola
    Wen, Tsung-Hsien
    Young, Steve
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 763 - 770
  • [38] Multi-domain Dialogue State Tracking with Recursive Inference
    Liao, Lizi
    Zhu, Tongyao
    Long, Le Hong
    Chua, Tat Seng
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 2568 - 2577
  • [39] A HIERARCHICAL TRACKER FOR MULTI-DOMAIN DIALOGUE STATE TRACKING
    Li, Jieyu
    Zhu, Su
    Yu, Kai
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8014 - 8018
  • [40] PyDial: A Multi-domain Statistical Dialogue System Toolkit
    Ultes, Stefan
    Rojas-Barahona, Lina
    Su, Pei-Hao
    Vandyke, David
    Kim, Dongho
    Casanueva, Inigo
    Budzianowski, Pawel
    Mrksic, Nikola
    Wen, Tsung-Hsien
    Gasic, Milica
    Young, Steve
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017): SYSTEM DEMONSTRATIONS, 2017, : 73 - 78