Scaling Up Deep Reinforcement Learning for Multi-Domain Dialogue Systems

Cited by: 0

Authors:

Cuayahuitl, Heriberto [1]

Yu, Seunghak [2]

Williamson, Ashley [1]

Carse, Jacob [1]

Affiliations:

[1] Univ Lincoln, Sch Comp Sci, Lincoln, England

[2] Samsung Elect Co Ltd, Artificial Intelligence Team, Seoul, South Korea

Source:

2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2017

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) face scalability problems when applied to multiple tasks (domains) because of the large search spaces involved. This paper proposes a three-stage method for multi-domain dialogue policy learning, termed NDQN, and applies it to an information-seeking spoken dialogue system in the domains of restaurants and hotels. In this method, the first stage performs multi-policy learning via a network of DQN agents; the second stage uses compact state representations obtained by compressing raw inputs; and the third stage applies a pre-training phase for bootstrapping the behaviour of the agents in the network. Experimental results from simulations comparing DQN (baseline) with NDQN (proposed) report that the proposed method exhibits better scalability and is promising for optimising the behaviour of multi-domain dialogue systems. An additional evaluation reports that the NDQN agents outperformed a K-Nearest Neighbour baseline in task success and dialogue length, yielding more efficient and successful dialogues.
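The three stages named in the abstract can be illustrated with a toy sketch. This is not the paper's implementation: a Q-table stands in for each deep Q-network, and the class names (`TabularAgent`, `AgentNetwork`), the `compress` function, the slot names, the action names, and the demonstration dialogue are all hypothetical choices made for illustration only.

```python
from collections import defaultdict
import random


class TabularAgent:
    """Stand-in for one DQN agent: a Q-table replaces the neural network (assumption)."""

    def __init__(self, actions, alpha=0.5, gamma=0.9):
        self.actions = list(actions)
        self.alpha, self.gamma = alpha, gamma
        self.q = defaultdict(float)  # maps (state, action) -> estimated value

    def act(self, state, epsilon=0.0):
        # Epsilon-greedy action selection; epsilon=0.0 is purely greedy.
        if random.random() < epsilon:
            return random.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, next_state, done):
        # Standard one-step Q-learning update.
        best_next = 0.0 if done else max(self.q[(next_state, a)] for a in self.actions)
        target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])

    def pretrain(self, demonstrations, passes=20):
        # Stage 3 (illustrative): bootstrap the agent from example transitions.
        for _ in range(passes):
            for transition in demonstrations:
                self.update(*transition)


def compress(raw_slots, slot_names):
    """Stage 2 (illustrative): reduce raw slot values to filled/unfilled flags."""
    return tuple(int(raw_slots.get(name) is not None) for name in slot_names)


class AgentNetwork:
    """Stage 1 (illustrative): one agent per domain, selected by domain name."""

    def __init__(self, domain_actions):
        self.agents = {d: TabularAgent(a) for d, a in domain_actions.items()}

    def act(self, domain, state, epsilon=0.0):
        return self.agents[domain].act(state, epsilon)


# Hypothetical slots, actions, and a single demonstration dialogue.
SLOTS = ("food", "area")
network = AgentNetwork({
    "restaurants": ["ask_food", "ask_area", "inform"],
    "hotels": ["ask_price", "ask_area", "inform"],
})
demo = [
    ((0, 0), "ask_food", 0.0, (1, 0), False),
    ((1, 0), "ask_area", 0.0, (1, 1), False),
    ((1, 1), "inform", 1.0, (1, 1), True),
]
network.agents["restaurants"].pretrain(demo)

state = compress({"food": "thai", "area": None}, SLOTS)
next_action = network.act("restaurants", state)
```

Keeping one small agent per domain means each agent searches only its own state-action space, which is the intuition behind the scalability claim; whether the real system routes between domains this way is an assumption of the sketch.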


Pages: 3339-3346

Page count: 8
