Task-based dialogue policy learning based on diffusion models

被引：0

作者：

Liu, Zhibin ^{[1
]}

Pang, Rucai ^{[1
]}

Dong, Zhaoan ^{[1
]}

机构：

[1] Qufu Normal Univ, Sch Comp Sci, Yantai Rd, Rizhao 276826, Shandong, Peoples R China

来源：

APPLIED INTELLIGENCE | 2024年 / 54卷 / 22期

基金：

中国国家自然科学基金;

关键词：

Multi-domain dialogue; Reinforcement learning; Reward estimation; Behavioural cloning; Diffusion models;

D O I：

10.1007/s10489-024-05810-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The purpose of task-based dialogue systems is to help users achieve their dialogue needs using as few dialogue rounds as possible. As the demand increases, the dialogue tasks gradually involve multiple domains and develop in the direction of complexity and diversity. Achieving high performance with low computational effort has become an essential metric for multi-domain task-based dialogue systems. This paper proposes a new approach to guided dialogue policy. The method introduces a conditional diffusion model in the reinforcement learning Q-learning algorithm to regularise the policy in a diffusion Q-learning manner. The conditional diffusion model is used to learn the action value function, regulate the actions using regularisation, sample the actions, use the sampled actions in the policy update process, and additionally add a loss term that maximizes the value of the actions in the policy update process to improve the learning efficiency. Our proposed method is based on a conditional diffusion model, combined with the reinforcement learning TD3 algorithm as a dialogue policy and an inverse reinforcement learning approach to construct a reward estimator to provide rewards for policy updates as a way of completing a multi-domain dialogue task.

引用

页码：11752 / 11764

页数：13

共 50 条

[1] Developing a Task-Based Dialogue System for English Language Learning
Li, Kuo-Chen
Chang, Maiga
Wu, Kuan-Hsing
EDUCATION SCIENCES, 2020, 10 (11): : 1 - 20
[2] Task-based learning
Race, P
MEDICAL EDUCATION, 2000, 34 (05) : 335 - 336
[3] TASK-BASED LEARNING IN EDUCATION
Naznean, Andreea
PROCEEDINGS OF THE EUROPEAN INTEGRATION: BETWEEN TRADITION AND MODERNITY, VOL 3, 2009, : 749 - 755
[4] Task-based learning for pronunciation
Lee, K
INTERNATIONAL CONFERENCE ON COMPUTERS IN EDUCATION, VOLS I AND II, PROCEEDINGS, 2002, : 1500 - 1501
[5] A framework for task-based learning
Yuan, FY
TESOL QUARTERLY, 1999, 33 (01) : 157 - 158
[6] Learning Task-Based Instructional Policy for Excavator-Like Robots
Maske, Harshal
Kieson, Emily
Chowdhary, Girish
Abramson, Charles
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 1962 - 1969
[7] Task-Based Learning in Task-Based Teaching: Training Teachers of Chinese as a Foreign Language
Han, ZhaoHong
ANNUAL REVIEW OF APPLIED LINGUISTICS, 2018, 38 : 162 - 186
[8] Task-based language learning and teaching
Apelgren, BM
MODERNA SPRAK, 2004, 98 (01): : 115 - 117
[9] Task-based Language Learning and Teaching
Swan, Michael
INTERNATIONAL JOURNAL OF APPLIED LINGUISTICS, 2005, 15 (02) : 251 - 256
[10] Task-based language teaching and learning
Ahmadian, Mohammad Javad
LANGUAGE LEARNING JOURNAL, 2016, 44 (04): : 377 - 380

← 1 2 3 4 5 →