Task-based dialogue policy learning based on diffusion models

被引:0
|
作者
Liu, Zhibin [1 ]
Pang, Rucai [1 ]
Dong, Zhaoan [1 ]
机构
[1] Qufu Normal Univ, Sch Comp Sci, Yantai Rd, Rizhao 276826, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-domain dialogue; Reinforcement learning; Reward estimation; Behavioural cloning; Diffusion models;
D O I
10.1007/s10489-024-05810-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of task-based dialogue systems is to help users achieve their dialogue needs using as few dialogue rounds as possible. As the demand increases, the dialogue tasks gradually involve multiple domains and develop in the direction of complexity and diversity. Achieving high performance with low computational effort has become an essential metric for multi-domain task-based dialogue systems. This paper proposes a new approach to guided dialogue policy. The method introduces a conditional diffusion model in the reinforcement learning Q-learning algorithm to regularise the policy in a diffusion Q-learning manner. The conditional diffusion model is used to learn the action value function, regulate the actions using regularisation, sample the actions, use the sampled actions in the policy update process, and additionally add a loss term that maximizes the value of the actions in the policy update process to improve the learning efficiency. Our proposed method is based on a conditional diffusion model, combined with the reinforcement learning TD3 algorithm as a dialogue policy and an inverse reinforcement learning approach to construct a reward estimator to provide rewards for policy updates as a way of completing a multi-domain dialogue task.
引用
收藏
页码:11752 / 11764
页数:13
相关论文
共 50 条
  • [1] Developing a Task-Based Dialogue System for English Language Learning
    Li, Kuo-Chen
    Chang, Maiga
    Wu, Kuan-Hsing
    EDUCATION SCIENCES, 2020, 10 (11): : 1 - 20
  • [2] Task-based learning
    Race, P
    MEDICAL EDUCATION, 2000, 34 (05) : 335 - 336
  • [3] TASK-BASED LEARNING IN EDUCATION
    Naznean, Andreea
    PROCEEDINGS OF THE EUROPEAN INTEGRATION: BETWEEN TRADITION AND MODERNITY, VOL 3, 2009, : 749 - 755
  • [4] Task-based learning for pronunciation
    Lee, K
    INTERNATIONAL CONFERENCE ON COMPUTERS IN EDUCATION, VOLS I AND II, PROCEEDINGS, 2002, : 1500 - 1501
  • [5] A framework for task-based learning
    Yuan, FY
    TESOL QUARTERLY, 1999, 33 (01) : 157 - 158
  • [6] Learning Task-Based Instructional Policy for Excavator-Like Robots
    Maske, Harshal
    Kieson, Emily
    Chowdhary, Girish
    Abramson, Charles
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 1962 - 1969
  • [7] Task-Based Learning in Task-Based Teaching: Training Teachers of Chinese as a Foreign Language
    Han, ZhaoHong
    ANNUAL REVIEW OF APPLIED LINGUISTICS, 2018, 38 : 162 - 186
  • [8] Task-based language learning and teaching
    Apelgren, BM
    MODERNA SPRAK, 2004, 98 (01): : 115 - 117
  • [9] Task-based Language Learning and Teaching
    Swan, Michael
    INTERNATIONAL JOURNAL OF APPLIED LINGUISTICS, 2005, 15 (02) : 251 - 256
  • [10] Task-based language teaching and learning
    Ahmadian, Mohammad Javad
    LANGUAGE LEARNING JOURNAL, 2016, 44 (04): : 377 - 380