Performance Improvement on Traditional Chinese Task-Oriented Dialogue Systems With Reinforcement Learning and Regularized Dropout Technique

被引：3

作者：

Sheu, Jeng-Shin ^{[1
]}

Wu, Siang-Ru ^{[1
]}

Wu, Wen-Hung ^{[2
]}

机构：

[1] Natl Yunlin Univ Sci & Technol, Dept Comp Sci & Informat Engn, Yunlin 640002, Taiwan

[2] Ponddy Educ Taiwan Ltd, New Taipei 231, Taiwan

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Task analysis; Reinforcement learning; Computational modeling; Artificial intelligence; Tokenization; Data models; NLP; regularized dropout; reinforcement learning; task-oriented dialogue;

D O I：

10.1109/ACCESS.2023.3248796

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The development of conversational voice assistant applications has been in full swing around the world. This paper aims to develop traditional Chinese multi-domain task-oriented dialogue (TOD) systems. It is typically implemented using pipeline approach, where submodules are optimized independently, resulting in inconsistencies with each other. Instead, this paper implements end-to-end multi-domain TOD models using pre-trained deep neural networks (DNNs). This allows us to integrate all the submodules into one single DNN model to solve the inconsistencies. Data shortages are common in conversational natural language processing (NLP) tasks using DNN models. In this regard, dropout regularization has been widely used to improve overfitting caused by insufficient training dataset. However, the randomness it introduces leads to non-negligible discrepancies between training and inference. On the other hand, pre-trained language models have successfully provided effective regularization for NLP tasks. An inherent disadvantage is that fine-tuning the pre-trained language model suffers from exposure bias and loss-evaluation mismatch. To this end, we propose a reinforcement learning (RL) approach to address both issues. Furthermore, we adopt a method called regularized dropout (R-Drop) to improve the inconsistency in dropout layers of DNNs. Experimental results show that both our proposed RL approach and the R-Drop technique can significantly improve the joint target accuracy (JGA) score and combined score of traditional Chinese TOD system in tasks of dialogue state tracking (DST) and end-to-end sentence prediction, respectively.

引用

页码：19849 / 19862

页数：14

共 50 条

[1] Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems
Li, Ziming
Kiseleva, Julia
de Rijke, Maarten
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
[2] Task-oriented Dialogue System Based on Reinforcement Learning
Song, Meina
Chen, Zhongfu
Niu, Peiqing
Haihong, E.
PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 93 - 98
[3] Continual Learning in Task-Oriented Dialogue Systems
Madotto, Andrea
Lin, Zhaojiang
Zhou, Zhenpeng
Moon, Seungwhan
Crook, Paul
Liu, Bing
Yu, Zhou
Cho, Eunjoon
Fung, Pascale
Wang, Zhiguang
2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7452 - 7467
[4] Using Reinforcement Learning for Dialogue Act Classification in Task-oriented Conversation Systems
Xia, Qingyang
2018 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (CSSE 2018), 2018, : 187 - 196
[5] A Survey of Task-Oriented Dialogue Policies Based on Reinforcement Learning
Xu K.
Wang Z.-Y.
Wang X.
Qin H.
Long Y.-X.
Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (06): : 1201 - 1231
[6] Budgeted Policy Learning for Task-Oriented Dialogue Systems
Zhang, Zhirui
Li, Xiujun
Gao, Jianfeng
Chen, Enhong
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3742 - 3751
[7] CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning
Verma, Siddharth
Fu, Justin
Yang, Mengjiao
Levine, Sergey
NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 4471 - 4491
[8] BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Lipton, Zachary
Li, Xiujun
Gao, Jianfeng
Li, Lihong
Ahmed, Faisal
Deng, Li
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5237 - 5244
[9] A Survey on Task-Oriented Dialogue Systems
Zhao Y.-Y.
Wang Z.-Y.
Wang P.
Yang T.
Zhang R.
Yin K.
Jisuanji Xuebao/Chinese Journal of Computers, 2020, 43 (10): : 1862 - 1896
[10] MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
Lin, Zhaojiang
Madotto, Andrea
Winata, Genta Indra
Fung, Pascale
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3391 - 3405

← 1 2 3 4 5 →