Task-oriented reinforcement learning for continuous tasks in dynamic environment

被引：0

作者：

Kamal, MAS ^{[1
]}

Murata, J ^{[1
]}

Hirasawa, K ^{[1
]}

机构：

[1] Kyushu Univ, Grad Sch Informat Sci & Elect Engn, Higashi Ku, Fukuoka, Japan

来源：

SICE 2002: PROCEEDINGS OF THE 41ST SICE ANNUAL CONFERENCE, VOLS 1-5 | 2002年

关键词：

reinforcement learning; non-episodic tasks; autonomous agents;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a more realistic way of learning for non-episodic tasks of mobile agents, in which the generalized state spaces as well as learning process do not depend on the environment structures. This work has two main contributions. First, the proposed task-oriented reinforcement learning allows the agent to use several Q-tables based on the type of subtasks that greatly reduces the dimensionality in state spaces. Second, the use of relative information of the environment topology makes the system capable of working in dynamic, environment continuously.

引用

页码：829 / 832

页数：4

共 50 条

[1] A Task-oriented Chatbot Based on LSTM and Reinforcement Learning
Hsueh, Yu-Ling
Chou, Tai-Liang
[J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (01)
[2] A Task-oriented Chatbot Based on LSTM and Reinforcement Learning
Chou, Tai-Liang
Hsueh, Yu-Ling
[J]. NLPIR 2019: 2019 3RD INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, 2019, : 87 - 91
[3] Task-oriented Dialogue System Based on Reinforcement Learning
Song, Meina
Chen, Zhongfu
Niu, Peiqing
Haihong, E.
[J]. PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 93 - 98
[4] Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems
Li, Ziming
Kiseleva, Julia
de Rijke, Maarten
[J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
[5] A Survey of Task-Oriented Dialogue Policies Based on Reinforcement Learning
Xu, Kai
Wang, Zhen-Yu
Wang, Xu
Qin, Hua
Long, Yu-Xuan
[J]. Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (06): : 1201 - 1231
[6] CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning
Verma, Siddharth
Fu, Justin
Yang, Mengjiao
Levine, Sergey
[J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 4471 - 4491
[7] Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog
Zhang, Jiaping
Zhao, Tiancheng
Yu, Zhou
[J]. 19TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2018), 2018, : 140 - 150
[8] Task-Oriented Deep Reinforcement Learning for Robotic Skill Acquisition and Control
Xiang, Guofei
Su, Jianbo
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (02) : 1056 - 1069
[9] Task-oriented learning on the Web
Whittington, CD
Campbell, LM
[J]. INNOVATIONS IN EDUCATION AND TRAINING INTERNATIONAL, 1999, 36 (01): : 26 - 33
[10] Dynamic online discussion: task-oriented interaction for deep learning
Du, Jianxia
Havard, Byron
Li, Heng
[J]. EDUCATIONAL MEDIA INTERNATIONAL, 2005, 42 (03) : 207 - 218

← 1 2 3 4 5 →