Task-oriented reinforcement learning for continuous tasks in dynamic environment

被引：0

作者：

Kamal, MAS ^{[1
]}

Murata, J ^{[1
]}

Hirasawa, K ^{[1
]}

机构：

[1] Kyushu Univ, Grad Sch Informat Sci & Elect Engn, Higashi Ku, Fukuoka, Japan

来源：

SICE 2002: PROCEEDINGS OF THE 41ST SICE ANNUAL CONFERENCE, VOLS 1-5 | 2002年

关键词：

reinforcement learning; non-episodic tasks; autonomous agents;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a more realistic way of learning for non-episodic tasks of mobile agents, in which the generalized state spaces as well as learning process do not depend on the environment structures. This work has two main contributions. First, the proposed task-oriented reinforcement learning allows the agent to use several Q-tables based on the type of subtasks that greatly reduces the dimensionality in state spaces. Second, the use of relative information of the environment topology makes the system capable of working in dynamic, environment continuously.

引用

页码：829 / 832

页数：4

共 50 条

[41] Budgeted Policy Learning for Task-Oriented Dialogue Systems
Zhang, Zhirui
Li, Xiujun
Gao, Jianfeng
Chen, Enhong
[J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3742 - 3751
[42] Unsupervised learning of kb queries in task-oriented dialogs
Raghu, Dinesh
Gupta, Nikhil
Mausam
[J]. Transactions of the Association for Computational Linguistics, 2021, 9 : 374 - 390
[43] Task-oriented contrastive learning for unsupervised domain adaptation
Wei, Xing
Wen, Bin
Yang, Fan
Liu, Yujie
Zhao, Chong
Hu, Di
Luo, Hui
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
[44] One-Shot Learning for Task-Oriented Grasping
Holomjova, Valerija
Starkey, Andrew J.
Yun, Bruno
Meisner, Pascal
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (12) : 8232 - 8238
[45] Structural Learning: Attraction and Conformity in Task-Oriented Groups
James A. Kitts
Michael W. Macy
Andreas Flache
[J]. Computational & Mathematical Organization Theory, 1999, 5 (2): : 129 - 145
[46] Performance Improvement on Traditional Chinese Task-Oriented Dialogue Systems With Reinforcement Learning and Regularized Dropout Technique
Sheu, Jeng-Shin
Wu, Siang-Ru
Wu, Wen-Hung
[J]. IEEE ACCESS, 2023, 11 : 19849 - 19862
[47] Effect of delay on search decisions in a task-oriented reading environment
Mana, Amelia
Vidal-Abarca, Eduardo
Salmeron, Ladislao
[J]. METACOGNITION AND LEARNING, 2017, 12 (01) : 113 - 130
[48] Effect of delay on search decisions in a task-oriented reading environment
Amelia Mañá
Eduardo Vidal-Abarca
Ladislao Salmerón
[J]. Metacognition and Learning, 2017, 12 : 113 - 130
[49] An execution environment for flexible task-oriented software on multicore systems
Rauber, Thomas
Ruenger, Gudula
[J]. CONCURRENT ENGINEERING-RESEARCH AND APPLICATIONS, 2012, 20 (02): : 161 - 173
[50] Multi-task Learning for Natural Language Generation in Task-Oriented Dialogue
Zhu, Chenguang
Zeng, Michael
Huang, Xuedong
[J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1261 - 1266

← 1 2 3 4 5 →