Task-oriented reinforcement learning for continuous tasks in dynamic environment

Cited by: 0
Authors
Kamal, MAS [1]
Murata, J [1]
Hirasawa, K [1]
Affiliations
[1] Kyushu Univ, Grad Sch Informat Sci & Elect Engn, Higashi Ku, Fukuoka, Japan
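Keywords
reinforcement learning; non-episodic tasks; autonomous agents
DOI
Not available
Chinese Library Classification (CLC)
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract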
This paper presents a more realistic way of learning for non-episodic tasks of mobile agents, in which the generalized state spaces and the learning process do not depend on the structure of the environment. The work makes two main contributions. First, the proposed task-oriented reinforcement learning lets the agent use several Q-tables, selected according to the type of subtask, which greatly reduces the dimensionality of the state spaces. Second, the use of relative information about the environment topology lets the system work continuously in a dynamic environment.
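The abstract does not spell out the algorithm, so the following is only a minimal sketch of the two ideas it names, under stated assumptions: plain tabular Q-learning, one table per subtask type, and a state built from the clipped displacement between the agent and an object of interest rather than from absolute coordinates. The subtask names, the action set, the `relative_state` helper, and all parameter values are hypothetical and are not taken from the paper.

```python
import random
from collections import defaultdict

# Hypothetical action set for a grid-like world (assumption, not from the paper).
ACTIONS = ("north", "south", "east", "west")


def relative_state(agent_xy, object_xy, limit=2):
    """Clipped displacement from agent to object; independent of absolute position."""
    dx = max(-limit, min(limit, object_xy[0] - agent_xy[0]))
    dy = max(-limit, min(limit, object_xy[1] - agent_xy[1]))
    return (dx, dy)


class TaskOrientedAgent:
    """Keeps one small Q-table per subtask type instead of one joint table."""

    def __init__(self, subtasks, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.rng = random.Random(seed)
        # q[subtask][state][action] -> value; unseen states start at 0.0
        self.q = {
            name: defaultdict(lambda: dict.fromkeys(ACTIONS, 0.0))
            for name in subtasks
        }

    def act(self, subtask, state):
        """Epsilon-greedy action from the table of the active subtask only."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(ACTIONS)
        values = self.q[subtask][state]
        return max(ACTIONS, key=values.get)

    def update(self, subtask, state, action, reward, next_state):
        """Standard one-step Q-learning update on the active subtask's table."""
        table = self.q[subtask]
        target = reward + self.gamma * max(table[next_state].values())
        table[state][action] += self.alpha * (target - table[state][action])


# Two agent positions with the same relative layout map to the same state.
agent = TaskOrientedAgent(["reach_goal", "avoid_obstacle"], epsilon=0.0)
s = relative_state((10, 10), (11, 10))
s_next = relative_state((11, 10), (11, 10))
agent.update("reach_goal", s, "east", 1.0, s_next)
```

With a clipping limit of 2, each table in this sketch has at most 25 states however large the map is, which is one way to read the abstract's claim about reduced dimensionality; the paper's actual state encoding may differ.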
Pages: 829-832
Number of pages: 4