Task-oriented reinforcement learning for continuous tasks in dynamic environment

被引:0
|
作者
Kamal, MAS [1 ]
Murata, J [1 ]
Hirasawa, K [1 ]
机构
[1] Kyushu Univ, Grad Sch Informat Sci & Elect Engn, Higashi Ku, Fukuoka, Japan
关键词
reinforcement learning; non-episodic tasks; autonomous agents;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a more realistic way of learning for non-episodic tasks of mobile agents, in which the generalized state spaces as well as learning process do not depend on the environment structures. This work has two main contributions. First, the proposed task-oriented reinforcement learning allows the agent to use several Q-tables based on the type of subtasks that greatly reduces the dimensionality in state spaces. Second, the use of relative information of the environment topology makes the system capable of working in dynamic, environment continuously.
引用
收藏
页码:829 / 832
页数:4
相关论文
共 50 条
  • [1] A Task-oriented Chatbot Based on LSTM and Reinforcement Learning
    Hsueh, Yu-Ling
    Chou, Tai-Liang
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (01)
  • [2] A Task-oriented Chatbot Based on LSTM and Reinforcement Learning
    Chou, Tai-Liang
    Hsueh, Yu-Ling
    [J]. NLPIR 2019: 2019 3RD INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, 2019, : 87 - 91
  • [3] Task-oriented Dialogue System Based on Reinforcement Learning
    Song, Meina
    Chen, Zhongfu
    Niu, Peiqing
    Haihong, E.
    [J]. PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 93 - 98
  • [4] Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems
    Li, Ziming
    Kiseleva, Julia
    de Rijke, Maarten
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
  • [5] A Survey of Task-Oriented Dialogue Policies Based on Reinforcement Learning
    Xu, Kai
    Wang, Zhen-Yu
    Wang, Xu
    Qin, Hua
    Long, Yu-Xuan
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (06): : 1201 - 1231
  • [6] CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning
    Verma, Siddharth
    Fu, Justin
    Yang, Mengjiao
    Levine, Sergey
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 4471 - 4491
  • [7] Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog
    Zhang, Jiaping
    Zhao, Tiancheng
    Yu, Zhou
    [J]. 19TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2018), 2018, : 140 - 150
  • [8] Task-Oriented Deep Reinforcement Learning for Robotic Skill Acquisition and Control
    Xiang, Guofei
    Su, Jianbo
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (02) : 1056 - 1069
  • [9] Task-oriented learning on the Web
    Whittington, CD
    Campbell, LM
    [J]. INNOVATIONS IN EDUCATION AND TRAINING INTERNATIONAL, 1999, 36 (01): : 26 - 33
  • [10] Dynamic online discussion: task-oriented interaction for deep learning
    Du, Jianxia
    Havard, Byron
    Li, Heng
    [J]. EDUCATIONAL MEDIA INTERNATIONAL, 2005, 42 (03) : 207 - 218