Accelerated multi-objective task learning using modified Q-learning algorithm

被引:0
|
作者
Rajamohan, Varun Prakash [1 ]
Jagatheesaperumal, Senthil Kumar [1 ]
机构
[1] Mepco Schlenk Engn Coll, Dept Elect & Commun Engn, Sivakasi, Tamil Nadu, India
关键词
reinforcement learning; Q-learning; robotic manipulator; task learning; distance metric;
D O I
10.1504/IJAHUC.2024.140665
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Robots find extensive applications in industry. In recent years, the influence of robots has also increased rapidly in domestic scenarios. The Q-learning algorithm aims to maximise the reward for reaching the goal. This paper proposes a modified version of the Q-learning algorithm, known as Q-learning with scaled distance metric (Q - SD). This algorithm enhances task learning and makes task completion more meaningful. A robotic manipulator (agent) applies the Q - SD algorithm to the task of table cleaning. Using Q - SD, the agent acquires the sequence of steps necessary to accomplish the task while minimising the manipulator's movement distance. We partition the table into grids of different dimensions. The first has a grid count of 3 x 3, and the second has a grid count of 4 x 4. Using the Q - SD algorithm, the maximum success obtained in these two environments was 86% and 59% respectively. Moreover, compared to the conventional Q-learning algorithm, the drop in average distance moved by the agent in these two environments using the Q - SD algorithm was 8.61% and 6.7% respectively.
引用
收藏
页码:28 / 37
页数:10
相关论文
共 50 条
  • [41] Multi-Agent Reinforcement Learning - An Exploration Using Q-Learning
    Graham, Caoimhin
    Bell, David
    Luo, Zhihui
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, : 293 - 298
  • [42] Multi-objective optimization of heat exchangers using a modified teaching-learning-based optimization algorithm
    Rao, R. Venkata
    Patel, Vivek
    APPLIED MATHEMATICAL MODELLING, 2013, 37 (03) : 1147 - 1162
  • [43] Anomaly Detection using Fuzzy Q-learning Algorithm
    Shamshirband, Shahaboddin
    Anuar, Nor Badrul
    Kiah, Miss Laiha Mat
    Misra, Sanjay
    ACTA POLYTECHNICA HUNGARICA, 2014, 11 (08) : 5 - 28
  • [44] An improved Q-learning algorithm using synthetic pheromones
    Monekosso, N
    Remagnino, P
    Szarowicz, A
    FROM THEORY TO PRACTICE IN MULTI-AGENT SYSTEMS, 2002, 2296 : 197 - 206
  • [45] MULTI-OBJECTIVE MULTI-TASK LEARNING ON RNNLM FOR SPEECH RECOGNITION
    Song, Minguang
    Zhao, Yunxin
    Wang, Shaojun
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 197 - 203
  • [46] Research on multi-objective control strategy of thermal management system of pure electric vehicle at low temperature based on Q-learning algorithm
    Zhan, Sen
    Huang, Yu
    Li, Fei
    Yin, Yanli
    Liu, Chunsheng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2024,
  • [47] Unmanned ground weapon target assignment based on deep Q-learning network with an improved multi-objective artificial bee colony algorithm
    Wang, Tong
    Fu, Liyue
    Wei, Zhengxian
    Zhou, Yuhu
    Gao, Shan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [48] LASSO multi-objective learning algorithm for feature selection
    Frederico Coelho
    Marcelo Costa
    Michel Verleysen
    Antônio P. Braga
    Soft Computing, 2020, 24 : 13209 - 13217
  • [49] A Multi-objective Reinforcement Learning Algorithm for JS']JSSP
    Mendez-Hernandez, Beatriz M.
    Rodriguez-Bazan, Erick D.
    Martinez-Jimenez, Yailen
    Libin, Pieter
    Nowe, Ann
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 567 - 584
  • [50] LASSO multi-objective learning algorithm for feature selection
    Coelho, Frederico
    Costa, Marcelo
    Verleysen, Michel
    Braga, Antonio P.
    SOFT COMPUTING, 2020, 24 (17) : 13209 - 13217