Accelerated multi-objective task learning using modified Q-learning algorithm

Cited by: 0

Authors:

Rajamohan, Varun Prakash [1]

Jagatheesaperumal, Senthil Kumar [1]

Affiliation:

[1] Mepco Schlenk Engn Coll, Dept Elect & Commun Engn, Sivakasi, Tamil Nadu, India

Source:

INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING | 2024, Vol. 47, No. 1

Keywords:

reinforcement learning; Q-learning; robotic manipulator; task learning; distance metric

DOI:

10.1504/IJAHUC.2024.140665

Chinese Library Classification number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Robots find extensive applications in industry. In recent years, the influence of robots has also increased rapidly in domestic scenarios. The Q-learning algorithm aims to maximise the reward for reaching the goal. This paper proposes a modified version of the Q-learning algorithm, known as Q-learning with scaled distance metric (Q - SD). This algorithm enhances task learning and makes task completion more meaningful. A robotic manipulator (agent) applies the Q - SD algorithm to the task of table cleaning. Using Q - SD, the agent acquires the sequence of steps necessary to accomplish the task while minimising the manipulator's movement distance. We partition the table into grids of different dimensions. The first has a grid count of 3 x 3, and the second has a grid count of 4 x 4. Using the Q - SD algorithm, the maximum success obtained in these two environments was 86% and 59% respectively. Moreover, compared to the conventional Q-learning algorithm, the drop in average distance moved by the agent in these two environments using the Q - SD algorithm was 8.61% and 6.7% respectively.
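The abstract does not give the Q - SD update rule, so the following is only a minimal sketch of one way a scaled distance term could be combined with tabular Q-learning on a table-cleaning grid. The reward shape, the `weight` parameter, the state encoding, and every function name here are illustrative assumptions, not the authors' formulation.

```python
import math
import random
from collections import defaultdict

def scaled_distance(a, b, grid_size):
    """Euclidean distance between two cells, scaled to [0, 1] by the grid diagonal."""
    max_dist = math.hypot(grid_size - 1, grid_size - 1)
    return math.hypot(a[0] - b[0], a[1] - b[1]) / max_dist

def shaped_reward(base_reward, pos, target, grid_size, weight=0.5):
    """Assumed reward: task reward minus a weighted, scaled movement cost."""
    return base_reward - weight * scaled_distance(pos, target, grid_size)

def q_update(q, state, action, reward, next_state, next_actions, alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update; returns the new Q-value."""
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
    return q[(state, action)]

def train(dirty, grid_size, start=(0, 0), episodes=2000, epsilon=0.2, seed=0):
    """Learn an order for cleaning the dirty cells (hypothetical environment)."""
    rng = random.Random(seed)
    q = defaultdict(float)
    for _ in range(episodes):
        pos, remaining = start, frozenset(dirty)
        while remaining:
            actions = sorted(remaining)
            if rng.random() < epsilon:
                action = rng.choice(actions)  # explore
            else:
                action = max(actions, key=lambda a: q[((pos, remaining), a)])  # exploit
            reward = shaped_reward(1.0, pos, action, grid_size)
            nxt = (action, remaining - {action})
            q_update(q, (pos, remaining), action, reward, nxt, sorted(nxt[1]))
            pos, remaining = nxt
    return q

def greedy_path(q, dirty, start=(0, 0)):
    """Follow the learned Q-values greedily and return the cleaning order."""
    pos, remaining, path = start, frozenset(dirty), []
    while remaining:
        action = max(sorted(remaining), key=lambda a: q[((pos, remaining), a)])
        path.append(action)
        pos, remaining = action, remaining - {action}
    return path
```

Under these assumptions, calling `train([(0, 2), (2, 0), (2, 2)], grid_size=3)` and then `greedy_path` on the result yields a cleaning order over the three cells. Scaling the distance by the grid diagonal keeps the movement penalty comparable between the 3 x 3 and 4 x 4 grids.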

Pages: 28-37

Page count: 10
