Heuristics-Based Trust Estimation in Multiagent Systems Using Temporal Difference Learning

Cited by: 13

Authors:

Rishwaraj, G. [1]

Ponnambalam, S. G. [1]

Loo, Chu Kiong [2]

Affiliations:

[1] Monash Univ Malaysia, Sch Engn, Subang Jaya 47500, Malaysia

[2] Univ Malaya, Jalan Univ, Kuala Lumpur 50603, Malaysia

Source:

IEEE TRANSACTIONS ON CYBERNETICS | 2017 / Vol. 47 / No. 08

Keywords:

Multiagent system (MAS); temporal difference (TD) learning; trust estimation; MODEL;

DOI:

10.1109/TCYB.2016.2634027

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

The application of multiagent systems (MAS) is becoming increasingly popular, as it allows agents in a system to pool resources to achieve a common objective. A vital part of a MAS is teamwork: the agents cooperate by sharing information and resources to optimize their efforts in accomplishing given objectives. A critical part of this teamwork is the agents' ability to trust each other when executing a task, which ensures efficient and successful cooperation. This paper presents the development of a trust estimation model that can empirically evaluate the trust of an agent in a MAS. The proposed model is developed using temporal difference learning, incorporating the concepts of Markov games and heuristics to estimate trust. Simulation experiments are conducted to test and evaluate the performance of the developed model against some recently reported models in the literature. The simulation experiments indicate that the developed model performs better in terms of accuracy and efficiency in estimating trust.
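This record does not describe the paper's actual model (its Markov game formulation and heuristics are not given here). As a rough illustration of the general idea only, the following is a minimal, generic TD(0) sketch that estimates a trust value from interaction outcomes. The function names, the reward encoding (1.0 for a cooperative outcome, 0.0 otherwise), the step size `alpha`, the discount `gamma`, and the treatment of each interaction as a one-step episode are all assumptions made for this example, not details from the paper.

```python
def td0_update(value, reward, next_value, alpha, gamma):
    """One generic TD(0) step: move `value` toward reward + gamma * next_value."""
    td_error = reward + gamma * next_value - value
    return value + alpha * td_error


def estimate_trust(outcomes, alpha=0.2, gamma=0.9, initial=0.5):
    """Hypothetical trust estimate for one agent from a sequence of outcomes.

    Assumption: each interaction is a one-step episode that ends immediately,
    so the bootstrapped next-state value is 0 and the estimate stays in [0, 1]
    when rewards are in [0, 1].
    """
    trust = initial
    for reward in outcomes:
        trust = td0_update(trust, reward, 0.0, alpha, gamma)
    return trust


if __name__ == "__main__":
    # Hypothetical interaction history: mostly cooperative, one failure.
    history = [1.0, 1.0, 0.0, 1.0, 1.0]
    print(estimate_trust(history))
```

Under these assumptions the update reduces to an exponentially weighted average of recent outcomes, so a larger `alpha` makes the estimate react faster to a change in an agent's behavior at the cost of more noise.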


Pages: 1925-1935

Page count: 11

