Heuristics-Based Trust Estimation in Multiagent Systems Using Temporal Difference Learning

被引:13
|
作者
Rishwaraj, G. [1 ]
Ponnambalam, S. G. [1 ]
Loo, Chu Kiong [2 ]
机构
[1] Monash Univ Malaysia, Sch Engn, Subang Jaya 47500, Malaysia
[2] Univ Malaya, Jalan Univ, Kuala Lumpur 50603, Malaysia
关键词
Multiagent system (MAS); temporal difference (TD) learning; trust estimation; MODEL;
D O I
10.1109/TCYB.2016.2634027
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The application of multiagent system (MAS) is becoming increasing popular as it allows agents in a system to pool resources together to achieve a common objective. A vital part of the MAS is the teamwork cooperation through the sharing of information and resources among the agents to optimize their efforts in accomplishing given objectives. A critical part of the teamwork effort is the ability to trust each other when executing any task to ensure efficient and successful cooperation. This paper presents the development of a trust estimation model that could empirically evaluate the trust of an agent in MAS. The proposed model is developed using temporal difference learning by incorporating the concept of Markov games and heuristics to estimate trust. Simulation experiments are conducted to test and evaluate the performance of the developed model against some of the recently reported model in the literature. The simulation experiments indicate that the developed model performs better in terms of accuracy and efficiency in estimating trust.
引用
收藏
页码:1925 / 1935
页数:11
相关论文
共 50 条
  • [21] An authorization-based trust model for multiagent systems
    Wen, W
    Mizoguchi, F
    APPLIED ARTIFICIAL INTELLIGENCE, 2000, 14 (09) : 909 - 925
  • [22] Search for phosphors for use in displays and lighting using heuristics-based combinatorial materials science
    Sharma, Asish Kumar
    Sohn, Kee-Sun
    JOURNAL OF THE SOCIETY FOR INFORMATION DISPLAY, 2009, 17 (12) : 1073 - 1080
  • [23] Understanding heuristics-based financial decision-making using behavioral portfolio strategies
    Quddus, Kamran
    Banerjee, Ashok
    REVIEW OF BEHAVIORAL FINANCE, 2023, 15 (02) : 121 - 137
  • [24] REGION-BASED HEURISTICS FOR AN ITERATIVE PARTITIONING PROBLEM IN MULTIAGENT SYSTEMS
    Kemmerich, Thomas
    Buening, Hans Kleine
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2011, : 200 - 205
  • [25] Using asymmetric keys in a certified trust model for multiagent systems
    Botelho, Vanderson
    Enembreck, Fabricio
    Avila, Braulio
    de Azevedo, Hilton
    Scalabrin, Edson
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (02) : 1233 - 1240
  • [26] Case-Based Multiagent Reinforcement Learning: Cases as Heuristics for Selection of Actions
    Bianchi, Reinaldo A. C.
    Lopez de Mantaras, Ramon
    ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 355 - 360
  • [27] A complementary learning systems approach to temporal difference learning
    Blakeman, Sam
    Mareschal, Denis
    NEURAL NETWORKS, 2020, 122 : 218 - 230
  • [28] Minimizing delay in content-centric networks using heuristics-based in-network caching
    Sumit Kumar
    Rajeev Tiwari
    Sergei Kozlov
    Joel J. P. C. Rodrigues
    Cluster Computing, 2022, 25 : 417 - 431
  • [29] Deepaware: A hybrid deep learning and context-aware heuristics-based model for atrial fibrillation detection
    Kumar, Devender
    Peimankar, Abdolrahman
    Sharma, Kamal
    Dominguez, Helena
    Puthusserypady, Sadasivan
    Bardram, Jakob E.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 221
  • [30] Control of multivariable systems based on emotional temporal difference learning controller
    Abdi, Javad
    Khalili, GholamHassan Famil
    Fatourechi, Mehrdad
    Lucas, Caro
    Sedigh, Ali Khaki
    International Journal of Engineering, Transactions A: Basics, 2004, 17 (04): : 363 - 376