Heuristics-Based Trust Estimation in Multiagent Systems Using Temporal Difference Learning

被引:13
|
作者
Rishwaraj, G. [1 ]
Ponnambalam, S. G. [1 ]
Loo, Chu Kiong [2 ]
机构
[1] Monash Univ Malaysia, Sch Engn, Subang Jaya 47500, Malaysia
[2] Univ Malaya, Jalan Univ, Kuala Lumpur 50603, Malaysia
关键词
Multiagent system (MAS); temporal difference (TD) learning; trust estimation; MODEL;
D O I
10.1109/TCYB.2016.2634027
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The application of multiagent system (MAS) is becoming increasing popular as it allows agents in a system to pool resources together to achieve a common objective. A vital part of the MAS is the teamwork cooperation through the sharing of information and resources among the agents to optimize their efforts in accomplishing given objectives. A critical part of the teamwork effort is the ability to trust each other when executing any task to ensure efficient and successful cooperation. This paper presents the development of a trust estimation model that could empirically evaluate the trust of an agent in MAS. The proposed model is developed using temporal difference learning by incorporating the concept of Markov games and heuristics to estimate trust. Simulation experiments are conducted to test and evaluate the performance of the developed model against some of the recently reported model in the literature. The simulation experiments indicate that the developed model performs better in terms of accuracy and efficiency in estimating trust.
引用
收藏
页码:1925 / 1935
页数:11
相关论文
共 50 条
  • [1] An efficient trust estimation model for multi-agent systems using temporal difference learning
    Rishwaraj, G.
    Ponnambalam, S. G.
    Kiong, Loo Chu
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 : S461 - S474
  • [2] An efficient trust estimation model for multi-agent systems using temporal difference learning
    G. Rishwaraj
    S. G. Ponnambalam
    Loo Chu Kiong
    Neural Computing and Applications, 2017, 28 : 461 - 474
  • [3] Heuristics-based parallelization of a class of control systems
    Rabbath, CA
    Lechevin, N
    El Cadi, AA
    Lapierre, SD
    Abdoune, M
    Crainic, T
    PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 2347 - 2352
  • [4] Heuristics-based learning approach for choquistic regression models
    Tehrani, Ali Fallah
    PATTERN RECOGNITION LETTERS, 2021, 149 : 137 - 142
  • [5] Heuristics-based integrated decisions for logistics network systems
    Tang, JF
    Yung, KL
    Ip, AWH
    JOURNAL OF MANUFACTURING SYSTEMS, 2004, 23 (01) : 1 - 13
  • [6] Multiagent reinforcement learning algorithm using temporal difference error
    Lee, SG
    ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 1, PROCEEDINGS, 2005, 3496 : 627 - 633
  • [7] A Heuristics-Based Reinforcement Learning Method to Control Bipedal Robots
    Qin, Daoling
    Zhang, Guoteng
    Zhu, Zhengguo
    Chen, Teng
    Zhu, Weiliang
    Rong, Xuewen
    Xie, Anhuan
    Li, Yibin
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2024, 21 (03)
  • [8] Water Distribution Network Design Using Heuristics-Based Algorithm
    Mohan, S.
    Babu, K. S. Jinesh
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2009, 23 (05) : 249 - 257
  • [9] Heuristics-based indoor positioning systems: a systematic literature review
    Lam, Luan D. M.
    Tang, Antony
    Grundy, John
    JOURNAL OF LOCATION BASED SERVICES, 2016, 10 (03) : 178 - 211
  • [10] Temporal feature and heuristics-based Noise Detection over Classical Machine Learning for ECG Signal Quality Assessment
    Hermawan, Indra
    Ma'sum, M. Anwar
    Intan, Riskyana Dewi P.
    Jatmiko, Wisnu
    Wiweko, Budi
    Boediman, Alfred
    Pradekso, Beno K.
    2019 4TH INTERNATIONAL WORKSHOP ON BIG DATA AND INFORMATION SECURITY (IWBIS 2019), 2019, : 1 - 7