On-Chain and Off-Chain Data Management for Blockchain-Internet of Things: A Multi-Agent Deep Reinforcement Learning Approach

被引:11
|
作者
Tsang, Y. P. [1 ]
Lee, C. K. M. [1 ]
Zhang, Kening [1 ]
Wu, C. H. [2 ]
Ip, W. H. [2 ,3 ]
机构
[1] Hong Kong Polytech Univ, Res Inst Adv Mfg, Dept Ind & Syst Engn, Hung Hom,Kowloon, Hong Kong, Peoples R China
[2] Hang Seng Univ Hong Kong, Dept Supply Chain & Informat Management, Shatin, Hong Kong, Peoples R China
[3] Univ Saskatchewan, Dept Mech Engn, Saskatoon, SK, Canada
关键词
Blockchain; Internet of Things; Data management; Deep reinforcement learning; Asynchronous advantage actor-critic (A3C) algorithm; PREDICTION; STORAGE;
D O I
10.1007/s10723-023-09739-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of blockchain technology has seen applications increasingly hybridise cloud storage and distributed ledger technology in the Internet of Things (IoT) and cyber-physical systems, complicating data management in decentralised applications (DApps). Because it is inefficient for blockchain technology to handle large amounts of data, effective on-chain and off-chain data management in peer-to-peer networks and cloud storage has drawn considerable attention. Space reservation is a cost-effective approach to managing cloud storage effectively, contrasting with the demand for additional space in real-time. Furthermore, off-chain data replication in the peer-to-peer network can eliminate single points of failure of DApps. However, recent research has rarely discussed optimising on-chain and off-chain data management in the blockchain-enabled IoT (BIoT) environment. In this study, the BIoT environment is modelled, with cloud storage and blockchain orchestrated over the peer-to-peer network. The asynchronous advantage actor-critic algorithm is applied to exploit intelligent agents with the optimal policy for data packing, space reservation, and data replication to achieve an intelligent data management strategy. The experimental analysis reveals that the proposed scheme demonstrates rapid convergence and superior performance in terms of average total reward compared with other typical schemes, resulting in enhanced scalability, security and reliability of blockchain-IoT networks, leading to an intelligent data management strategy.
引用
收藏
页数:22
相关论文
共 50 条
  • [22] Caching Transient Data for Internet of Things: A Deep Reinforcement Learning Approach
    Zhu, Hao
    Cao, Yang
    Wei, Xiao
    Wang, Wei
    Jiang, Tao
    Jin, Shi
    IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02): : 2074 - 2083
  • [23] A smart inventory management system with medication demand dependencies in a hospital supply chain: A multi-agent reinforcement learning approach
    Saha, Esha
    Rathore, Pradeep
    COMPUTERS & INDUSTRIAL ENGINEERING, 2024, 191
  • [24] An Autonomous Multi-Agent Approach to Supply Chain Event Management
    Bearzotti, Lorena
    Salomone, Enrique
    Chiotti, Omar
    IEEE/SOLI'2008: PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON SERVICE OPERATIONS AND LOGISTICS, AND INFORMATICS, VOLS 1 AND 2, 2008, : 524 - +
  • [25] An autonomous multi-agent approach to supply chain event management
    Bearzotti, Lorena A.
    Salomone, Enrique
    Chiotti, Omar J.
    INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2012, 135 (01) : 468 - 478
  • [26] Multi-agent reinforcement learning enabled link scheduling for next generation Internet of Things
    Zou, Yifei
    Yin, Haofei
    Zheng, Yanwei
    Dressler, Falko
    COMPUTER COMMUNICATIONS, 2023, 205 : 35 - 44
  • [27] Evolutionary Multi-Agent Deep Meta Reinforcement Learning Method for Swarm Intelligence Energy Management of Isolated Multi-Area Microgrid With Internet of Things
    Li, Jiawen
    Zhou, Tao
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (14) : 12923 - 12937
  • [28] Multi-Agent Deep Reinforcement Learning for content caching within the Internet of Vehicles
    Knari, Anas
    Derfouf, Mostapha
    Koulali, Mohammed-Amine
    Khoumsi, Ahmed
    Ad Hoc Networks, 2024, 152
  • [29] Multi-Agent Deep Reinforcement Learning for content caching within the Internet of Vehicles
    Knari, Anas
    Derfouf, Mostapha
    Koulali, Mohammed-Amine
    Khoumsi, Ahmed
    AD HOC NETWORKS, 2024, 152
  • [30] Multi-Agent Deep Reinforcement Learning for Multi-Echelon Inventory Management
    Liu, Xiaotian
    Hu, Ming
    Peng, Yijie
    Yang, Yaodong
    PRODUCTION AND OPERATIONS MANAGEMENT, 2024,