On-Chain and Off-Chain Data Management for Blockchain-Internet of Things: A Multi-Agent Deep Reinforcement Learning Approach

Cited by: 11

Authors:

Tsang, Y. P. [1]

Lee, C. K. M. [1]

Zhang, Kening [1]

Wu, C. H. [2]

Ip, W. H. [2,3]

Affiliations:

[1] Hong Kong Polytech Univ, Res Inst Adv Mfg, Dept Ind & Syst Engn, Hung Hom, Kowloon, Hong Kong, Peoples R China

[2] Hang Seng Univ Hong Kong, Dept Supply Chain & Informat Management, Shatin, Hong Kong, Peoples R China

[3] Univ Saskatchewan, Dept Mech Engn, Saskatoon, SK, Canada

Source:

JOURNAL OF GRID COMPUTING | 2024 / Vol. 22 / Issue 01

Keywords:

Blockchain; Internet of Things; Data management; Deep reinforcement learning; Asynchronous advantage actor-critic (A3C) algorithm; PREDICTION; STORAGE;

DOI:

10.1007/s10723-023-09739-x

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

The emergence of blockchain technology has seen applications increasingly hybridise cloud storage and distributed ledger technology in the Internet of Things (IoT) and cyber-physical systems, complicating data management in decentralised applications (DApps). Because it is inefficient for blockchain technology to handle large amounts of data, effective on-chain and off-chain data management in peer-to-peer networks and cloud storage has drawn considerable attention. Space reservation is a cost-effective approach to managing cloud storage effectively, contrasting with the demand for additional space in real-time. Furthermore, off-chain data replication in the peer-to-peer network can eliminate single points of failure of DApps. However, recent research has rarely discussed optimising on-chain and off-chain data management in the blockchain-enabled IoT (BIoT) environment. In this study, the BIoT environment is modelled, with cloud storage and blockchain orchestrated over the peer-to-peer network. The asynchronous advantage actor-critic algorithm is applied to exploit intelligent agents with the optimal policy for data packing, space reservation, and data replication to achieve an intelligent data management strategy. The experimental analysis reveals that the proposed scheme demonstrates rapid convergence and superior performance in terms of average total reward compared with other typical schemes, resulting in enhanced scalability, security and reliability of blockchain-IoT networks, leading to an intelligent data management strategy.

Pages: 22
