Scientific Workflows in IoT Environments: A Data Placement Strategy Based on Heterogeneous Edge-Cloud Computing

被引:10
|
作者
Du, Xin [1 ,2 ]
Tang, Songtao [1 ,2 ]
Lu, Zhihui [1 ,3 ]
Gai, Keke [4 ]
Wu, Jie [1 ,5 ]
Hung, Patrick C. K. [6 ]
机构
[1] Fudan Univ, Sch Comp Sci, 2005 Songhu Rd, Shanghai, Peoples R China
[2] Minist Educ, Engn Res Ctr Cyber Secur Auditing & Monitoring, 2005 Songhu Rd, Shanghai, Peoples R China
[3] Shanghai Blockchain Engn Res Ctr, 2005 Songhu Rd, Shanghai, Peoples R China
[4] Beijing Inst Technol, Sch Cyberspace Secur, 5 Zhongguancun South St, Beijing, Peoples R China
[5] Peng Cheng Lab, 1 Xingke St, Shenzhen, Peoples R China
[6] Ontario Tech Univ, Faulty Business & Informat Technol, 2000 Simcoe St North, Oshawa, ON, Canada
基金
中国国家自然科学基金;
关键词
Heterogeneous edge-cloud computing; data-sharing; scientific workflows; IoT environments; DEPLOYMENT; MODEL;
D O I
10.1145/3531327
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In Industry 4.0 and Internet of Things (IoT) environments, the heterogeneous edge-cloud computing paradigm can provide a more proper solution to deploy scientific workflows compared to cloud computing or other traditional distributed computing. Owing to the different sizes of scientific datasets and the privacy issue concerning some of these datasets, it is essential to find a data placement strategy that can minimize data transmission time. Some state-of-the-art data placement strategies combine edge computing and cloud computing to distribute scientific datasets. However, the dynamic distribution of newly generated datasets to appropriate datacenters and exiting the spent datasets are still a challenge during workflows execution. To address this challenge, this study not only constructs a data placement model that includes shared datasets within the individual and among multiple workflows across various geographical regions, but also proposes a data placement strategy (DYM-RL-DPS) based on algorithms of two stages. First, during the build-time stage of workflows, we use the discrete particle swarm optimization algorithm with differential evolution to pre-allocate initial datasets to proper datacenters. Then, we reformulate the dynamic datasets distribution problem as a Markov decision process and provide a reinforcement learning-based approach to learn the data placement strategy in the runtime stage of scientific workflows. Through using the heterogeneous edge-cloud computing architecture to simulate IoT environments, we designed comprehensive experiments to demonstrate the superiority of DYM-RL-DPS. The results of our strategy can effectively reduce the data transmission time as compared to other strategies.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] A Novel Data Placement Strategy for Data-Sharing Scientific Workflows in Heterogeneous Edge-Cloud Computing Environments
    Du, Xin
    Tang, Songtao
    Lu, Zhihui
    Wu, Jie
    Gai, Keke
    Hung, Patrick C. K.
    [J]. 2020 IEEE 13TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2020), 2020, : 498 - 507
  • [2] A data replica placement strategy for IoT workflows in collaborative edge and cloud environments
    Shao, Yanling
    Li, Chunlin
    Tang, Hengliang
    [J]. COMPUTER NETWORKS, 2019, 148 : 46 - 59
  • [3] An Adaptive Data Placement Strategy in scientific workflows over Cloud Computing Environments
    Kim, Heewon
    Kim, Yoonhee
    [J]. NOMS 2018 - 2018 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, 2018,
  • [4] A New Data Placement Approach for Scientific Workflows in Cloud Computing Environments
    Kchaou, Hamdi
    Kechaou, Zied
    Alimi, Adel M.
    [J]. INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA 2016), 2017, 557 : 330 - 340
  • [5] A data placement strategy in scientific cloud workflows
    Yuan, Dong
    Yang, Yun
    Liu, Xiao
    Chen, Jinjun
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2010, 26 (08): : 1200 - 1214
  • [6] Enabling Data-intensive Workflows in Heterogeneous Edge-cloud Networks
    Shang, Xiaojun
    [J]. Performance Evaluation Review, 2023, 50 (03): : 36 - 38
  • [7] IoT Application Modules Placement and Dynamic Task Processing in Edge-Cloud Computing
    Fang, Juan
    Ma, Aonan
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (16) : 12771 - 12781
  • [8] BDAP: A Big Data Placement Strategy for Cloud-Based Scientific Workflows
    Ebrahimi, Mahdi
    Mohan, Aravind
    Kashlev, Andrey
    Lu, Shiyong
    [J]. 2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015), 2015, : 105 - 114
  • [9] A Data Placement Strategy for Data-Intensive Scientific Workflows in Cloud
    Zhao, Qing
    Xiong, Congcong
    Zhao, Xi
    Yu, Ce
    Xiao, Jian
    [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 928 - 934
  • [10] RETRACTED: SAP: An IoT Application Module Placement Strategy Based on Simulated Annealing Algorithm in Edge-Cloud Computing (Retracted Article)
    Fang, Juan
    Li, Kai
    Hu, Juntao
    Xu, Xiaobin
    Teng, Ziyi
    Xiang, Wei
    [J]. JOURNAL OF SENSORS, 2021, 2021