Big Data Processing on Volunteer Computing

Cited by: 2

Authors:

Lv, Zhihan [1]

Chen, Dongliang [1]

Singh, Amit Kumar [2]

Affiliations:

[1] Sch Data Sci & Software Engn, Qingdao 266071, Peoples R China

[2] Natl Inst Technol Patna, Dept Comp Sci & Engn, Patna 800005, Bihar, India

Source:

ACM TRANSACTIONS ON INTERNET TECHNOLOGY | 2021, Vol. 21, No. 04

Funding:

National Natural Science Foundation of China;

Keywords:

Volunteer computing; large-scale complex networks; loosely coupled distributed frameworks; big data; accurate shortest paths; RESOURCE-ALLOCATION; SYSTEMS; RECOGNITION; ROBUSTNESS; ALGORITHM; FRAMEWORK; PLATFORM; NODES;

DOI:

10.1145/3409801

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

To process the large volume of node data in complex networks and to compute such networks efficiently, this work proposes DCBV, a loosely coupled distributed framework based on volunteer computing that uses ICE middleware as its communication medium. The Master, Worker, and MiddleWare layers of the framework and the development structure of DCBV are designed, and the task allocation and recovery strategy, the message passing and communication mode, and fault-tolerance processing are discussed. To calculate and verify parameters such as the average shortest path within the framework while shortening calculation time, an improved accurate shortest-path algorithm, N-SPFA, is proposed. The node calculation and performance of N-SPFA are explored on different datasets, and the algorithm is compared with four approximate shortest-path algorithms: Combined Link and Attribute (CLA), Lexicographic Breadth First Search (LBFS), Approximate algorithm of shortest path length based on center distance of area division (CDZ), and Hub Vertex of area and Core Expressway (HEA-CE). The results show that the computation time of the DCBV framework is shortest (514.63 ms) when the number of CPU threads is 4. As the number of CPU cores increases, the overall computation time of the framework decreases gradually, and for every 2 additional CPU cores the number of tasks increases by 1. When the number of Worker nodes is 8 and the number of nodes is 1, the computation time of the framework is the shortest (210,979 ms), and the IO statistics increase as Worker nodes are added. On the Undirected01 and Undirected02 datasets, N-SPFA has its shortest computation times, 4520 ms and 7324 ms, respectively. However, on the ca-condmat_undirected dataset the calculation time is 175,292 ms, and the performance is slightly worse. Overall, the performance of both the N-SPFA and SPFA algorithms is good, so the two algorithms are combined. The computational scale coefficient of the SPFA algorithm can be set to 0.06 for less complex networks and to 0.2 for general networks. Compared with the other algorithms on different datasets, N-SPFA has the shortest pretreatment time, average query time, and overall query time (49.67 ms, 5.12 ms, and 94,720 ms, respectively), and its accuracy (1.0087) and error rate (0.024) are also the best. In conclusion, volunteer computing can be applied to big data processing, which provides a useful reference for the distributed analysis of large-scale complex networks.
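
For readers unfamiliar with the baseline that the abstract builds on, the following is a minimal sketch of the standard SPFA (queue-based Bellman-Ford) algorithm and of an average shortest path computed from it. It is not the N-SPFA algorithm or the DCBV framework described above, whose details are not given in this record. The function names `build_undirected`, `spfa`, and `average_shortest_path` and the toy graph are assumptions made for illustration, and the sketch assumes the graph contains no negative-weight cycles.

```python
from collections import deque
import math


def build_undirected(edges):
    """Build an adjacency dict from (u, v, weight) triples (hypothetical helper)."""
    adj = {}
    for u, v, w in edges:
        adj.setdefault(u, []).append((v, w))
        adj.setdefault(v, []).append((u, w))
    return adj


def spfa(adj, source):
    """Standard SPFA: single-source shortest paths via queue-based relaxation.

    Assumes no negative-weight cycles; every node must be a key of adj.
    """
    dist = {v: math.inf for v in adj}
    dist[source] = 0
    in_queue = {v: False for v in adj}
    queue = deque([source])
    in_queue[source] = True
    while queue:
        u = queue.popleft()
        in_queue[u] = False
        for v, w in adj[u]:
            if dist[u] + w < dist[v]:
                dist[v] = dist[u] + w
                # Re-enqueue v only if it is not already waiting in the queue.
                if not in_queue[v]:
                    queue.append(v)
                    in_queue[v] = True
    return dist


def average_shortest_path(adj):
    """Mean shortest-path length over all ordered, reachable node pairs."""
    total = 0.0
    pairs = 0
    for s in adj:
        for t, d in spfa(adj, s).items():
            if t != s and d != math.inf:
                total += d
                pairs += 1
    return total / pairs if pairs else 0.0


if __name__ == "__main__":
    graph = build_undirected([(0, 1, 1), (1, 2, 1)])
    print(spfa(graph, 0))
    print(average_shortest_path(graph))
```

In this sketch each source node is processed by an independent call to `spfa`, so the outer loop in `average_shortest_path` could in principle be split into separate tasks; how DCBV actually partitions and recovers tasks is not specified here.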

Pages: 20