Big Data Processing on Volunteer Computing

被引:2
|
作者
Lv, Zhihan [1 ]
Chen, Dongliang [1 ]
Singh, Amit Kumar [2 ]
机构
[1] Sch Data Sci & Software Engn, Qingdao 266071, Peoples R China
[2] Natl Inst Technol Patna, Dept Comp Sci & Engn, Patna 800005, Bihar, India
基金
中国国家自然科学基金;
关键词
Volunteer computing; large-scale complex networks; loosely coupled distributed frameworks; big data; accurate shortest paths; RESOURCE-ALLOCATION; SYSTEMS; RECOGNITION; ROBUSTNESS; ALGORITHM; FRAMEWORK; PLATFORM; NODES;
D O I
10.1145/3409801
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to calculate the node big data contained in complex networks and realize the efficient calculation of complex networks, based on voluntary computing, taking ICE middleware as the communication medium, the loose coupling distributed framework DCBV based on voluntary computing is proposed. Then, the Master, Worker, and MiddleWare layers in the framework, and the development structure of a DCBV framework are designed. The task allocation and recovery strategy, message passing and communication mode, and fault tolerance processing are discussed. Finally, to calculate and verify parameters such as the average shortest path of the framework and shorten calculation time, an improved accurate shortest path algorithm, the N-SPFA algorithm, is proposed. Under different datasets, the node calculation and performance of the N-SPFA algorithm are explored. The algorithm is compared with four approximate shortest-path algorithms: Combined Link and Attribute (CIA), Lexicographic Breadth First Search (LBFS), Approximate algorithm of shortest path length based on center distance of area division (CDZ), and I Iub Vertex of area and Core Expressway (I IEA-CE). The results show that when the number of CPU threads is 4, the computation time of the DCBV framework is the shortest (514.63 ms). As the number of CPU cores increases, the overall computation time of the framework decreases gradually. For every 2 additional CPU cores, the number of tasks increases by 1. When the number of Worker nodes is 8 and the number of nodes is 1, the computation time of the framework is the shortest (210,979 ms), and the JO statistics data increase with the increase of Worker nodes. When the datasets are Undirected01 and Undirected02, the computation time of the N-SPFA algorithm is the shortest, which is 4520 ms and 7324 ms, respectively. I Iowever, the calculation time in the ca-condmat_undirected dataset is 175,292 ms, and the performance is slightly worse. Overall, however, the performance of the N-SPFA and SPFA algorithms is good. Therefore, the two algorithms are combined. For networks with less complexity, the computational scale coefficient of the SPFA algorithm can be set to 0.06, and for general networks, 0.2. When compared with other algorithms in different datasets, the pretreatment time, average query time, and overall query time of N-SPFA algorithm are the shortest, being 49.67 ms, 5.12 ms, and 94,720 ms, respectively. The accuracy (1.0087) and error rate (0.024) are also the best. In conclusion, voluntary computing can be applied to the processing of big data, which has a good reference significance for the distributed analysis of large-scale complex networks.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] The Scalability of Volunteer Computing for MapReduce Big Data Applications
    Li, Wei
    Guo, William
    [J]. DATA SCIENCE, PT 1, 2017, 727 : 153 - 165
  • [2] Cloud Computing for Big Data Processing
    Li, Xiaofang
    Zhuang, Yanbin
    Yang, Simon X.
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2017, 23 (04): : 545 - 546
  • [3] Computing infrastructure for big data processing
    Liu, Ling
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2013, 7 (02) : 165 - 170
  • [4] Computing infrastructure for big data processing
    Ling Liu
    [J]. Frontiers of Computer Science, 2013, 7 : 165 - 170
  • [5] Big Data Processing in Cloud Computing Environments
    Ji, Changqing
    Li, Yu
    Qiu, Wenming
    Awada, Uchechukwu
    Li, Keqiu
    [J]. PROCEEDINGS OF THE 2012 12TH INTERNATIONAL SYMPOSIUM ON PERVASIVE SYSTEMS, ALGORITHMS, AND NETWORKS (I-SPAN 2012), 2012, : 17 - 23
  • [6] Big Data Processing in Cloud Computing Environments
    Noraziah, A.
    Fakherldin, Mohammed Adam Ibrahim
    Adam, Khalid
    Majid, Mazlina Abdul
    [J]. ADVANCED SCIENCE LETTERS, 2017, 23 (11) : 11092 - 11095
  • [7] Fog computing: from architecture to edge computing and big data processing
    Singh, Simar Preet
    Nayyar, Anand
    Kumar, Rajesh
    Sharma, Anju
    [J]. JOURNAL OF SUPERCOMPUTING, 2019, 75 (04): : 2070 - 2105
  • [8] Fog computing: from architecture to edge computing and big data processing
    Simar Preet Singh
    Anand Nayyar
    Rajesh Kumar
    Anju Sharma
    [J]. The Journal of Supercomputing, 2019, 75 : 2070 - 2105
  • [9] Analysis of Hypoexponential Computing Services for Big Data Processing
    Zapechnikov, Sergey
    Miloslavskaya, Natalia
    Tolstoy, Alexander
    [J]. 2015 3RD INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD) AND INTERNATIONAL CONFERENCE ON OPEN AND BIG (OBD), 2015, : 579 - 584
  • [10] Edge computing for big data processing in underwater applications
    Periola, A. A.
    Alonge, A. A.
    Ogudo, K. A.
    [J]. WIRELESS NETWORKS, 2022, 28 (05) : 2255 - 2271