A parallel computing framework for big data

被引:4
|
作者
Chen, Guoliang [1 ,2 ]
Mao, Rui [1 ,2 ]
Lu, Kezhong [1 ,2 ]
机构
[1] Guangdong Prov Key Lab Popular High Performance C, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
基金
国家高技术研究发展计划(863计划);
关键词
NC-computing; metric space; data partitioning; parallel computing; SIMILARITY SEARCH; METRIC-SPACES; QUERIES;
D O I
10.1007/s11704-016-5003-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big data has received great attention in research and application. However, most of the current efforts focus on system and application to handle the challenges of "volume" and "velocity", and not much has been done on the theoretical foundation and to handle the challenge of "variety". Based on metric-space indexing and computationalcomplexity theory, we propose a parallel computing framework for big data. This framework consists of three components, i.e., universal representation of big data by abstracting various data types into metric space, partitioning of big data based on pair-wise distances in metric space, and parallel computing of big data with the NC-class computing theory.
引用
收藏
页码:608 / 621
页数:14
相关论文
共 50 条
  • [31] IoT, Cloud Computing and Big Data: Integrated Framework for Healthcare in Disasters
    Madanian, Samaneh
    Parry, Dave
    MEDINFO 2019: HEALTH AND WELLBEING E-NETWORKS FOR ALL, 2019, 264 : 998 - 1002
  • [32] The framework of social networks big data processing based on cloud computing
    Kewen, Liu, 1600, Science and Engineering Research Support Society (09):
  • [33] A Feasible Graph Partition Framework for Random Walks Implemented by Parallel Computing in Big Graph
    Liu Xiaoming
    Zhou Yadong
    Guan Xiaohong
    Sun Xiaoxiao
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 4986 - 4991
  • [34] INTERPRETABLE DECISION-TREE INDUCTION IN A BIG DATA PARALLEL FRAMEWORK
    Weinberg, Abraham Itzhak
    Last, Mark
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2017, 27 (04) : 737 - 748
  • [35] An Efficient Parallel Algorithm for Clustering Big Data based on the Spark Framework
    Faculty of Science of Rabat, Mohammed V University, Rabat, Morocco
    Intl. J. Adv. Comput. Sci. Appl., 7 (890-896):
  • [36] An Efficient Parallel Algorithm for Clustering Big Data based on the Spark Framework
    Dafir, Zineb
    Slaoui, Said
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 890 - 896
  • [37] PRIMEBALL: A Parallel Processing Framework Benchmark for Big Data Applications in the Cloud
    Ferrarons, Jaume
    Adhana, Mulu
    Colmenares, Carlos
    Pietrowska, Sandra
    Bentayeb, Fadila
    Darmont, Jerome
    PERFORMANCE CHARACTERIZATION AND BENCHMARKING, 2014, 8391 : 109 - 124
  • [38] Brain big data processing with massively parallel computing technology: challenges and opportunities
    Chen, Dan
    Hu, Yangyang
    Cai, Chang
    Zeng, Ke
    Li, Xiaoli
    SOFTWARE-PRACTICE & EXPERIENCE, 2017, 47 (03): : 405 - 420
  • [39] Use of parallel computing for analyzing big data in EEG studies of ambiguous perception
    Maksimenko, Vladimir A.
    Grubov, Vadim V.
    Kirsanov, Daniil, V
    DYNAMICS AND FLUCTUATIONS IN BIOMEDICAL PHOTONICS XV, 2018, 10493
  • [40] An equalisation control method for network big data transmission based on parallel computing
    Lu, Jiazhong
    Zhang, Xiaosong
    Liu, Xiaolei
    Hu, Teng
    Cao, Yu
    INTERNATIONAL JOURNAL OF INTERNET PROTOCOL TECHNOLOGY, 2020, 13 (01) : 32 - 38