Multilevel Data Processing Using Parallel Algorithms for Analyzing Big Data in High-Performance Computing

被引:0
|
作者
Awais Ahmad
Anand Paul
Sadia Din
M. Mazhar Rathore
Gyu Sang Choi
Gwanggil Jeon
机构
[1] Yeungnam University,Department of Information and Communication Engineering
[2] Kyungpook National University,School of Computer Science and Engineering
[3] Incheon National University,Department of Embedded Systems Engineering
关键词
Big Data; HPC; Parallel Processing algorithm; Four-tier system architecture;
D O I
暂无
中图分类号
学科分类号
摘要
The growing gap between users and the Big Data analytics requires innovative tools that address the challenges faced by big data volume, variety, and velocity. Therefore, it becomes computationally inefficient to analyze such massive volume of data. Moreover, advancements in the field of Big Data application and data science poses additional challenges, where High-Performance Computing solution has become a key issue and has attracted attention in recent years. However, these systems are either memoryless or computational inefficient. Therefore, keeping in view the aforementioned needs, there is a requirement for a system that can efficiently analyze a stream of Big Data within their requirements. Hence, this paper presents a system architecture that enhances the working of traditional MapReduce by incorporating parallel processing algorithm. Moreover, complete four-tier architecture is also proposed that efficiently aggregate the data, eliminate unnecessary data, and analyze the data by the proposed parallel processing algorithm. The proposed system architecture both read and writes operations that enhance the efficiency of the Input/Output operation. To check the efficiency of the proposed algorithms exploited in the proposed system architecture, we have implemented our proposed system using Hadoop and MapReduce. MapReduce is supported by a parallel algorithm that efficiently processes a huge volume of data sets. The system is implemented using MapReduce tool at the top of the Hadoop parallel nodes to generate and process graphs with near real-time. Moreover, the system is evaluated in terms of efficiency by considering the system throughput and processing time. The results show that the proposed system is more scalable and efficient.
引用
收藏
页码:508 / 527
页数:19
相关论文
共 50 条
  • [31] HIGH-PERFORMANCE COMPUTING WEB SEARCH SYSTEM BASED ON COMPUTER BIG DATA
    KANG Y.
    TANG B.
    HU X.
    Scalable Comput. Pract. Exp., 3 (1932-1939): : 1932 - 1939
  • [32] Use of parallel computing for analyzing big data in EEG studies of ambiguous perception
    Maksimenko, Vladimir A.
    Grubov, Vadim V.
    Kirsanov, Daniil, V
    DYNAMICS AND FLUCTUATIONS IN BIOMEDICAL PHOTONICS XV, 2018, 10493
  • [33] Cloud Computing in Remote Sensing : High Performance Remote Sensing Data Processing in a Big data Environment
    Sabri, Y.
    Aouad, S.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2021, 16 (06)
  • [34] High-Performance Passive Macromodeling Algorithms for Parallel Computing Platforms
    Chinea, Alessandro
    Grivet-Talocia, Stefano
    Olivadese, Salvatore Bernardo
    Gobbato, Luca
    IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2013, 3 (07): : 1188 - 1203
  • [35] A parallel computing framework for big data
    Guoliang Chen
    Rui Mao
    Kezhong Lu
    Frontiers of Computer Science, 2017, 11 : 608 - 621
  • [36] A parallel computing framework for big data
    Chen, Guoliang
    Mao, Rui
    Lu, Kezhong
    FRONTIERS OF COMPUTER SCIENCE, 2017, 11 (04) : 608 - 621
  • [38] Sketching-based High-Performance Biomedical Big Data Processing Accelerator
    Kulkarni, Amey
    Jafari, Ali
    Sagedy, Chris
    Mohsenin, Tinoosh
    2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 1138 - 1141
  • [39] High Performance Computing Cluster System and its Future Aspects in Processing Big Data
    Reddy, Kishor Kumar C.
    Chandrudu, Bala K. E.
    Anisha, P. R.
    Raju, G. V. S.
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 881 - 885
  • [40] pipsCloud: High performance cloud computing for remote sensing big data management and processing
    Wang, Lizhe
    Ma, Yan
    Yan, Jining
    Chang, Victor
    Zomaya, Albert Y.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 78 : 353 - 368