An Approach in Big Data Analytics to Improve the Velocity of Unstructured Data Using MapReduce

被引:3
|
作者
Sundarakumar, M. R. [1 ]
Mahadevan, G. [2 ]
Somula, Ramasubbareddy [3 ]
Sennan, Sankar [4 ]
Rawal, Bharat S. [5 ]
机构
[1] AMC Engn Coll, Dept Comp Sci & Engn, Bengaluru, India
[2] AMC Engn Coll, Bengaluru, India
[3] VNRVJIET, Secunderabad, India
[4] Sona Coll Technol, Salem, India
[5] Gannon Univ, Dept Cyber Secur, Erie, PA USA
关键词
Big Data; CESI; MapReduce; MRBNGS;
D O I
10.4018/IJSDA.20211001.oa6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big data analytics is an innovative approach to extract the data from a huge volume of data warehouse systems. Hadoop is a framework, which is used to perform high speed data retrieval from various clusters by MapReduce and HDFS methods. The huge volumes of files are accessed using data mining, machine learning, and deep learning algorithms. However, these techniques take more time to retrieve the data among the clusters. To overcome the latency issue, the proposed work applies the hybrid algorithm, namely compressed elastic search index (CESI) and MapReduce-based next generation sequencing approach (MRBNGSA), in scheduling and shuffling phase. This proposed approach provides the tangible changes over the MapReduce phases. The performance of the proposed CESI-MRBNGSA algorithm provides significant performance than Hadoop BAM and GATK.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] Unstructured Data Analysis on Big Data using Map Reduce
    Subramaniyaswamy, V
    Vijayakumar, V.
    Logesh, R.
    Indragandhi, V
    [J]. BIG DATA, CLOUD AND COMPUTING CHALLENGES, 2015, 50 : 456 - 465
  • [32] Extensible Query Framework for Unstructured Medical Data - A Big Data Approach
    Istephan, Sarmad
    Siadat, Mohammad-Reza
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 455 - 462
  • [33] A Visual Programming Approach to Big Data Analytics
    Bockermann, Christian
    [J]. DESIGN, USER EXPERIENCE, AND USABILITY: USER EXPERIENCE DESIGN FOR DIVERSE INTERACTION PLATFORMS AND ENVIRONMENTS, PT II, 2014, 8518 : 393 - 404
  • [34] Big data Analytics in Healthcare: A Survey Approach
    Ramesh, Dharavath
    Suraj, Pranshu
    Saini, Lokendra
    [J]. 2016 INTERNATIONAL CONFERENCE ON MICROELECTRONICS, COMPUTING AND COMMUNICATIONS (MICROCOM), 2016,
  • [35] Visualization: A novel approach for big data analytics
    Kumar, Omesh
    Goyal, Abhishek
    [J]. 2016 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2016, : 121 - 124
  • [36] New approach in Big Data Mining for frequent itemset using mapreduce in HDFS
    Nikam, Pallavi V.
    Deshpande, Deepa S.
    [J]. 2018 3RD INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [37] Unstructured medical frameworks using big data
    Banu, A. Arjuman
    Reshmy, A. K.
    [J]. RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 : 234 - 241
  • [38] USING NoSQL FOR PROCESSING UNSTRUCTURED BIG DATA
    Balakayeva, G. T.
    Phillips, C.
    Darkenbayev, D. K.
    Turdaliyev, M.
    [J]. NEWS OF THE NATIONAL ACADEMY OF SCIENCES OF THE REPUBLIC OF KAZAKHSTAN-SERIES OF GEOLOGY AND TECHNICAL SCIENCES, 2019, (06): : 12 - 21
  • [39] Big Data: Tutorial and guidelines on information and process fusion for analytics algorithms with MapReduce
    Ramirez-Gallego, Sergio
    Fernandez, Alberto
    Garcia, Salvador
    Chen, Min
    Herrera, Francisco
    [J]. INFORMATION FUSION, 2018, 42 : 51 - 61
  • [40] Leveraging Big Data Analytics to Improve Quality of Care in Health Care: A fsQCA Approach
    Wang, Yichuan
    [J]. PROCEEDINGS OF THE 51ST ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2018, : 770 - 779