Standardizing Unstructured Big Data and Visual Interpretation using MapReduce and Correspondence Analysis

被引:0
|
作者
Choi, Joseph [1 ]
Choi, Yong-Seok [1 ]
机构
[1] Pusan Natl Univ, Dept Stat, 2 Busandaehak Ro,63beon Gil, Busan 609735, South Korea
关键词
Big data; unstructured data; MapReduce; correspondence analysis; direct relationship words; The Korea Economic Daily;
D O I
10.5351/KJAS.2014.27.2.169
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Massive and various types of data recorded everywhere are called big data. Therefore, it is important to analyze big data and to find valuable information. Besides, to standardize unstructured big data is important for the application of statistical methods. In this paper, we will show how to standardize unstructured big data using MapReduce which is a distribution processing system. We also apply simple correspondence analysis and multiple correspondence analysis to find the relationship and characteristic of direct relationship words for Samsung Electronics and The Korea Economic Daily newspaper as well as Apple Inc.
引用
收藏
页码:169 / 183
页数:15
相关论文
共 50 条
  • [1] An Approach in Big Data Analytics to Improve the Velocity of Unstructured Data Using MapReduce
    Sundarakumar, M. R.
    Mahadevan, G.
    Somula, Ramasubbareddy
    Sennan, Sankar
    Rawal, Bharat S.
    [J]. INTERNATIONAL JOURNAL OF SYSTEM DYNAMICS APPLICATIONS, 2021, 10 (04)
  • [2] Big Data Analysis Solutions using MapReduce Framework
    Elagib, Sara B.
    Najeeb, Atahur Rahman
    Hashim, Aisha H.
    Olanrewaju, Rashidah F.
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING (ICCCE), 2014, : 127 - 130
  • [3] Unstructured Data Analysis on Big Data using Map Reduce
    Subramaniyaswamy, V
    Vijayakumar, V.
    Logesh, R.
    Indragandhi, V
    [J]. BIG DATA, CLOUD AND COMPUTING CHALLENGES, 2015, 50 : 456 - 465
  • [4] MapReduce: Simplified Data Analysis of Big Data
    Maitrey, Seema
    Jha, C. K.
    [J]. 3RD INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTING 2015 (ICRTC-2015), 2015, 57 : 563 - 571
  • [5] MapReduce Algorithms for Big Data Analysis
    Shim, Kyuseok
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (12): : 2016 - 2017
  • [6] Analysis of the Big Data based on MapReduce
    Tian, Zi-de
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 224 - 228
  • [7] MapReduce Algorithms for Big Data Analysis
    Shim, Kyuseok
    [J]. DATABASES THEORY AND APPLICATIONS, ADC 2018, 2018, 10837 : XV - XV
  • [8] Big Data Analysis of Indian Premier League using Hadoop and MapReduce
    Paul, Rajdeep
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS), 2017,
  • [9] Clustering on Big Data Using Hadoop MapReduce
    Akthar, Nadeem
    Ahamad, Mohd Vasim
    Khan, Shahbaz
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 789 - 795
  • [10] Unstructured medical frameworks using big data
    Banu, A. Arjuman
    Reshmy, A. K.
    [J]. RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 : 234 - 241