Standardizing Unstructured Big Data and Visual Interpretation using MapReduce and Correspondence Analysis

Authors
Choi, Joseph [1]
Choi, Yong-Seok [1]
Affiliation
[1] Pusan Natl Univ, Dept Stat, 2 Busandaehak Ro,63beon Gil, Busan 609735, South Korea
Keywords
Big data; unstructured data; MapReduce; correspondence analysis; direct relationship words; The Korea Economic Daily
DOI
10.5351/KJAS.2014.27.2.169
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Subject classification codes
020208; 070103; 0714
Abstract
Massive and varied data recorded everywhere are called big data. It is therefore important to analyze big data and extract valuable information from it. Standardizing unstructured big data is also important so that statistical methods can be applied to it. In this paper, we show how to standardize unstructured big data using MapReduce, a distributed processing system. We also apply simple correspondence analysis and multiple correspondence analysis to find the relationships and characteristics of direct relationship words for Samsung Electronics and The Korea Economic Daily newspaper, as well as Apple Inc.
Pages: 169-183
Number of pages: 15