Unstructured Data Analysis on Big Data using Map Reduce

被引:22
|
作者
Subramaniyaswamy, V [1 ]
Vijayakumar, V. [2 ]
Logesh, R. [1 ]
Indragandhi, V [3 ]
机构
[1] SASTRA Univ, Sch Comp, Thanjavur 613401, India
[2] VIT Univ, Sch Engn & Comp Sci, Madras 600127, Tamil Nadu, India
[3] SASTRA Univ, Sch Elect & Elect Engn, Thanjavur 613401, India
关键词
Hadoop; MapReduce; Collaborative Filtering; Mahout; Maven; Sentiment Analysis;
D O I
10.1016/j.procs.2015.04.015
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the real time scenario, the volume of data used linearly increases with time. Social networking sites like Facebook, Twitter discovered the growth of data which will be uncontrollable in the future. In order to manage the huge volume of data, the proposed method will process the data in parallel as small chunks in distributed clusters and aggregate all the data across clusters to obtain the final processed data. In Hadoop framework, MapReduce is used to perform the task of filtering, aggregation and to maintain the efficient storage structure. The data are preferably refined using collaborative filtering, under the prediction mechanism of particular data needed by the user. The proposed method is enhanced by using the techniques such as sentiment analysis through natural language processing for parsing the data into tokens and emoticon based clustering. The process of data clustering is based on user emotions to get the data needed by a specific user. The results show that the proposed approach significantly increases the performance of complexity analysis. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:456 / 465
页数:10
相关论文
共 50 条
  • [1] Handling Big Data Efficiently by using Map Reduce Technique
    Maitrey, Seema
    Jha, C. K.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION TECHNOLOGY CICT 2015, 2015, : 703 - 708
  • [2] Addressing Big Data Problem Using Hadoop and Map Reduce
    Patel, Aditya B.
    Birla, Manashvi
    Nair, Ushma
    [J]. 3RD NIRMA UNIVERSITY INTERNATIONAL CONFERENCE ON ENGINEERING (NUICONE 2012), 2012,
  • [3] CLASSIFICATION ALGORITHMS FOR BIG DATA ANALYSIS, A MAP REDUCE APPROACH
    Ayma, V. A.
    Ferreira, R. S.
    Happ, P.
    Oliveira, D.
    Feitosaa, R.
    Costa, G.
    Plaza, A.
    Gamba, P.
    [J]. PIA15+HRIGI15 - JOINT ISPRS CONFERENCE, VOL. I, 2015, 40-3 (W2): : 17 - 21
  • [4] CSRS: Customized Service Recommendation System for Big Data Analysis using Map Reduce
    Bande, Vijay M.
    Pakle, Ganesh K.
    [J]. 2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 3, 2015, : 857 - 859
  • [5] BIG DATA ANALYSIS FOR HEART DISEASE DETECTION SYSTEM USING MAP REDUCE TECHNIQUE
    Vaishali, G.
    Kalaivani, V.
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16), 2016,
  • [6] Big Data Analytics using Hadoop Map Reduce Framework and Data Migration Process
    Bante, Payal M.
    Rajeswari, K.
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2017,
  • [7] MODEL OF BIG DATA MAP/REDUCE PROCESSING
    Orozova, Daniela
    Atanassov, Krassimir
    [J]. COMPTES RENDUS DE L ACADEMIE BULGARE DES SCIENCES, 2019, 72 (11): : 1537 - 1545
  • [8] An Approach to Sentiment Analysis on Unstructured Data in Big Data Environment
    Borikar, Dilipkumar A.
    Chandak, Manoj B.
    [J]. SMART TRENDS IN INFORMATION TECHNOLOGY AND COMPUTER COMMUNICATIONS, SMARTCOM 2016, 2016, 628 : 169 - 176
  • [9] Unstructured medical frameworks using big data
    Banu, A. Arjuman
    Reshmy, A. K.
    [J]. RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 : 234 - 241
  • [10] USING NoSQL FOR PROCESSING UNSTRUCTURED BIG DATA
    Balakayeva, G. T.
    Phillips, C.
    Darkenbayev, D. K.
    Turdaliyev, M.
    [J]. NEWS OF THE NATIONAL ACADEMY OF SCIENCES OF THE REPUBLIC OF KAZAKHSTAN-SERIES OF GEOLOGY AND TECHNICAL SCIENCES, 2019, (06): : 12 - 21