Unstructured Data Analysis on Big Data using Map Reduce

被引:22
|
作者
Subramaniyaswamy, V [1 ]
Vijayakumar, V. [2 ]
Logesh, R. [1 ]
Indragandhi, V [3 ]
机构
[1] SASTRA Univ, Sch Comp, Thanjavur 613401, India
[2] VIT Univ, Sch Engn & Comp Sci, Madras 600127, Tamil Nadu, India
[3] SASTRA Univ, Sch Elect & Elect Engn, Thanjavur 613401, India
关键词
Hadoop; MapReduce; Collaborative Filtering; Mahout; Maven; Sentiment Analysis;
D O I
10.1016/j.procs.2015.04.015
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the real time scenario, the volume of data used linearly increases with time. Social networking sites like Facebook, Twitter discovered the growth of data which will be uncontrollable in the future. In order to manage the huge volume of data, the proposed method will process the data in parallel as small chunks in distributed clusters and aggregate all the data across clusters to obtain the final processed data. In Hadoop framework, MapReduce is used to perform the task of filtering, aggregation and to maintain the efficient storage structure. The data are preferably refined using collaborative filtering, under the prediction mechanism of particular data needed by the user. The proposed method is enhanced by using the techniques such as sentiment analysis through natural language processing for parsing the data into tokens and emoticon based clustering. The process of data clustering is based on user emotions to get the data needed by a specific user. The results show that the proposed approach significantly increases the performance of complexity analysis. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:456 / 465
页数:10
相关论文
共 50 条
  • [21] Analysis of Bayesian optimization algorithms for big data classification based on Map Reduce framework
    Banchhor, Chitrakant
    Srinivasu, N.
    [J]. JOURNAL OF BIG DATA, 2021, 8 (01)
  • [22] Analysis of Bayesian optimization algorithms for big data classification based on Map Reduce framework
    Chitrakant Banchhor
    N. Srinivasu
    [J]. Journal of Big Data, 8
  • [23] The Improved Effectual Data Processing in Big Data Executing Map Reduce Frame Work
    Sudhakar Yadav, N.
    Yeruva, Sagar
    Sunil Kumar, T.
    Susan, Talluri
    [J]. 2021 IEEE Mysore Sub Section International Conference, MysuruCon 2021, 2021, : 587 - 595
  • [24] An Approach in Big Data Analytics to Improve the Velocity of Unstructured Data Using MapReduce
    Sundarakumar, M. R.
    Mahadevan, G.
    Somula, Ramasubbareddy
    Sennan, Sankar
    Rawal, Bharat S.
    [J]. INTERNATIONAL JOURNAL OF SYSTEM DYNAMICS APPLICATIONS, 2021, 10 (04)
  • [25] ExNav: An Interactive Big Data hxploration Framework for Big Unstructured Data
    Ge, Xiaoyu
    Zhang, Xiaozhong
    Chrysanthis, Panos K.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 503 - 512
  • [26] Unravelling Unstructured Data: A Wealth of Information in Big Data
    Tanwar, Mona
    Duggal, Reena
    Khatri, Sunil Kumar
    [J]. 2015 4TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2015,
  • [27] Big Data Quality Assessment Model for Unstructured Data
    Taleb, Ikbal
    Serhani, Mohamed Adel
    Dssouli, Rachida
    [J]. PROCEEDINGS OF THE 2018 13TH INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION TECHNOLOGY (IIT), 2018, : 69 - 74
  • [28] Big Data Trend: Knowledge Discovery on the Unstructured Data
    Abu Muntalib, Shamsiah
    Sidi, Fatimah
    Jabar, Marzanah A.
    Ishak, Iskandar
    [J]. PROCEEDING OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2014, VOLS 1 AND 2, 2014, : 338 - 342
  • [29] Data Analysis for Chronic disease -Diabetes using Map Reduce Technique
    Ramkumar, N.
    Prakash, S.
    Sangeetha, K.
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2016,
  • [30] An Approach to Security for Unstructured Big Data
    Md. Ezazul Islam
    Md. Rafiqul Islam
    A B M Shawkat Ali
    [J]. The Review of Socionetwork Strategies, 2016, 10 (2) : 105 - 123