Appraising SPARK on Large-Scale Social Media Analysis

Cited by: 2
Authors
Belcastro, Loris [1]
Marozzo, Fabrizio [1]
Talia, Domenico [1]
Trunfio, Paolo [1]
Affiliations
[1] Univ Calabria, DIMES, Arcavacata Di Rende, Italy
Keywords
Social data analysis; Scalability; Spark; Cloud computing; Parallel library; Big Data; Patterns
DOI
10.1007/978-3-319-75178-8_39
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Software systems for social media analysis provide algorithms and tools for extracting useful knowledge from user-generated social media data. ParSoDA (Parallel Social Data Analytics) is a Java library for developing parallel data analysis applications that extract such knowledge. The library aims to reduce the programming skills needed to implement scalable social data analysis applications. This work describes how ParSoDA has been extended to execute applications on Apache Spark. On a cluster of 12 workers, the Spark version of the library reduces the execution time of two case study applications that exploit social media data by up to 42% compared to the Hadoop version.
Pages: 483-495
Number of pages: 13