T-Hoarder: A framework to process Twitter data streams

Cited by: 59
Authors
Congosto, Mariluz [1, 4]
Basanta-Val, Pablo [1, 2, 3]
Sanchez-Fernandez, Luis [1, 2, 3]
Affiliations
[1] Dept Ingn Telemat, Web Technol Lab, Madrid, Spain
[2] Univ Carlos III Madrid, Dept Ingn Telemat, E-28903 Getafe, Spain
[3] Univ Carlos III Madrid, Inst Financial Big Data, BS UC3M, E-28903 Getafe, Spain
[4] Dept Ingn Telemat, Edificio Torres Quevedo,Ave Univ 30, Madrid, Spain
Keywords
Twitter; Micro-blogging; Twitter analytics; Visualization;
DOI
10.1016/j.jnca.2017.01.029
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
With the rise of online social networks such as Twitter and Facebook, a series of new APIs has appeared to give access to the data that these new sources of information accumulate. One of the most popular online social networks is the micro-blogging site Twitter. Its APIs allow many machines to access the torrent of Twitter data simultaneously, listening to tweets and retrieving other useful information such as user profiles. A number of tools have appeared for processing Twitter data with different algorithms and for different purposes. This paper describes T-Hoarder, a framework that enables tweet crawling and data filtering and that can also display summarized and analytical information about Twitter activity on a given topic or event on a web page. This information is updated daily. The tool has been validated with real use cases, which allow a series of analyses of the performance one may expect from this type of infrastructure.
Pages: 28-39
Page count: 12