Engineering Scalable Distributed Services for Real-Time Big Data Analytics

被引:2
|
作者
Jambi, Sahar [1 ]
Anderson, Kenneth M. [1 ]
机构
[1] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
关键词
social media analysis; lambda architecture; query support; crisis informatics; SYSTEMS;
D O I
10.1109/BigDataService.2017.22
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is high demand for tools that analyze large sets of streaming data in both industrial and academic settings. While existing work has examined a wide range of issues, we focus on query support. In particular, we focus on providing analysts flexibility with respect to the types of queries they can make on large data sets in real time as well as over historical data. We have designed and implemented a lightweight service-based framework-EPIC Real-Time that manages a set of queries that can be applied to user initiated data analysis events (such as studying tweets generated during a disaster). Our prototype combines stream processing and batch processing techniques inspired by the Lambda Architecture. We investigate a core set of query types that can answer a wide range of queries asked by analysts who study crisis events. In this paper, we present a prototype implementation of EPIC Real-Time which makes use of event driven and reactive programming techniques. We also present a performance evaluation on how efficiently the real-time and batch-oriented queries perform, how well these queries meet the needs of our analysts, and provide insight into how EPIC Real-Time performs along a number of dimensions including performance, usability, scalability, and reliability.
引用
收藏
页码:131 / 140
页数:10
相关论文
共 50 条
  • [1] Scalable Containerized Pipeline for Real-time Big Data Analytics
    Aurangzaib, Rana
    Iqbal, Waheed
    Abdullah, Muhammad
    Bukhari, Faisal
    Ullah, Faheem
    Erradi, Abdelkarim
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM 2022), 2022, : 25 - 32
  • [2] Real-Time Big Data Analytics: Applications and Challenges
    Mohamed, Nader
    Al-Jaroodi, Jameela
    [J]. 2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2014, : 305 - 310
  • [3] Squall: Scalable Real-time Analytics
    Vitorovic, Aleksandar
    Elseidy, Mohammed
    Guliyev, Khayyam
    Khue Vu Minh
    Espino, Daniel
    Dashti, Mohammad
    Klonatos, Yannis
    Koch, Christoph
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (13): : 1553 - 1556
  • [4] A Methodology of Real-Time Data Fusion for Localized Big Data Analytics
    Jabbar, Sohail
    Malik, Kaleem R.
    Ahmad, Mudassar
    Aldabbas, Omar
    Asif, Muhammad
    Khalid, Shehzad
    Han, Kijun
    Ahmed, Syed Hassan
    [J]. IEEE ACCESS, 2018, 6 : 24510 - 24520
  • [5] Logical big data integration and near real-time data analytics
    Silva, Bruno
    Moreira, Jose
    Costa, Rogerio Luis de C.
    [J]. DATA & KNOWLEDGE ENGINEERING, 2023, 146
  • [6] Big Data Stream Computing in Healthcare Real-Time Analytics
    Ta, Van-Dai
    Liu, Chuan-Ming
    Nkabinde, Goodwill Wandile
    [J]. PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2016), 2016, : 37 - 42
  • [7] A Survey on Real-time Big Data Analytics: Applications and Tools
    Yadranjiaghdam, Babak
    Pool, Nathan
    Tabrizi, Nasseh
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 404 - 409
  • [8] An incremental approach for real-time Big Data visual analytics
    Garcia, Ignacio
    Casado, Ruben
    Bouchachia, Abdelhamid
    [J]. 2016 IEEE 4TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD WORKSHOPS (FICLOUDW), 2016, : 177 - 182
  • [9] Real-time Big Data Analytics for Multimedia Transmission and Storage
    Wang, Kun
    Mi, Jun
    Xu, Chenhan
    Shu, Lei
    Deng, Der-Jiunn
    [J]. 2016 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2016,
  • [10] Big Data Streaming Platforms to Support Real-time Analytics
    Fernandes, Eliana
    Salgado, Ana Carolina
    Bernardino, Jorge
    [J]. ICSOFT: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, 2020, : 426 - 433