Bridging High Velocity and High Volume Industrial Big Data Through Distributed In-Memory Storage & Analytics

被引:0
|
作者
Williams, Jenny Weisenberg [1 ]
Aggour, Kareem S. [1 ]
Interrante, John [1 ]
McHugh, Justin [1 ]
Pool, Eric [2 ]
机构
[1] GE Global Res, Knowledge Discovery Lab, Niskayuna, NY 12309 USA
[2] GE Power & Water, Life Cycle Engn, Atlanta, GA 30339 USA
关键词
time series data; remote monitoring and diagnostics; distributed computing; in-memory data grids; big data;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With an exponential increase in time series sensor data generated by an ever-growing number of sensors on industrial equipment, new systems are required to efficiently store and analyze this "Industrial Big Data." To actively monitor industrial equipment there is a need to process large streams of high velocity time series sensor data as it arrives, and then store that data for subsequent analysis. Historically, separate systems would meet these needs, with neither system having the ability to perform fast analytics incorporating both just-arrived and historical data. In-memory data grids are a promising technology that can support both near real-time analysis and mid-term storage of big datasets, bridging the gap between high velocity and high volume big time series sensor data. This paper describes the development of a prototype infrastructure with an in-memory data grid at its core to analyze high velocity (>100,000 points per second), high volume (TB's) time series data produced by a fleet of gas turbines monitored at GE Power & Water's Remote Monitoring & Diagnostics Center.
引用
收藏
页码:932 / 941
页数:10
相关论文
共 50 条
  • [41] High Bandwidth Memory on FPGAs: A Data Analytics Perspective
    Kara, Kaan
    Hagleitner, Christoph
    Diamantopoulos, Dionysios
    Syrivelis, Dimitris
    Alonso, Gustavo
    [J]. 2020 30TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2020, : 1 - 8
  • [42] Big Data for Predictive Analytics in High Acuity Health Settings
    Zaleski, John
    [J]. Studies in Big Data, 2019, 42 : 51 - 100
  • [43] High performance deep learning techniques for big data analytics
    Li, Maozhen
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (23):
  • [44] Load Balancing Scheme for Supporting Real-time Processing of Big Data in Distributed In-Memory Systems
    Bok, Kyoungsoo
    Choi, Kitae
    Lim, Jongtae
    Yoo, Jaesoo
    [J]. PROCEEDINGS OF THE 2018 CONFERENCE ON RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS (RACS 2018), 2018, : 170 - 174
  • [45] PERFORMANCE-EFFICIENT RECOMMENDATION AND PREDICTION SERVICE FOR BIG DATA FRAMEWORKS FOCUSING ON DATA COMPRESSION AND IN-MEMORY DATA STORAGE INDICATORS
    Astsatryan, Hrachya
    Lalayan, Arthur
    Kocharyan, Aram
    Hagimont, Daniel
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2021, 22 (04): : 401 - 412
  • [46] Database Storage Format for High Performance Analytics of Immutable Data
    Rovnyagin, Mikhail M.
    Dmitriev, Sviatoslav O.
    Hrapov, Alexander S.
    Maksutov, Artem A.
    Turovskiy, Igor A.
    [J]. PROCEEDINGS OF THE 2021 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (ELCONRUS), 2021, : 618 - 622
  • [47] Alignment of High-Throughput Sequencing Data Inside In-Memory Databases
    Firnkorn, Daniel
    Knaup-Gregori, Petra
    Bermejo, Justo Lorenzo
    Ganzinger, Matthias
    [J]. E-HEALTH - FOR CONTINUITY OF CARE, 2014, 205 : 476 - 480
  • [48] In-Memory Data Anonymization Using Scalable and High Performance RDD Design
    Bazai, Sibghat Ullah
    Jang-Jaccard, Julian
    [J]. ELECTRONICS, 2020, 9 (10) : 1 - 26
  • [49] The impact of big data analytics on firms’ high value business performance
    Aleš Popovič
    Ray Hackney
    Rana Tassabehji
    Mauro Castelli
    [J]. Information Systems Frontiers, 2018, 20 : 209 - 222
  • [50] HIFUN - a high level functional query language for big data analytics
    Nicolas Spyratos
    Tsuyoshi Sugibuchi
    [J]. Journal of Intelligent Information Systems, 2018, 51 : 529 - 555