Bridging High Velocity and High Volume Industrial Big Data Through Distributed In-Memory Storage & Analytics

被引:0
|
作者
Williams, Jenny Weisenberg [1 ]
Aggour, Kareem S. [1 ]
Interrante, John [1 ]
McHugh, Justin [1 ]
Pool, Eric [2 ]
机构
[1] GE Global Res, Knowledge Discovery Lab, Niskayuna, NY 12309 USA
[2] GE Power & Water, Life Cycle Engn, Atlanta, GA 30339 USA
关键词
time series data; remote monitoring and diagnostics; distributed computing; in-memory data grids; big data;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With an exponential increase in time series sensor data generated by an ever-growing number of sensors on industrial equipment, new systems are required to efficiently store and analyze this "Industrial Big Data." To actively monitor industrial equipment there is a need to process large streams of high velocity time series sensor data as it arrives, and then store that data for subsequent analysis. Historically, separate systems would meet these needs, with neither system having the ability to perform fast analytics incorporating both just-arrived and historical data. In-memory data grids are a promising technology that can support both near real-time analysis and mid-term storage of big datasets, bridging the gap between high velocity and high volume big time series sensor data. This paper describes the development of a prototype infrastructure with an in-memory data grid at its core to analyze high velocity (>100,000 points per second), high volume (TB's) time series data produced by a fleet of gas turbines monitored at GE Power & Water's Remote Monitoring & Diagnostics Center.
引用
收藏
页码:932 / 941
页数:10
相关论文
共 50 条
  • [1] Distributed In-Memory Analytics for Big Temporal Data
    Yao, Bin
    Zhang, Wei
    Wang, Zhi-Jie
    Chen, Zhongpu
    Shang, Shuo
    Zheng, Kai
    Guo, Minyi
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2018, PT I, 2018, 10827 : 549 - 565
  • [2] ClimateSpark: An in-memory distributed computing framework for big climate data analytics
    Hu, Fei
    Yang, Chaowei
    Schnase, John L.
    Duffy, Daniel Q.
    Xu, Mengchao
    Bowen, Michael K.
    Lee, Tsengdar
    Song, Weiwei
    [J]. COMPUTERS & GEOSCIENCES, 2018, 115 : 154 - 166
  • [3] Performance Enhancement of Distributed K-Means Clustering for Big Data Analytics Through In-memory Computation
    Ketu, Shwet
    Agarwal, Sonali
    [J]. 2015 EIGHTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2015, : 318 - 324
  • [4] Using In-Memory Analytics to Quickly Crunch Big Data
    Garber, Lee
    [J]. COMPUTER, 2012, 45 (10) : 16 - 18
  • [5] Towards Automatic Memory Tuning for In-Memory Big Data Analytics in Clusters
    Koliopoulos, Aris-Kyriakos
    Yiapanis, Paraskevas
    Tekiner, Firat
    Nenadic, Goran
    Keane, John
    [J]. 2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 353 - 356
  • [6] Survey of In-memory Big Data Analytics and Latest Research Opportunities
    Gangarde, Rupali
    Pawar, Ambika
    Dani, Ajay
    [J]. 2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 197 - 201
  • [7] Big Data Analytics on High Velocity Streams: A Case Study
    Chardonnens, Thibaud
    Cudre-Mauroux, Philippe
    Grund, Martin
    Perroud, Benoit
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [8] A hybrid memory built by SSD and DRAM to support in-memory Big Data analytics
    Chen, Zhiguang
    Lu, Yutong
    Xiao, Nong
    Liu, Fang
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 335 - 354
  • [9] A hybrid memory built by SSD and DRAM to support in-memory Big Data analytics
    Zhiguang Chen
    Yutong Lu
    Nong Xiao
    Fang Liu
    [J]. Knowledge and Information Systems, 2014, 41 : 335 - 354
  • [10] BlueDBM: Distributed Flash Storage for Big Data Analytics
    Jun, Sang-Woo
    Liu, Ming
    Lee, Sungjin
    Hicks, Jamey
    Ankcorn, John
    King, Myron
    Xu, Shuotao
    Arvind
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2016, 34 (03):