Smart Meter Data Analytics: Systems, Algorithms, and Benchmarking

被引:34
|
作者
Liu, Xiufeng [1 ]
Golab, Lukasz [2 ]
Golab, Wojciech [2 ]
Ilyas, Ihab F. [2 ]
Jin, Shichao [2 ]
机构
[1] Tech Univ Denmark, DK-2800 Lyngby, Denmark
[2] Univ Waterloo, 200 Univ Ave West, Waterloo, ON N2L 3G1, Canada
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2017年 / 42卷 / 01期
关键词
Smart meters; data analytics; performance benchmarking; Hadoop; Spark; HIVE;
D O I
10.1145/3004295
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Smart electricity meters have been replacing conventional meters worldwide, enabling automated collection of fine-grained (e.g.,every 15 minutes or hourly) consumption data. A variety of smart meter analytics algorithms and applications have been proposed, mainly in the smart grid literature. However, the focus has been on what can be done with the data rather than how to do it efficiently. In this article, we examine smart meter analytics from a software performance perspective. First, we design a performance benchmark that includes common smart meter analytics tasks. These include offline feature extraction and model building as well as a framework for online anomaly detection that we propose. Second, since obtaining real smart meter data is difficult due to privacy issues, we present an algorithm for generating large realistic datasets from a small seed of real data. Third, we implement the proposed benchmark using five representative platforms: a traditional numeric computing platform (Matlab), a relational DBMS with a built-in machine learning toolkit (PostgreSQL/MADlib), a main-memory column store ('' System C ''), and two distributed data processing platforms (Hive and Spark/Spark Streaming). We compare the five platforms in terms of application development effort and performance on a multicore machine as well as a cluster of 16 commodity servers.
引用
收藏
页数:39
相关论文
共 50 条
  • [1] Benchmarking study on Smart City Data Analytics
    El Mendili, Saida
    El Bouzekri El Idrissi, Younes
    Hmina, Nabil
    [J]. 2016 4TH IEEE INTERNATIONAL COLLOQUIUM ON INFORMATION SCIENCE AND TECHNOLOGY (CIST), 2016, : 841 - 846
  • [2] A Big Data platform for smart meter data analytics
    Wilcox, Tom
    Jin, Nanlin
    Flach, Peter
    Thumim, Joshua
    [J]. COMPUTERS IN INDUSTRY, 2019, 105 : 250 - 259
  • [3] Fast Big Data Analytics for Smart Meter Data
    Mohajeri, Morteza
    Ghassemi, Abolfazl
    Gulliver, T. Aaron
    [J]. IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2020, 1 : 1864 - 1871
  • [4] Smart Meter Data Analytics for Distribution Network
    Tang, Guojing
    Han, Yinghua
    Wang, Jinkuan
    Zhao, Qiang
    [J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 8882 - 8887
  • [5] SMAS: A Smart Meter Data Analytics System
    Liu, Xiufeng
    Golab, Lukasz
    Ilyas, Ihab F.
    [J]. 2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 1476 - 1479
  • [6] Special Issue: "Energy Data Analytics for Smart Meter Data"
    Reinhardt, Andreas
    Pereira, Lucas
    [J]. ENERGIES, 2021, 14 (17)
  • [7] SMART METER DATA ANALYTICS using OPENTSDB and HADOOP
    Prasad, Srikrishna
    Avinash, S. B.
    [J]. 2013 IEEE INNOVATIVE SMART GRID TECHNOLOGIES - ASIA (ISGT ASIA), 2013,
  • [8] Smart Electricity Meter Data Analytics: A Brief Review
    Pawar, Savita
    Momin, B. F.
    [J]. 2017 IEEE REGION 10 INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR SMART CITIES (IEEE TENSYMP 2017), 2017,
  • [9] Smart Meter Data Analytics using R and Hadoop
    Mathiyalagan, P.
    Shanmugapriya, A.
    Geethu, A., V
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ELECTRO INFORMATION TECHNOLOGY (EIT), 2017, : 623 - 629
  • [10] A residential labeled dataset for smart meter data analytics
    Pereira, Lucas
    Costa, Donovan
    Ribeiro, Miguel
    [J]. SCIENTIFIC DATA, 2022, 9 (01)