Data-driven estimation of building energy consumption with multi-source heterogeneous data

被引:100
|
作者
Pan, Yue [1 ]
Zhang, Limao [1 ]
机构
[1] Nanyang Technol Univ, Sch Civil & Environm Engn, 50 Nanyang Ave, Singapore 639798, Singapore
关键词
Building energy estimation; Data mining; Categorical boosting (CatBoost) model; Feature importance; ARTIFICIAL NEURAL-NETWORK; ELECTRICITY CONSUMPTION; FAULT-DETECTION; PREDICTION; MACHINE; CHINA; PERFORMANCE; EFFICIENCY; EMISSIONS; CATBOOST;
D O I
10.1016/j.apenergy.2020.114965
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
For better energy evaluation and management, a categorical boosting (CatBoost)-based predictive method is presented to accurately estimate building energy consumption by learning large volumes of multi-source heterogeneous data collected from buildings. To be specific, the newly-developed CatBoost model belonging to the ensemble learning has superiority in handling categorical variables and producing reliable results. As a case study, our proposed method is validated in a multi-dimensional dataset about Seattle's building energy performance provided by the city's government, aiming to estimate the weather normalized site energy use intensity of buildings and characterize its non-linear relationship with other 12 possible influential features. Results from the 5-fold cross-validation demonstrate that the model exhibits a strong ability in predicting the exact value of energy intensity precisely, which can even outperform popular machine learning algorithms including random forest and gradient boosting decision tree under R-2 of 0.897. Based on a defined threshold, these predicted values can be classified as the normal or abnormal energy consumption reaching an accuracy of 99.32% for outlier detection, which is helpful in alarming potential risks at an early stage and developing strategies to enhance the energy efficiency. Moreover, results from the established model can be interpreted objectively, suggesting that features concerning the physical and energy characteristics contribute more to energy estimation than environmental features. Since such results understand the building energy consumption and efficiency in a data-driven manner, they can eventually serve as guidance for building owners and designers in designing and renovating buildings to achieve better energy-conserving performance.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Multi-source data driven harmonic spectrum estimation of substation feeder current
    Chen, Xiangwei
    Yang, Chaoyun
    Zhang, Yi
    Zhu, Longyang
    Liu, Bijie
    Zhang, Liangyu
    Lin, Nan
    [J]. ENERGY REPORTS, 2024, 11 : 3492 - 3500
  • [32] Multi-source heterogeneous data storage methods for omnimedia data space
    Zhuo, Wenbo
    [J]. INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2024, 15 (3-4) : 314 - 322
  • [33] Data-Driven Predictive Control of Building Energy Consumption under the IoT Architecture
    Ke, Ji
    Qin, Yude
    Wang, Biao
    Yang, Shundong
    Wu, Hao
    Yang, Hang
    Zhao, Xing
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020 (2020):
  • [34] Research on large-scale clean energy optimal scheduling method based on multi-source data-driven
    Xiong, Chuanyu
    Xu, Lingfeng
    Ma, Li
    Hu, Pan
    Ye, Ziyong
    Sun, Jialun
    [J]. FRONTIERS IN ENERGY RESEARCH, 2024, 11
  • [35] SimbaQL: A Query Language for Multi-source Heterogeneous Data
    Li, Yuepeng
    Shen, Zhihong
    Li, Jianhui
    [J]. BIG SCIENTIFIC DATA MANAGEMENT, 2019, 11473 : 275 - 284
  • [36] An Integration Model of Multi-Source Heterogeneous Audit Data
    Li Chunqiang
    Chai Weiyan
    Chen Linan
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTRONIC SCIENCE AND AUTOMATION CONTROL, 2015, 20 : 262 - 266
  • [37] Multi-source onboard data-driven method for intelligent identification of subway track irregularities
    Peng, Fei
    Xie, Qinglin
    Tao, Gongquan
    Wen, Zefeng
    Ren, Yu
    [J]. Zhongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Central South University (Science and Technology), 2024, 55 (06): : 2432 - 2445
  • [38] Study on Multi-source Data-Driven Static Security Risk Assessment of Power Grids
    Li, Xinwei
    Wang, Chao
    Liu, Jiaxin
    Liu, Wansong
    Liu, Xiaoming
    Shi, Renwei
    Jiao, Zaibin
    Liu, Jun
    [J]. 2022 6TH INTERNATIONAL CONFERENCE ON POWER AND ENERGY ENGINEERING, ICPEE, 2022, : 220 - 225
  • [39] Multi-source data-driven approach for prediction of melt density during polymer compounding
    Zhang, Bin-Bin
    Chen, Zhu-Yun
    Zhang, Fei
    Jin, Gang
    [J]. POLYMER ENGINEERING AND SCIENCE, 2024, 64 (06): : 2627 - 2639
  • [40] A Review of Data-Driven Building Energy Prediction
    Liu, Huiheng
    Liang, Jinrui
    Liu, Yanchen
    Wu, Huijun
    [J]. BUILDINGS, 2023, 13 (02)