Occupancy prediction: A comparative study of static and MOTIF time series features using WiFi Syslog data

被引:0
|
作者
Abdelghani, Bassam A. [1 ]
Al Mohammad, Ahlam [1 ]
Dari, Jamal [1 ]
Maleki, Mina [1 ]
Banitaan, Shadi [1 ]
机构
[1] Univ Detroit Mercy, Dept Elect & Comp Engn & Comp Sci, Detroit, MI 48221 USA
关键词
Occupancy prediction; WI-FI; HVAC; Random forest; Stacking; Bagging; Blending; MOTIF;
D O I
10.1016/j.suscom.2024.101040
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Occupancy prediction has been the subject of ongoing research, employing various methods and data sources to improve occupancy prediction accuracy and energy efficiency in buildings. Precise occupancy prediction is crucial for optimizing energy usage, ensuring occupant comfort, and enhancing building management. With the increasing demand for intelligent building management systems, robust and accurate occupancy prediction models are becoming more critical. This study aims to predict building occupancy using WiFi Syslog files from three different datasets: an open-source dataset from the University of Massachusetts Dartmouth, a new locally collected dataset from the dental school at the University of Detroit Mercy, and finally, a dataset from an office building in Berkeley, California. Two types of features, static features, and MOTIF time series features, were extracted from the datasets to process and compare their performance in occupancy prediction. The first step of the proposed framework consisted of selecting the most suitable time range to compare occupancy prediction models between different datasets. It was concluded that this analysis was best conducted semester by semester. Multiple regression algorithms, such as random forest and LightGBM, were applied in the following step, along with advanced ensemble techniques, including stacking and blending, to assess the model. The stacking regression showed the best results for static features across all datasets. It achieved a Coefficient of Determination (R2) R 2 ) of 0.9540 in the first dataset, 0.9482 in the second, and 0.9977 in the third. For MOTIF features, however, the best algorithm depended on the dataset. All algorithms performed similarly in the first dataset, with R2 2 of 0.956. In contrast, LightGBM and the Stacking Regressor had better results than the others in the second dataset, with a low R2 2 of 0.531 due to dataset-specific differences. The stacking regression once again delivered the best results in the last dataset with an R2 2 of 0.9967.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Solar Flare Prediction Using Multivariate Time Series of Photospheric Magnetic Field Parameters: A Comparative Analysis of Vector, Time Series, and Graph Data Representations
    Vural, Onur
    Hamdi, Shah Muhammad
    Boubrahimi, Soukaina Filali
    REMOTE SENSING, 2025, 17 (06)
  • [22] Prediction of chaotic time series using recurrent neural networks and reservoir computing techniques: A comparative study
    Shahi, Shahrokh
    Fenton, Flavio H.
    Cherry, Elizabeth M.
    MACHINE LEARNING WITH APPLICATIONS, 2022, 8
  • [23] Cryptocurrency Price Prediction Using Time Series and Social Sentiment Data
    Pang, Yan
    Sundararaj, Ganeshkumar
    Ren, Jiewen
    BDCAT'19: PROCEEDINGS OF THE 6TH IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, 2019, : 35 - 42
  • [24] Inference and prediction of malaria transmission dynamics using time series data
    Shi, Benyun
    Lin, Shan
    Tan, Qi
    Cao, Jie
    Zhou, Xiaohong
    Xia, Shang
    Zhou, Xiao-Nong
    Liu, Jiming
    INFECTIOUS DISEASES OF POVERTY, 2020, 9 (01)
  • [25] Book Loan Quantity Prediction Using Time Series Data Mining
    Shi, Yuqing
    Zhu, Yuelong
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTROMECHANICAL CONTROL TECHNOLOGY AND TRANSPORTATION, 2015, 41 : 452 - 455
  • [26] Time Series Data Prediction using IoT and Machine Learning Technique
    Kumar, Raghavendra
    Kumar, Pardeep
    Kumar, Yugal
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 373 - 381
  • [27] Prediction of vegetation dynamics using NDVI time series data and LSTM
    Reddy D.S.
    Prasad P.R.C.
    Modeling Earth Systems and Environment, 2018, 4 (1) : 409 - 419
  • [28] Inference and prediction of malaria transmission dynamics using time series data
    Benyun Shi
    Shan Lin
    Qi Tan
    Jie Cao
    Xiaohong Zhou
    Shang Xia
    Xiao-Nong Zhou
    Jiming Liu
    Infectious Diseases of Poverty, 9
  • [29] Inference and prediction of malaria transmission dynamics using time series data
    Shi Benyun
    Lin Shan
    Tan Qi
    Cao Jie
    Zhou Xiaohong
    Xia Shang
    Zhou XiaoNong
    Liu Jiming
    贫困所致传染病(英文), 2020, 09 (04) : 84 - 96
  • [30] Workload Prediction over Cloud Server using Time Series Data
    Yadav, Mahendra Pratap
    Pal, Nisha
    Yadav, Dharmendar Kumar
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 267 - 272