Occupancy prediction: A comparative study of static and MOTIF time series features using WiFi Syslog data

被引:0
|
作者
Abdelghani, Bassam A. [1 ]
Al Mohammad, Ahlam [1 ]
Dari, Jamal [1 ]
Maleki, Mina [1 ]
Banitaan, Shadi [1 ]
机构
[1] Univ Detroit Mercy, Dept Elect & Comp Engn & Comp Sci, Detroit, MI 48221 USA
关键词
Occupancy prediction; WI-FI; HVAC; Random forest; Stacking; Bagging; Blending; MOTIF;
D O I
10.1016/j.suscom.2024.101040
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Occupancy prediction has been the subject of ongoing research, employing various methods and data sources to improve occupancy prediction accuracy and energy efficiency in buildings. Precise occupancy prediction is crucial for optimizing energy usage, ensuring occupant comfort, and enhancing building management. With the increasing demand for intelligent building management systems, robust and accurate occupancy prediction models are becoming more critical. This study aims to predict building occupancy using WiFi Syslog files from three different datasets: an open-source dataset from the University of Massachusetts Dartmouth, a new locally collected dataset from the dental school at the University of Detroit Mercy, and finally, a dataset from an office building in Berkeley, California. Two types of features, static features, and MOTIF time series features, were extracted from the datasets to process and compare their performance in occupancy prediction. The first step of the proposed framework consisted of selecting the most suitable time range to compare occupancy prediction models between different datasets. It was concluded that this analysis was best conducted semester by semester. Multiple regression algorithms, such as random forest and LightGBM, were applied in the following step, along with advanced ensemble techniques, including stacking and blending, to assess the model. The stacking regression showed the best results for static features across all datasets. It achieved a Coefficient of Determination (R2) R 2 ) of 0.9540 in the first dataset, 0.9482 in the second, and 0.9977 in the third. For MOTIF features, however, the best algorithm depended on the dataset. All algorithms performed similarly in the first dataset, with R2 2 of 0.956. In contrast, LightGBM and the Stacking Regressor had better results than the others in the second dataset, with a low R2 2 of 0.531 due to dataset-specific differences. The stacking regression once again delivered the best results in the last dataset with an R2 2 of 0.9967.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Effective Frequent Motif Discovery for Long Time Series Classification: A Study using Phonocardiogram
    Alhijailan, Hajar
    Coenen, Frans
    KDIR: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL 1: KDIR, 2019, : 266 - 273
  • [42] Travel Time Prediction and Explanation with Spatio-Temporal Features: A Comparative Study
    Ahmed, Irfan
    Kumara, Indika
    Reshadat, Vahideh
    Kayes, A. S. M.
    van den Heuvel, Willem-Jan
    Tamburri, Damian A.
    ELECTRONICS, 2022, 11 (01)
  • [43] Analyzing big time series data in solar engineering using features and PCA
    Yang, Dazhi
    Dong, Zibo
    Lim, Li Hong I.
    Liu, Licheng
    SOLAR ENERGY, 2017, 153 : 317 - 328
  • [44] Using Autoregressive Integrated Moving Average (ARIMA) for Prediction of Time Series Data
    Borkin, Dmitrii
    Nemeth, Martin
    Nemethova, Andrea
    INTELLIGENT SYSTEMS APPLICATIONS IN SOFTWARE ENGINEERING, VOL 1, 2019, 1046 : 470 - 476
  • [45] Time Series Prediction of Debian Bug Data Using Autoregressive Neural Network
    Pati, Jayadeep
    Shukla, K. K.
    2013 4TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER & COMMUNICATION TECHNOLOGY (ICCCT), 2013, : 110 - 115
  • [46] Time Series Method for Machine Performance Prediction Using Condition Monitoring Data
    Sarwar, Umair
    Muhammad, Masdi B.
    Karim, Z. A. Abdul
    2014 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATIONS, AND CONTROL TECHNOLOGY (I4CT), 2014, : 394 - 398
  • [47] Prediction of Cryptocurrency Price using Time Series Data and Deep Learning Algorithms
    Nair, Michael
    Marie, Mohamed I.
    Abd-Elmegid, Laila A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (08) : 338 - 347
  • [48] Prediction of Bitcoin Prices with Machine Learning Methods using Time Series Data
    Karasu, Seckin
    Altan, Aytac
    Sarac, Zehra
    Hacioglu, Rifat
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [49] AN EFFECTIVE PREDICTION SYSTEM FOR TIME SERIES DATA USING PATTERN MATCHING ALGORITHMS
    Sridevi, S.
    Parthasarathy, S.
    Rajaram, S.
    INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING-THEORY APPLICATIONS AND PRACTICE, 2018, 25 (02): : 123 - 136
  • [50] Multivariate Time Series Prediction of Pediatric ICU data using Deep Learning
    Adiba, Farzana Islam
    Sharwardy, Sharmin Nahar
    Rahman, Mohammad Zahidur
    2021 INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN INFORMATION TECHNOLOGY (ICITIIT), 2021,