Occupancy prediction: A comparative study of static and MOTIF time series features using WiFi Syslog data

被引:0
|
作者
Abdelghani, Bassam A. [1 ]
Al Mohammad, Ahlam [1 ]
Dari, Jamal [1 ]
Maleki, Mina [1 ]
Banitaan, Shadi [1 ]
机构
[1] Univ Detroit Mercy, Dept Elect & Comp Engn & Comp Sci, Detroit, MI 48221 USA
关键词
Occupancy prediction; WI-FI; HVAC; Random forest; Stacking; Bagging; Blending; MOTIF;
D O I
10.1016/j.suscom.2024.101040
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Occupancy prediction has been the subject of ongoing research, employing various methods and data sources to improve occupancy prediction accuracy and energy efficiency in buildings. Precise occupancy prediction is crucial for optimizing energy usage, ensuring occupant comfort, and enhancing building management. With the increasing demand for intelligent building management systems, robust and accurate occupancy prediction models are becoming more critical. This study aims to predict building occupancy using WiFi Syslog files from three different datasets: an open-source dataset from the University of Massachusetts Dartmouth, a new locally collected dataset from the dental school at the University of Detroit Mercy, and finally, a dataset from an office building in Berkeley, California. Two types of features, static features, and MOTIF time series features, were extracted from the datasets to process and compare their performance in occupancy prediction. The first step of the proposed framework consisted of selecting the most suitable time range to compare occupancy prediction models between different datasets. It was concluded that this analysis was best conducted semester by semester. Multiple regression algorithms, such as random forest and LightGBM, were applied in the following step, along with advanced ensemble techniques, including stacking and blending, to assess the model. The stacking regression showed the best results for static features across all datasets. It achieved a Coefficient of Determination (R2) R 2 ) of 0.9540 in the first dataset, 0.9482 in the second, and 0.9977 in the third. For MOTIF features, however, the best algorithm depended on the dataset. All algorithms performed similarly in the first dataset, with R2 2 of 0.956. In contrast, LightGBM and the Stacking Regressor had better results than the others in the second dataset, with a low R2 2 of 0.531 due to dataset-specific differences. The stacking regression once again delivered the best results in the last dataset with an R2 2 of 0.9967.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Wildfire Occurrence Prediction Using Time Series Classification: A Comparative Study
    Laube, Ryan
    Hamilton, Howard J.
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4178 - 4182
  • [2] Queuing Time Prediction Using WiFi Positioning Data in an Indoor Scenario
    Shu, Hua
    Song, Ci
    Pei, Tao
    Xu, Lianming
    Ou, Yang
    Zhang, Libin
    Li, Tao
    SENSORS, 2016, 16 (11)
  • [3] Incomplete Time Series Prediction Using Max-Margin Classification of Data with Absent Features
    Shang Zhaowei
    Zhang Lingfeng
    Ma Shangjun
    Fang Bin
    Zhang Taiping
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2010, 2010
  • [4] Importance of data preprocessing in time series prediction using SARIMA: A case study
    Adineh, Amir Hossein
    Narimani, Zahra
    Satapathy, Suresh Chandra
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2020, 24 (04) : 331 - 342
  • [5] Static Model and Neural Networks in the Prediction of Price using Time Series
    Manrique Rojas, Esperanza
    Ramirez Ramirez, Margarita
    Marquez Lobato, Bogart Yail
    Ramirez Moreno, Hilda Beatriz
    Salgado Soto, Maria del Consuelo
    Vazquez Nunez, Sergio Octavio
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [6] Time series forecasting with neural networks: A comparative study using the airline data
    Applied Statistics. Journal of the Royal Statistical Society Series C, 47 (pt 2):
  • [7] Time series forecasting with neural networks: A comparative study using the airline data
    Faraway, J
    Chatfield, C
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 1998, 47 : 231 - 250
  • [8] Flood prediction using Time Series Data Mining
    Damle, Chaitanya
    Yalcin, Ali
    JOURNAL OF HYDROLOGY, 2007, 333 (2-4) : 305 - 316
  • [9] Prediction of arrhythmia using multivariate time series data
    Lee, Minhai
    Noh, Hohsuk
    KOREAN JOURNAL OF APPLIED STATISTICS, 2019, 32 (05) : 671 - 681
  • [10] Data-driven Runway Occupancy Time Prediction using Decision Trees
    Chow, Hong Wei
    Lim, Zhi Jun
    Alam, Sameer
    2021 IEEE/AIAA 40TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2021,