Occupancy prediction: A comparative study of static and MOTIF time series features using WiFi Syslog data

被引:0
|
作者
Abdelghani, Bassam A. [1 ]
Al Mohammad, Ahlam [1 ]
Dari, Jamal [1 ]
Maleki, Mina [1 ]
Banitaan, Shadi [1 ]
机构
[1] Univ Detroit Mercy, Dept Elect & Comp Engn & Comp Sci, Detroit, MI 48221 USA
关键词
Occupancy prediction; WI-FI; HVAC; Random forest; Stacking; Bagging; Blending; MOTIF;
D O I
10.1016/j.suscom.2024.101040
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Occupancy prediction has been the subject of ongoing research, employing various methods and data sources to improve occupancy prediction accuracy and energy efficiency in buildings. Precise occupancy prediction is crucial for optimizing energy usage, ensuring occupant comfort, and enhancing building management. With the increasing demand for intelligent building management systems, robust and accurate occupancy prediction models are becoming more critical. This study aims to predict building occupancy using WiFi Syslog files from three different datasets: an open-source dataset from the University of Massachusetts Dartmouth, a new locally collected dataset from the dental school at the University of Detroit Mercy, and finally, a dataset from an office building in Berkeley, California. Two types of features, static features, and MOTIF time series features, were extracted from the datasets to process and compare their performance in occupancy prediction. The first step of the proposed framework consisted of selecting the most suitable time range to compare occupancy prediction models between different datasets. It was concluded that this analysis was best conducted semester by semester. Multiple regression algorithms, such as random forest and LightGBM, were applied in the following step, along with advanced ensemble techniques, including stacking and blending, to assess the model. The stacking regression showed the best results for static features across all datasets. It achieved a Coefficient of Determination (R2) R 2 ) of 0.9540 in the first dataset, 0.9482 in the second, and 0.9977 in the third. For MOTIF features, however, the best algorithm depended on the dataset. All algorithms performed similarly in the first dataset, with R2 2 of 0.956. In contrast, LightGBM and the Stacking Regressor had better results than the others in the second dataset, with a low R2 2 of 0.531 due to dataset-specific differences. The stacking regression once again delivered the best results in the last dataset with an R2 2 of 0.9967.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Prediction Technique for Time Series Data Sets Using Regression Models
    Sagar, Pinki
    Gupta, Prinima
    Kashyap, Indu
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2018, PT I, 2019, 955 : 480 - 488
  • [32] Data partition and variable selection for time series prediction using wrappers
    Puma-Villanueva, Wilfredo J.
    dos Santos, Euripedes P.
    Von Zuben, Fernando J.
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 4740 - +
  • [33] DASMcC: Data Augmented SMOTE Multi-Class Classifier for Prediction of Cardiovascular Diseases Using Time Series Features
    Sinha, Nidhi
    Kumar, M. A. Ganesh
    Joshi, Amit M.
    Cenkeramaddi, Linga Reddy
    IEEE ACCESS, 2023, 11 : 117643 - 117655
  • [34] Ranking Prediction for Time-series Data using Learning to Rank (Case Study: Top Mobile Games Prediction)
    Ramadhan, Alfian
    Khodra, Masayu Leylia
    2014 INTERNATIONAL CONFERENCE OF ADVANCED INFORMATICS: CONCEPT, THEORY AND APPLICATION (ICAICTA), 2014, : 214 - 219
  • [35] Comparative Analysis of Time Series Prediction Algorithms on Multiple Network Function Data of NWDAF
    Chen, Dasheng
    Song, Qi
    Zhang, Yinbin
    Li, Ling
    Yang, Zhiming
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2024, 2024
  • [36] Modeling and Prediction of Time-Series-A Case Study with Forex Data
    Shiao, Yi Chen
    Chakraborty, Goutam
    Chen, Shin Fu
    Li, Li Hua
    Chen, Rung Ching
    2019 IEEE 10TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2019), 2019, : 177 - 181
  • [37] Effect of data structure for time series prediction using time delay neural network
    Wang, XK
    Lu, WZ
    Cao, SY
    Computational Mechanics, Proceedings, 2004, : 721 - 727
  • [38] Study on maize yield prediction using time series analysis
    Zhao, Yueling
    Han, Haiyan
    Cao, Liying
    Chen, Guifen
    MODERN TECHNOLOGIES IN MATERIALS, MECHANICS AND INTELLIGENT SYSTEMS, 2014, 1049 : 1463 - +
  • [39] Comparative Study on Data Editing Techniques for Fatigue Time Series Signals
    Nopiah, Z. M.
    Abdullah, S.
    Baharin, M. N.
    Putra, T. E.
    Sahadan, S. N.
    Willis, K. O.
    ADVANCES IN SUPERALLOYS, PTS 1 AND 2, 2011, 146-147 : 1681 - 1684
  • [40] A Comparative Study of Federated Versus Centralized Learning for Time Series Prediction in IoT
    Furtado, Lia Sucupira
    da Costa, Leonardo Ferreira
    Goncalves Rocha, Paulo Henrique
    Leal Rego, Paulo Antonio
    INTELLIGENT SYSTEMS, PT II, 2022, 13654 : 297 - 311