Using data mining techniques for bike sharing demand prediction in metropolitan city

被引:74
|
作者
Sathishkumar, V. E. [1 ]
Park, Jangwoo [1 ]
Cho, Yongyun [1 ]
机构
[1] Sunchon Natl Univ, Dept Informat & Commun Engn, Suncheon Si, South Korea
关键词
Data mining; Predictive analytics; Public bikes; Regression; Bike sharing demand; FORESTS; TRENDS;
D O I
10.1016/j.comcom.2020.02.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Currently Rental bikes are introduced in many urban cities for the enhancement of mobility comfort. It is important to make the rental bike available and accessible to the public at the right time as it lessens the waiting time. Eventually, providing the city with a stable supply of rental bikes becomes a major concern. The crucial part is the prediction of bike count required at each hour for the stable supply of rental bikes. A Data mining technique is employed for overcoming the hurdles for the prediction of hourly rental bike demand. This paper discusses the models for hourly rental bike demand prediction. Data used include weather information (Temperature, Humidity, Windspeed, Visibility, Dewpoint, Solar radiation, Snowfall, Rainfall), the number of bikes rented per hour and date information. The paper also explores an filtering of features approach to eliminate the parameters which are not predictive and ranks the features based on its prediction performance. Five Statistical regression models were trained with their best hyperparameters using repeated cross-validation and the performance is evaluated using a testing set (a) Linear Regression (b) Gradient Boosting Machine (c) Support Vector Machine (Radial Basis Function Kernel) (d) Boosted Trees, and (e) Extreme Gradient Boosting Trees. When all the predictors are employed, the best model Gradient Boosting Machine can give the best and highest R-2 value of 0.96 in the training set and 0.92 in the test set. Furthermore, several analyzes are carried out in Gradient Boosting Machine with different combinations of predictors to identify the most significant predictors and the relationships between them.
引用
收藏
页码:353 / 366
页数:14
相关论文
共 50 条
  • [1] Seoul bike trip duration prediction using data mining techniques
    Sathishkumar, V. E.
    Park, Jangwoo
    Cho, Yongyun
    IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (11) : 1465 - 1474
  • [2] Rainfall Prediction in Lahore City using Data Mining Techniques
    Aftab, Shabib
    Ahmad, Munir
    Hameed, Noureen
    Bashir, Muhammad Salman
    Ali, Iftikhar
    Nawaz, Zahid
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (04) : 254 - 260
  • [3] A rule-based model for Seoul Bike sharing demand prediction using weather data
    Sathishkumar, V. E.
    Cho, Yongyun
    EUROPEAN JOURNAL OF REMOTE SENSING, 2020, 53 (sup1) : 166 - 183
  • [4] Excess demand prediction for bike sharing systems
    Liu, Xin
    Pelechrinis, Konstantinos
    PLOS ONE, 2021, 16 (06):
  • [5] Bike Sharing Demand Prediction Using Multiheaded Convolution Neural Networks
    Sathishkumar, V. E.
    Agrawal, Priyum
    Park, Jangwoo
    Cho, Yongyun
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 264 - 265
  • [6] A Quantum Bayesian Approach for Bike Sharing Demand Prediction
    Harikrishnakumar, Ramkumar
    Borujeni, Sima E.
    Dand, Alok
    Nannapaneni, Saideep
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2401 - 2409
  • [7] Prediction of bike-sharing station demand using explainable artificial intelligence
    Ngeni, Frank
    Kutela, Boniphace
    Chengula, Tumlumbe Juliana
    Ruseruka, Cuthbert
    Musau, Hannah
    Novat, Norris
    Indah, Debbie Aisiana
    Kasomi, Sarah
    MACHINE LEARNING WITH APPLICATIONS, 2024, 17
  • [8] Capturing the conditions that introduce systematic variation in bike-sharing travel behavior using data mining techniques
    Bordagaray, Maria
    dell'Olio, Luigi
    Fonzone, Achille
    Ibeas, Angel
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2016, 71 : 231 - 248
  • [9] Prediction bike-sharing demand with gradient boosting methods
    Aydin, Zeliha Ergul
    Erdem, Banu Icmen
    Cicek, Zeynep Idil Erzurum
    PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2023, 29 (08): : 824 - 832
  • [10] Central Station Based Demand Prediction in a Bike Sharing System
    Huang, Jianbin
    Wang, Xiangyu
    Sun, Heli
    2019 20TH INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2019), 2019, : 346 - 348