Using data mining techniques for bike sharing demand prediction in metropolitan city

被引:74
|
作者
Sathishkumar, V. E. [1 ]
Park, Jangwoo [1 ]
Cho, Yongyun [1 ]
机构
[1] Sunchon Natl Univ, Dept Informat & Commun Engn, Suncheon Si, South Korea
关键词
Data mining; Predictive analytics; Public bikes; Regression; Bike sharing demand; FORESTS; TRENDS;
D O I
10.1016/j.comcom.2020.02.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Currently Rental bikes are introduced in many urban cities for the enhancement of mobility comfort. It is important to make the rental bike available and accessible to the public at the right time as it lessens the waiting time. Eventually, providing the city with a stable supply of rental bikes becomes a major concern. The crucial part is the prediction of bike count required at each hour for the stable supply of rental bikes. A Data mining technique is employed for overcoming the hurdles for the prediction of hourly rental bike demand. This paper discusses the models for hourly rental bike demand prediction. Data used include weather information (Temperature, Humidity, Windspeed, Visibility, Dewpoint, Solar radiation, Snowfall, Rainfall), the number of bikes rented per hour and date information. The paper also explores an filtering of features approach to eliminate the parameters which are not predictive and ranks the features based on its prediction performance. Five Statistical regression models were trained with their best hyperparameters using repeated cross-validation and the performance is evaluated using a testing set (a) Linear Regression (b) Gradient Boosting Machine (c) Support Vector Machine (Radial Basis Function Kernel) (d) Boosted Trees, and (e) Extreme Gradient Boosting Trees. When all the predictors are employed, the best model Gradient Boosting Machine can give the best and highest R-2 value of 0.96 in the training set and 0.92 in the test set. Furthermore, several analyzes are carried out in Gradient Boosting Machine with different combinations of predictors to identify the most significant predictors and the relationships between them.
引用
收藏
页码:353 / 366
页数:14
相关论文
共 50 条
  • [31] Prediction of Sugarcane Diseases using Data Mining Techniques
    Beulah, R.
    Punithavalli, M.
    2016 IEEE INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER APPLICATIONS (ICACA), 2016, : 393 - 396
  • [32] Prediction of heart disease using data mining techniques
    Ritika Chadha
    Shubhankar Mayank
    CSI Transactions on ICT, 2016, 4 (2-4) : 193 - 198
  • [33] Prediction Of Soil Accuracy Using Data Mining Techniques
    Bhanudas, Deone Jyoti
    Afreen, Khan Rahat
    2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2019,
  • [34] Corporate bankruptcy prediction using data mining techniques
    Santos, M. F.
    Cortez, P.
    Pereira, J.
    Quintela, H.
    DATA MINING VII: DATA, TEXT AND WEB MINING AND THEIR BUSINESS APPLICATIONS, 2006, 37 : 349 - +
  • [35] Diabetes prediction model using data mining techniques
    Rastogi R.
    Bansal M.
    Measurement: Sensors, 2023, 25
  • [36] Students Performance Prediction Using Data Mining Techniques
    Kumar, Rajesh T.
    Vamsidhar, T.
    Harika, B.
    Kumar, Madan T.
    Nissy, R.
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT SUSTAINABLE SYSTEMS (ICISS 2019), 2019, : 407 - 411
  • [37] Prediction of thyroid Disease Using Data Mining Techniques
    Begum, Bibi Amina
    Parkavi, A.
    2019 5TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2019, : 342 - 345
  • [38] Thyroid prediction using ensemble data mining techniques
    Yadav D.C.
    Pal S.
    International Journal of Information Technology, 2022, 14 (3) : 1273 - 1283
  • [39] Prediction of Stroke using Data Mining Classification Techniques
    Almadani, Ohoud
    Alshammari, Riyad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (01) : 457 - 460
  • [40] Prediction of Heart Attacks using Data Mining Techniques
    Abdelghani, Bassam A.
    Fadal, Sophia
    Bedoor, Shadi
    Banitaan, Shadi
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 951 - 956