Research on Rain Pattern Classification Based on Machine Learning: A Case Study in Pi River Basin

被引:3
|
作者
Fu, Xiaodi [1 ,2 ,3 ]
Kan, Guangyuan [1 ,2 ,3 ]
Liu, Ronghua [1 ,2 ,3 ]
Liang, Ke [4 ]
He, Xiaoyan [1 ,2 ,3 ]
Ding, Liuqian [1 ,2 ]
机构
[1] State Key Lab Simulat & Regulat Water Cycle River, Beijing 100038, Peoples R China
[2] China Inst Water Resources & Hydropower Res, Beijing 100038, Peoples R China
[3] Minist Water Resources, Res Ctr Flood & Drought Disaster Reduct, Beijing 100038, Peoples R China
[4] Beijing IWHR Corp, Beijing 100048, Peoples R China
基金
中国国家自然科学基金;
关键词
rain patterns; distribution over time characteristics; dynamic time planning; LightGBM; LSTM; Decision Tree; SVM;
D O I
10.3390/w15081570
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
For the purpose of improving the scientific nature, reliability, and accuracy of flood forecasting, it is an effective and practical way to construct a flood forecasting scheme and carry out real-time forecasting with consideration of different rain patterns. The technique for rain pattern classification is of great significance in the above-mentioned technical roadmap. With the rapid development of artificial intelligence technologies such as machine learning, it is possible and necessary to apply these new methods to assist rain classification applications. In this research, multiple machine learning methods were adopted to study the time-history distribution characteristics and conduct rain pattern classification from observed rainfall time series data. Firstly, the hourly rainfall data between 2003 and 2021 of 37 rain gauge stations in the Pi River Basin were collected to classify rain patterns based on the universally acknowledged dynamic time warping (DTW) algorithm, and the classifications were treated as the benchmark result. After that, four other machine learning methods, including the Decision Tree (DT), Long- and Short-Term Memory (LSTM) neural network, Light Gradient Boosting Machine (LightGBM), and Support Vector Machine (SVM), were specifically selected to establish classification models and the model performances were compared. By adjusting the sampling size, the influence of different sizes on the classification was analyzed. Intercomparison results indicated that LightGBM achieved the highest accuracy and the fastest training speed, the accuracy and F-1 score were 98.95% and 98.58%, respectively, and the loss function and accuracy converged quickly after only 20 iterations. LSTM and SVM have satisfactory accuracy but relatively low training efficiency, and DT has fast classification speed but relatively low accuracy. With the increase in the sampling size, classification results became stable and more accurate. Besides the higher accuracy, the training efficiency of the four methods was also improved.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Water Quality Index Classification Based on Machine Learning: A Case from the Langat River Basin Model
    Shamsuddin, Illa Iza Suhana
    Othman, Zalinda
    Sani, Nor Samsiah
    [J]. WATER, 2022, 14 (19)
  • [2] Research on Pulsar Classification Based on Machine Learning
    Chen, Zhiyu
    Xu, Aiting
    Zhou, Yingying
    Gai, Yingjie
    [J]. 2020 THE 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS'2020), 2020, : 14 - 18
  • [3] Precipitation Interception Modelling Using Machine Learning Methods - The Dragonja River Basin Case Study
    Stravs, L.
    Brilly, M.
    Sraj, M.
    [J]. PRACTICAL HYDROINFORMATICS: COMPUTATIONAL INTELLIGENCE AND TECHNOLOGICAL DEVELOPMENTS IN WATER APPLICATIONS, 2008, 68 : 347 - 358
  • [4] Suitability of curvature as a feature for image-based pattern recognition: a case study on leaf image classification based on machine learning
    Aditi Ghosh
    Parthajit Roy
    Paramartha Dutta
    [J]. Soft Computing, 2024, 28 : 5709 - 5720
  • [5] Suitability of curvature as a feature for image-based pattern recognition: a case study on leaf image classification based on machine learning
    Ghosh, Aditi
    Roy, Parthajit
    Dutta, Paramartha
    [J]. SOFT COMPUTING, 2024, 28 (06) : 5709 - 5720
  • [6] A Pinball loss based Extreme Learning Machine for pattern classification
    Anand, Pritam
    Rastogi, Reshma
    Bharti, Amisha
    [J]. 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 2212 - 2215
  • [7] Image-based Candlestick Pattern Classification with Machine Learning
    Xu, Chenghan
    [J]. PROCEEDINGS OF 2021 6TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES (ICMLT 2021), 2021, : 26 - 33
  • [8] Machine Learning based Pattern Recognition and Classification Framework Development
    Omarov, Batyrkhan
    Cho, Young Im
    [J]. 2017 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2017,
  • [9] Machine-Learning-Based Precipitation Reconstructions: A Study on Slovenia's Sava River Basin
    Molina, Abel Andres Ramirez
    Bezak, Nejc
    Tootle, Glenn
    Wang, Chen
    Gong, Jiaqi
    [J]. HYDROLOGY, 2023, 10 (11)
  • [10] Classification and Research of Skin Lesions Based on Machine Learning
    Liu, Jian
    Wang, Wantao
    Chen, Jie
    Sun, Guozhong
    Yang, Alan
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 62 (03): : 1187 - 1200