Flash flood susceptibility modeling using an optimized fuzzy rule based feature selection technique and tree based ensemble methods

被引:188
|
作者
Dieu Tien Bui [1 ]
Tsangaratos, Paraskevas [2 ,3 ]
Phuong-Thao Thi Ngo [4 ]
Tien Dat Pham [5 ]
Binh Thai Pham [6 ]
机构
[1] Duy Tan Univ, Inst Res & Dev, Da Nang 550000, Vietnam
[2] Ton Duc Thang Univ, Geog Informat Sci Res Grp, Ho Chi Minh City, Vietnam
[3] Ton Duc Thang Univ, Fac Environm & Labour Safety, Ho Chi Minh City, Vietnam
[4] Hanoi Univ Min & Geol, Fac Informat Technol, Dept Geoinformat, 18 Pho Vien, Hanoi, Vietnam
[5] RIKEN, Geoinformat Unit, Ctr Adv Intelligence Project AIP, Chuo Ku, Mitsui Bldg,15th Floor,1-4-1 Nihonbashi, Tokyo 1030027, Japan
[6] Univ Transport Technol, Geotech Engn & Artificial Intelligence Res Grp GE, Hano, Vietnam
关键词
FURIA; Genetic algorithms; Bagging and boosting models; Flash flood susceptibility; Vietnam; DATA MINING TECHNIQUES; ADAPTIVE NEURO-FUZZY; WEIGHTS-OF-EVIDENCE; LANDSLIDE SUSCEPTIBILITY; SPATIAL PREDICTION; FREQUENCY RATIO; RANDOM-FOREST; LOGISTIC-REGRESSION; STATISTICAL-MODELS; DECISION TREE;
D O I
10.1016/j.scitotenv.2019.02.422
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The main objective of the present study was to provide a novel methodological approach for flash flood susceptibility modeling based on a feature selection method (FSM) and tree based ensemble methods. The FSM, used a fuzzy rule based algorithm FURIA, as attribute evaluator, whereas GA were used as the search method, in order to obtain optimal set of variables used in flood susceptibility modeling assessments. The novel FURIA-GA was combined with LogitBoost, Bagging and AdaBoost ensemble algorithms. The performance of the developed methodology was evaluated at the Bao Yen district and the Bac Ha district of Lao Cai Province in the Northeast region of Vietnam. For the case study, 654 floods and twelve geo-environmental variables were used. The predictive performance of each model was estimated through the calculation of the classification accuracy, the sensitivity, the specificity, the success and predictive rate curve and the area under the curves (AUC). The FURIA-GA FSM compared to a conventional rule based method gave more accurate predictive results. Also, the FURIA-GA based models, presented higher learning and predictive ability compared to the ensemble models that had not undergone a FSM. Based on the predictive classification accuracy, FURIA-GA-Bagging (93.37%) outperformed FURIA-GA-LogitBoost (92.35%) and FURIA-GA-AdaBoost (89.03%). FURIA-GA-Bagging showed also the highest sensitivity (96.94%) and specificity (89.80%). On the other hand, the FURIA-GA-LogitBoost showed the lowest percentage in very high susceptible zone and the highest relative flash-flood density, whereas the FURIA-GA-AdaBoost achieved the highest prediction AUC value (0.9740), based on the prediction rate curve, followed by FIJRIA-GABagging (0.9566), and FURIA-GA-LogitBoost (0.8955). It can be concluded that the usage of different statistical metrics, provides different outcomes concerning the best prediction model, which mainly could be attributed to sites specific settings. The proposed models could be considered as a novel alternative investigation tools appropriate for flash flood susceptibility mapping. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:1038 / 1054
页数:17
相关论文
共 50 条
  • [41] An effective ensemble classification framework using random forests and a correlation based feature selection technique
    Chutia, Dibyajyoti
    Bhattacharyya, Dhruba Kumar
    Sarma, Jaganath
    Raju, Penumetcha Narasa Lakshmi
    TRANSACTIONS IN GIS, 2017, 21 (06) : 1165 - 1178
  • [42] An evaluation of ensemble methods in handwritten word recognition based on feature selection
    Günter, S
    Bunke, H
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, 2004, : 388 - 392
  • [43] Fuzzy Decision Tree using Soft Discretization and a Genetic Algorithm based Feature Selection Method
    Chen, Min
    Ludwig, Simone A.
    2013 WORLD CONGRESS ON NATURE AND BIOLOGICALLY INSPIRED COMPUTING (NABIC), 2013, : 238 - 244
  • [44] Classification of lung cancer using ensemble-based feature selection and machine learning methods
    Cai, Zhihua
    Xu, Dong
    Zhang, Qing
    Zhang, Jiexia
    Ngai, Sai-Ming
    Shao, Jianlin
    MOLECULAR BIOSYSTEMS, 2015, 11 (03) : 791 - 800
  • [45] Ensemble feature selection using distance-based supervised and unsupervised methods in binary classification
    Hallajian, Bita
    Motameni, Homayun
    Akbari, Ebrahim
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 200
  • [46] Skin disease prediction using ensemble methods and a new hybrid feature selection technique
    Anurag Kumar Verma
    Saurabh Pal
    B. B. Tiwari
    Iran Journal of Computer Science, 2020, 3 (4) : 207 - 216
  • [47] A feature ranking technique based on interclass separability for fuzzy modeling
    Tikk, Domonkos
    Wong, Kok Wai
    ICCC 2007: 5TH IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL CYBERNETICS, PROCEEDINGS, 2007, : 251 - +
  • [48] Assessing flood vulnerability using a rule-based fuzzy system
    Yazdi, J.
    Neyshabouri, S. A. A. S.
    WATER SCIENCE AND TECHNOLOGY, 2012, 66 (08) : 1766 - 1773
  • [49] PSO Based Optimized Ensemble Learning and Feature Selection Approach for Efficient Energy Forecast
    Shafqat, Wafa
    Malik, Sehrish
    Lee, Kyu-Tae
    Kim, Do-Hyeun
    ELECTRONICS, 2021, 10 (18)
  • [50] Revisiting evolutionary algorithms in feature selection and nonfuzzy/fuzzy rule based classification
    Dehuri, Satchidananda
    Ghosh, Ashish
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 3 (02) : 83 - 108