A COMPARATIVE STUDY OF DATA SAMPLING TECHNIQUES FOR CONSTRUCTING NEURAL NETWORK ENSEMBLES

被引：19

作者：

Akhand, M. A. H. ^{[1
]}

Islam, M. D. Monirul ^{[1
]}

Murase, Kazuyuki ^{[1
,2
]}

机构：

[1] Univ Fukui, Grad Sch Engn, Fukui 9108507, Japan

[2] Univ Fukui, Res & Educ Program Life Sci, Fukui 9108507, Japan

来源：

INTERNATIONAL JOURNAL OF NEURAL SYSTEMS | 2009年 / 19卷 / 02期

关键词：

Neural network ensemble; generalization; diversity; bagging; boosting; negative correlation learning; random subspace method; GRADIENT LEARNING ALGORITHM; CLASSIFICATION;

D O I：

10.1142/S0129065709001859

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Ensembles with several classifiers (such as neural networks or decision trees) are widely used to improve the generalization performance over a single classifier. Proper diversity among component classifiers is considered an important parameter for ensemble construction so that failure of one may be compensated by others. Among various approaches, data sampling, i.e., different data sets for different classifiers, is found more effective than other approaches. A number of ensemble methods have been proposed under the umbrella of data sampling in which some are constrained to neural networks or decision trees and others are commonly applicable to both types of classifiers. We studied prominent data sampling techniques for neural network ensembles, and then experimentally evaluated their effectiveness on a common test ground. Based on overlap and uncover, the relation between generalization and diversity is presented. Eight ensemble methods were tested on 30 benchmark classification problems. We found that bagging and boosting, the pioneer ensemble methods, are still better than most of the other proposed methods. However, negative correlation learning that implicitly encourages different networks to different training spaces is shown as better or at least comparable to bagging and boosting that explicitly create different training spaces.

引用

页码：67 / 89

页数：23

共 50 条

[41] Diversity and Generalization in Neural Network Ensembles
Ortega, Luis A.
Cabanas, Rafael
Masegosa, Andres R.
[J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
[42] Neural Network Ensembles in Reinforcement Learning
Stefan Faußer
Friedhelm Schwenker
[J]. Neural Processing Letters, 2015, 41 : 55 - 69
[43] Lithology recognition by neural network ensembles
dos Santos, RV
Artola, F
da Fontoura, S
Vellasco, M
[J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 2507 : 302 - 312
[44] Model clustering for neural network ensembles
Bakker, B
Heskes, T
[J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 383 - 388
[45] Neural network ensembles for intrusion detection
Golovko, Vladimir
Kachurka, Pavel
Vaitsekhovich, Leanid
[J]. IDAACS 2007: PROCEEDINGS OF THE 4TH IEEE WORKSHOP ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS, 2007, : 578 - 583
[46] Tissue classification using gene expression data and artificial neural network ensembles
Lu, Huijuan
Zhang, Jinxiang
Zhang, Lei
[J]. COMPUTATIONAL INTELLIGENCE AND BIOINFORMATICS, PT 3, PROCEEDINGS, 2006, 4115 : 792 - 800
[47] From Designing A Single Neural Network to Designing Neural Network Ensembles
Liu Yong
[J]. Wuhan University Journal of Natural Sciences, 2003, (S1) : 155 - 164
[48] Neural Network Ensembles with Missing Data Processing and Data Fusion Capacities: Applications in Medicine and in the Environment
Garcia Baez, Patricio
Suarez Araujo, Carmen Paz
Fernandez Lopez, Pablo
[J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2011, PT II, 2011, 6692 : 169 - 176
[49] RELATIONSHIP BETWEEN DATA SIZE, ACCURACY, DIVERSITY AND CLUSTERS IN NEURAL NETWORK ENSEMBLES
Chiu, Chien-Yuan
Verma, Brijesh
[J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2013, 12 (04)
[50] A Comparative Study of Data Anonymization Techniques
Murthy, Suntherasvaran
Abu Bakar, Asmidar
Rahim, Fiza Abdul
Ramli, Ramona
[J]. 2019 IEEE 5TH INTL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY) / IEEE INTL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING (HPSC) / IEEE INTL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2019, : 306 - 309

← 1 2 3 4 5 →