Hierarchical fuzzy rule based classification systems with genetic rule selection for imbalanced data-sets

Cited by: 137
Authors
Fernandez, Alberto [1]
del Jesus, Maria Jose [2]
Herrera, Francisco [1]
Affiliations
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, E-18071 Granada, Spain
[2] Univ Jaen, Dept Comp Sci, Jaen, Spain
Keywords
Classification; Fuzzy rule based classification systems; Imbalanced data-sets; Genetic fuzzy systems; Genetic rule selection; Hierarchical fuzzy partitions; REASONING METHODS; IDENTIFICATION; CLASSIFIERS; ALGORITHMS; DESIGN;
DOI
10.1016/j.ijar.2008.11.004
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In many real application areas, the data used are highly skewed and the number of instances for some classes is much higher than that of the other classes. Solving a classification task using such an imbalanced data-set is difficult due to the bias of the training towards the majority classes. The aim of this paper is to improve the performance of fuzzy rule based classification systems on imbalanced domains by increasing the granularity of the fuzzy partitions on the boundary areas between the classes, in order to obtain a better separability. We propose the use of a hierarchical fuzzy rule based classification system, which is based on the refinement of a simple linguistic fuzzy model by means of the extension of the structure of the knowledge base in a hierarchical way and the use of a genetic rule selection process in order to get a compact and accurate model. The good performance of this approach is shown through an extensive experimental study carried out over a large collection of imbalanced data-sets. (C) 2008 Elsevier Inc. All rights reserved.
Pages: 561-577
Page count: 17