Mining incomplete data—A rough set approach

被引:0
|
作者
GRZYMALA-BUSSE Jerzy W
机构
[1] KS 66045
[2] USA
[3] University of Kansas Lawrence
关键词
rough set theory; incomplete data sets; missing attribute values; lost values; attribute-concept values; do not care" conditions; the MLEM2 algorithm of rule induction;
D O I
暂无
中图分类号
TP182 [专家系统、知识工程];
学科分类号
1111 ;
摘要
Many real-life data sets are incomplete,or in different words,are affected by missing attribute values.Three interpretations of missing attribute values are discussed in the paper:lost values(erased values),attribute-concept values(such a value may be replaced by any value from the attribute domain restricted to the concept),and "do not care" conditions(a missing attribute value may be replaced by any value from the attribute domain).For incomplete data sets three definitions of lower and upper approximations are discussed.Experiments were conducted on six typical data sets with missing attribute values,using three different interpretations of missing attribute values and the same definition of concept lower and upper approximations.The conclusion is that the best approach to missing attribute values is the lost value type.
引用
收藏
页码:282 / 290
页数:9
相关论文
共 4 条
  • [1] Knowledge acquisition under uncertainty — a rough set approach[J] . Jerzy W. Grzymala-Busse.Journal of Intelligent and Robotic Systems . 1988 (1)
  • [2] ROUGH SETS
    PAWLAK, Z
    [J]. INTERNATIONAL JOURNAL OF COMPUTER & INFORMATION SCIENCES, 1982, 11 (05): : 341 - 356
  • [3] Learning cov-erage rules from incomplete data based on rough sets .2 HONG T P,TSENG L H,CHIEN B C. Proceedings of the IEEE International Conference on Systems,Man and Cybernetics . 2004
  • [4] On the extension of rough sets under incomplete information .2 STEFANOWSKI J,TSOUKIAS A. Inter-nationlal Journal of Leterlligeh Sgstom . 1999