Reduction of categorical and numerical attribute values for understandability of data and rules

被引:0
|
作者
Muto, Yuji [1 ]
Kudo, Mineichi [1 ]
Shidara, Yohji [1 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Div Comp Sci, Sapporo, Hokkaido 0600814, Japan
关键词
attribute values; reduction; grouping; granularity; understandability;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we discuss attribute-value reduction for raising up the understandability of data and rules. In the traditional "reduction" sense, the goal is to find the smallest number of attributes such that they enable us to discern each tuple or each decision class. However, once we pay attention also to the number of attribute values, that is, the size/resolution of each attribute domain, another goal appears. An interesting question is like, which one is better in the following two situations 1) we can discern individual tuples with a single attribute described in fine granularity, and 2) we can do this with a few attributes described in rough granularity. Such a question is related to understandability and Kansei expression of data as well as rules. We propose a criterion and an algorithm to find near-optimal solutions for the criterion. In addition, we show some illustrative results for some databases in UCI repository of machine learning databases.
引用
收藏
页码:211 / +
页数:2
相关论文
共 50 条
  • [1] Fuzzy Rough Attribute Reduction for Categorical Data
    Wang, Changzhong
    Wang, Yan
    Shao, Mingwen
    Qian, Yuhua
    Chen, Degang
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (05) : 818 - 830
  • [2] CLUSTERING CATEGORICAL DATA BASED ON COMBINATIONS OF ATTRIBUTE VALUES
    Do, Hee-Jung
    Kim, Jae Yearn
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (12A): : 4393 - 4405
  • [3] On a rough sets based tool for generating rules from data with categorical and numerical values
    Sakai, Hiroshi
    Koba, Kazuhiro
    Ishibashi, Ryuji
    Nakata, Michinori
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4617 : 269 - +
  • [4] Weighted Numerical and Categorical Attribute Clustering in Data Streams
    Liang, Wen-Bin
    Wang, Chang-Dong
    Lai, Jian-Huang
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3066 - 3072
  • [5] Categorical data clustering using tine combinations of attribute values
    Do, Hee-Jung
    Kim, Jae-Yearn
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2008, PT 2, PROCEEDINGS, 2008, 5073 : 220 - 231
  • [6] Labeling unclustered categorical data into clusters based on the important attribute values
    Chen, HL
    Chuang, KT
    Chen, MS
    Fifth IEEE International Conference on Data Mining, Proceedings, 2005, : 106 - 113
  • [7] Clustering mixed numerical and categorical data with missing values
    Dinh, Duy-Tai
    Huynh, Van-Nam
    Sriboonchitta, Songsak
    INFORMATION SCIENCES, 2021, 571 : 418 - 442
  • [8] A method to compute distance between two categorical values of same attribute in unsupervised learning for categorical data set
    Ahmad, Amir
    Dey, Lipika
    PATTERN RECOGNITION LETTERS, 2007, 28 (01) : 110 - 118
  • [9] A new internal clustering validation index for categorical data based on concentration of attribute values
    Fu L.-W.
    Wu S.
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2019, 41 (05): : 682 - 693
  • [10] Novel Attribute Reduction on Decision Rules
    Wang, Can
    Lin, Qiang
    Xu, Chunming
    Li, Lin
    Fan, Xiaoyong
    2020 IEEE 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2020, : 69 - 74