A novel soft set approach in selecting clustering attribute

被引:49
|
作者
Qin, Hongwu [1 ,2 ]
Ma, Xiuqin [1 ,2 ]
Zain, Jasni Mohamad [1 ]
Herawan, Tutut [1 ]
机构
[1] Univ Malaysia Pahang, Fac Comp Syst & Software Engn, Gambang 26300, Kuantan, Malaysia
[2] NW Normal Univ, Coll Math & Informat Sci, Lanzhou 730070, Gansu, Peoples R China
关键词
Soft set; Rough set; Information system; Clustering attribute; ROUGH SET; REDUCTION ALGORITHM; MODEL;
D O I
10.1016/j.knosys.2012.06.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is one of the most useful tasks in data mining process for discovering groups and identifying interesting distributions and patterns in the underlying data. One of the techniques of data clustering was performed by introducing a clustering attribute. Soft set theory, initiated by Molodtsov in 1999, is a new general mathematical tool for dealing with uncertainties. In this paper, we define a soft set model on the equivalence classes of an information system, which can be easily applied in obtaining approximate sets of rough sets. Furthermore, we use it to select a clustering attribute for categorical datasets and a heuristic algorithm is presented. Experiment results on fifteen UCI benchmark datasets showed that the proposed approach provides a faster decision in selecting a clustering attribute as compared with maximum dependency attributes (MDAs) approach up to 14.84%. Furthermore, MDA and NSS have a good scalability i.e. the executing time of both algorithms tends to increase linearly as the number of instances and attributes are increased, respectively. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:139 / 145
页数:7
相关论文
共 50 条
  • [1] Soft Set Approach for Selecting Decision Attribute in Data Clustering
    Awang, Mohd Isa
    Rose, Ahmad Nazari Mohd
    Herawan, Tutut
    Deris, Mustafa Mat
    [J]. ADVANCED DATA MINING AND APPLICATIONS (ADMA 2010), PT II, 2010, 6441 : 87 - 98
  • [2] A rough set approach for selecting clustering attribute
    Herawan, Tutut
    Deris, Mustafa Mat
    Abawajy, Jemal H.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2010, 23 (03) : 220 - 231
  • [3] A Soft Set Approach for Fast Clustering Attribute Selection
    Hartama, Dedy
    Yanto, Iwm Tri Riyadi
    Zarlis, Muhammad
    [J]. 2016 INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTING (ICIC), 2016, : 12 - 15
  • [4] ROUGH SET THEORY FOR SELECTING CLUSTERING ATTRIBUTE
    Herawan, Tutut
    Dens, Mustafa Mat
    [J]. POWER CONTROL AND OPTIMIZATION, PROCEEDINGS, 2009, 1159 : 331 - 338
  • [5] Maximum Attribute Relative Approach of Soft Set Theory in Selecting Cluster Attribute of Electronic Government Data Set
    Jacob, Deden Witarsyah
    Yanto, Iwan Tri Riyadi
    Fudzee, Mohd Farhan Md
    Salamat, Mohamad Aizi
    [J]. RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING (SCDM 2018), 2018, 700 : 473 - 484
  • [6] MAR: Maximum Attribute Relative of soft set for clustering attribute selection
    Mamat, Rabiei
    Herawan, Tutut
    Denis, Mustafa Mat
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 52 : 11 - 20
  • [7] A Mean Mutual Information Based Approach for Selecting Clustering Attribute
    Qin, Hongwu
    Ma, Xiuqin
    Zain, Jasni Mohamad
    Sulaiman, Norrozila
    Herawan, Tutut
    [J]. SOFTWARE ENGINEERING AND COMPUTER SYSTEMS, PT 2, 2011, 180 : 1 - 15
  • [8] Soft Set Approach for Clustering Graduated Dataset
    Saedudin, Rd Rohmat
    Kasim, Shahreen Binti
    Mahdin, Hairulnizam
    Hasibuan, Muhammad Azani
    [J]. RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING, 2017, 549 : 631 - 637
  • [9] An New Algorithm-based Rough Set for Selecting Clustering Attribute in Categorical Data
    Baroud, Muftah Mohamed Jomah
    Hashim, Siti Zaiton Mohd
    Zainal, Anazida
    Ahnad, Jamilah
    [J]. 2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, : 1358 - 1364
  • [10] A Weighted Attribute Decision Making Approach in Incomplete Soft Set
    Zhang, Lishi
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING, 2014, 5 : 1553 - 1556