Clustering algorithm using rough set theory for unsupervised feature selection

Cited by: 0
Authors
Pacheco, Fannia [1]
Cerrada, Mariela [2, 3]
Li, Chuan [2, 4]
Sanchez, Rene Vinicio [2]
Cabrera, Diego [2]
de Oliveira, Jose Valente [5]
Affiliations
[1] Univ Politecn Salesiana, GIDTEC Mech Engn Dept, Cuenca, Ecuador
[2] GIDTEC, Merida, Venezuela
[3] Univ Los Andes, Merida, Venezuela
[4] Chongqing Technol & Business Univ, Natl Res Base Intelligent Mfg Serv, Chongqing, Peoples R China
[5] Univ Algarve, CEOT, Faro, Portugal
Source
2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2016
Keywords
RELATIVE DEPENDENCY;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The data available to describe real-world problems are growing considerably, owing to the number of measurable characteristics (features) that can be collected. Machine learning techniques are widely used to extract valuable knowledge from data, but their performance may degrade when the proper features are not selected. Feature selection searches for relations that reveal possibly redundant or irrelevant features in a case study; this search is performed in either a supervised or an unsupervised manner. In the present work, we propose an unsupervised feature selection algorithm using: (1) relative dependency to search for similarities between features, (2) a clustering algorithm to group similar features, and (3) a procedure to select the most representative feature of each group to obtain a reduced feature space. The relative dependency degree between pairs of attributes is used to compute a similarity measure. This measure is used by a clustering algorithm to perform attribute clustering through KNN and prototype-based clustering. The proposal is tested on well-known benchmarks and compared with classic supervised and unsupervised feature selection techniques. Additionally, the proposal is evaluated on a real-world application in fault diagnosis for rotating machinery.
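The three steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes discrete-valued features and the common rough-set definition of relative dependency, the number of equivalence classes induced by attribute a divided by the number induced by {a, b}. The symmetric average used as the similarity, the threshold-linkage grouping (swapped in for the KNN and prototype-based clustering the abstract mentions), and the `threshold` parameter are illustrative assumptions, as are all function names.

```python
from itertools import combinations


def n_classes(rows, cols):
    """Count the equivalence classes induced on the rows by the attributes in cols."""
    return len({tuple(row[c] for c in cols) for row in rows})


def relative_dependency(rows, a, b):
    """Assumed definition: |U/{a}| / |U/{a, b}|; equals 1.0 when a determines b."""
    return n_classes(rows, [a]) / n_classes(rows, [a, b])


def similarity_matrix(rows, n_features):
    """Symmetric similarity: average of the two directed dependencies (an assumption)."""
    sim = [[1.0] * n_features for _ in range(n_features)]
    for i, j in combinations(range(n_features), 2):
        s = 0.5 * (relative_dependency(rows, i, j) + relative_dependency(rows, j, i))
        sim[i][j] = sim[j][i] = s
    return sim


def cluster_features(sim, threshold):
    """Group features linked by similarity >= threshold (connected components).

    This simple linkage stands in for the clustering step; it is not the
    KNN / prototype-based procedure described in the abstract.
    """
    n = len(sim)
    parent = list(range(n))

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in combinations(range(n), 2):
        if sim[i][j] >= threshold:
            parent[find(j)] = find(i)

    clusters = {}
    for f in range(n):
        clusters.setdefault(find(f), []).append(f)
    return list(clusters.values())


def select_representatives(sim, clusters):
    """Keep, per cluster, the feature with the highest total similarity to its members."""
    return sorted(max(c, key=lambda f: sum(sim[f][g] for g in c)) for c in clusters)


def select_features(rows, n_features, threshold=0.9):
    """Run the three steps: similarity, grouping, representative selection."""
    sim = similarity_matrix(rows, n_features)
    return select_representatives(sim, cluster_features(sim, threshold))
```

Calling `select_features(rows, n_features)` on a list of equal-length tuples returns the indices of the retained features. Continuous data would need discretization first, which this sketch does not cover.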
Pages: 3493-3499
Page count: 7
Related papers
50 in total
  • [31] Discretization using clustering and rough set theory
    Singh, Girish Kumar
    Minz, Sonajharia
    ICCTA 2007: INTERNATIONAL CONFERENCE ON COMPUTING: THEORY AND APPLICATIONS, PROCEEDINGS, 2007, : 330 - +
  • [32] Autonomous Clustering Using Rough Set Theory
    Bean, Charlotte
    Kambhampati, Chandra
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2008, 5 (01) : 90 - 102
  • [33] Autonomous Clustering Using Rough Set Theory
    Charlotte Bean
    Chandra Kambhampati
    International Journal of Automation & Computing, 2008, (01) : 90 - 102
  • [34] An Enhanced Feature Selection Method Comprising Rough Set and Clustering Techniques
    Murugan, A.
    Sridevi, T.
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 401 - 404
  • [35] Unsupervised feature selection using clustering ensembles and population based incremental learning algorithm
    Hong, Yi
    Kwong, Sam
    Chang, Yuchou
    Ren, Qingsheng
    PATTERN RECOGNITION, 2008, 41 (09) : 2742 - 2756
  • [36] Intelligent Water Drops Algorithm for Rough Set Feature Selection
    Alijla, Basem O.
    Peng, Lim Chee
    Khader, Ahamad Tajudin
    Al-Betar, Mohammed Azmi
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 356 - 365
  • [37] Feature Selection Based on Neighborhood Systems and Rough Set Theory
    He, Ming
    WKDD: 2009 SECOND INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, : 3 - 5
  • [38] Fault feature subset selection based on rough set theory
    Zhao, Yueling
    Xu, Lin
    Wang, Jianhui
    Gu, Shusheng
    Complexity Analysis and Control for Social, Economical and Biological Systems, 2006, 1 : 162 - 171
  • [39] Information and Rough Set Theory Based Feature Selection Techniques
    Cervante, Liam
    Gao, Xiaoying
    ACTIVE MEDIA TECHNOLOGY, AMT 2013, 2013, 8210 : 166 - 176
  • [40] Feature selection using rough set in intrusion detection
    Zainal, Anazida
    Maarof, Mohd Aizaini
    Shamsuddin, Siti Mariyam
    TENCON 2006 - 2006 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2006, : 2026 - +