Unsupervised fuzzy-rough set-based dimensionality reduction

被引:60
|
作者
Mac Parthalain, Neil [1 ]
Jensen, Richard [1 ]
机构
[1] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Ceredigion, Wales
关键词
Rough set; Fuzzy set; Attribute reduction; Unsupervised feature selection; Unsupervised learning; FEATURE-SELECTION;
D O I
10.1016/j.ins.2012.12.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Each year worldwide, more and more data is collected. In fact, it is estimated that the amount of data collected and stored at least doubles every 2 years. Of this data, a large percentage is unlabelled or has labels which are incomplete or missing. It is because this data is so large that it becomes very difficult for humans to manually assign labels to data objects. Additionally, many real-world application datasets such as those in gene expression analysis, and text classification are also of large dimensionality. This further frustrates the process of label assignment for domain experts as not all of the features are relevant or necessary in order to assign a given label. Hence unsupervised feature selection is required. For supervised learning, feature selection algorithms attempt to maximise a given function of predictive accuracy. This function typically considers the ability of feature vectors to reflect decision class labels. However, for the unsupervised learning task, decision class labels are not provided, which poses questions such as: which features should be retained? In fact, not all features are important and some are irrelevant, redundant or noisy. In this paper, several unsupervised FS approaches are presented which are based on fuzzy-rough sets. These approaches require no thresholding information, are domain-independent, and can operate on real-valued data without the need for discretisation. They offer a significant reduction in dimensionality whilst retaining the semantics of the data, and can even result in supersets of the supervised fuzzy-rough approaches. The approaches are compared with some supervised techniques and are shown to retain useful features. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:106 / 121
页数:16
相关论文
共 50 条
  • [1] Fuzzy-rough set models and fuzzy-rough data reduction
    Ghroutkhar, Alireza Mansouri
    Nehi, Hassan Mishmast
    [J]. CROATIAN OPERATIONAL RESEARCH REVIEW, 2020, 11 (01) : 67 - 80
  • [2] Fuzzy-rough hybrid dimensionality reduction
    Wang, Zhihong
    Chen, Hongmei
    Yuan, Zhong
    Li, Tianrui
    [J]. FUZZY SETS AND SYSTEMS, 2023, 459 : 95 - 117
  • [3] An Intuitionistic Fuzzy-Rough Set-Based Classification for Anomaly Detection
    Mazarbhuiya, Fokrul Alom
    Shenify, Mohamed
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [4] Fuzzy-rough sets for descriptive dimensionality reduction
    Jensen, R
    Shen, Q
    [J]. PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOL 1 & 2, 2002, : 29 - 34
  • [5] Fuzzy-Rough Set Bireducts for Data Reduction
    Parthalain, Neil Mac
    Jensen, Richard
    Diao, Ren
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (08) : 1840 - 1850
  • [6] Dataset condensation using OWA fuzzy-rough set-based nearest neighbor classifier
    Amiri, Mehran
    Jensen, Richard
    Eftekhari, Mahdi
    Mac Parthalain, Neil
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2016, : 1934 - 1941
  • [7] Fuzzy-Rough Set Based Attribute Reduction with a Simple Fuzzification Method
    Wang, Xueen
    Han, Deqiang
    Han, Chongzhao
    [J]. PROCEEDINGS OF THE 2012 24TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2012, : 3793 - 3797
  • [8] An improved fuzzy set-based multifactor dimensionality reduction for detecting epistasis
    Yang, Cheng-Hong
    Chuang, Li-Yeh
    Lin, Yu-Da
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2020, 102
  • [9] A rough set-based fuzzy clustering
    Zhao, YQ
    Zhou, XZ
    Tang, GZ
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 401 - 409
  • [10] Fuzzy rough set-based attribute reduction using distance measures
    Wang, Changzhong
    Huang, Yang
    Shao, Mingwen
    Fan, Xiaodong
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 164 : 205 - 212