UFSSF - An Efficient Unsupervised Feature Selection for Streaming Features

被引:4
|
作者
Almusallam, Naif [1 ,2 ]
Tari, Zahir [1 ]
Chan, Jeffrey [1 ]
AlHarthi, Adil [3 ]
机构
[1] Royal Melbourne Inst RMIT, Melbourne, Vic, Australia
[2] Al Imam Muhammad Bin Saud Islamic Univ IMSIU, Riyadh, Saudi Arabia
[3] Albaha Univ, Albaha, Saudi Arabia
关键词
D O I
10.1007/978-3-319-93037-4_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Streaming features applications pose challenges for feature selection. For such dynamic features applications: (a) features are sequentially generated and are processed one by one upon their arrival while the number of instances/ points remains fixed; and (b) the complete feature space is not known in advance. Existing approaches require class labels as a guide to select the representative features. However, in real-world applications most data are not labeled and, moreover, manual labeling is costly. A new algorithm, called Unsupervised Feature Selection for Streaming Features (UFSSF), is proposed in this paper to select representative features in streaming features applications without the need to know the features or class labels in advance. UFSSF extends the k-mean clustering algorithm to include linearly dependent similarity measures so as to incrementally decide whether to add the newly arrived feature to the existing set of representative features. Those features that are not representative are discarded. Experimental results indicates that UFSSF significantly has a better prediction accuracy and running time compared to the baseline approaches.
引用
下载
收藏
页码:493 / 505
页数:13
相关论文
共 50 条
  • [11] An efficient unsupervised feature selection procedure through feature clustering
    Yan, Xuyang
    Nazmi, Shabnam
    Erol, Berat A.
    Homaifar, Abdollah
    Gebru, Biniam
    Tunstel, Edward
    PATTERN RECOGNITION LETTERS, 2020, 131 : 277 - 284
  • [12] Predictable Features Elimination: An Unsupervised Approach to Feature Selection
    Barbiero, Pietro
    Squillero, Giovanni
    Tonda, Alberto
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE (LOD 2021), PT I, 2022, 13163 : 399 - 412
  • [13] Unsupervised Feature Selection: Minimize Information Redundancy of Features
    Yen, Chun-Chao
    Chen, Liang-Chieh
    Lin, Shou-De
    INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 247 - 254
  • [14] Efficient information-theoretic unsupervised feature selection
    Lee, J.
    Seo, W.
    Kim, D. -W.
    ELECTRONICS LETTERS, 2018, 54 (02) : 76 - 77
  • [15] Online Feature Selection with Capricious Streaming Features: A General Framework
    Wu, Di
    He, Yi
    Luo, Xin
    Shang, Mingsheng
    Wu, Xindong
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 683 - 688
  • [16] Online feature selection for multi-source streaming features
    You, Dianlong
    Sun, Miaomiao
    Liang, Shunpan
    Li, Ruiqi
    Wang, Yang
    Xiao, Jiawei
    Yuan, Fuyong
    Shen, Limin
    Wu, Xindong
    INFORMATION SCIENCES, 2022, 590 : 267 - 295
  • [17] Unsupervised Feature Selection for Outlier Detection on Streaming Data to Enhance Network Security
    Heigl, Michael
    Weigelt, Enrico
    Fiala, Dalibor
    Schramm, Martin
    APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [18] Deep unsupervised feature selection by discarding nuisance and correlated features
    Shaham, Uri
    Lindenbaum, Ofir
    Svirsky, Jonathan
    Kluger, Yuval
    NEURAL NETWORKS, 2022, 152 : 34 - 43
  • [19] Towards an Unsupervised Feature Selection Method for Effective Dynamic Features
    Almusallam, Naif
    Tari, Zahir
    Chan, Jeffrey
    Fahad, Adil
    Alabdulatif, Abdulatif
    Al-Naeem, Mohammed
    IEEE ACCESS, 2021, 9 : 77149 - 77163
  • [20] Unsupervised Feature Selection for Efficient Exploration of High Dimensional Data
    Chakrabarti, Arnab
    Das, Abhijeet
    Cochez, Michael
    Quix, Christoph
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2021, 2021, 12843 : 183 - 197