UFSSF - An Efficient Unsupervised Feature Selection for Streaming Features

被引:4
|
作者
Almusallam, Naif [1 ,2 ]
Tari, Zahir [1 ]
Chan, Jeffrey [1 ]
AlHarthi, Adil [3 ]
机构
[1] Royal Melbourne Inst RMIT, Melbourne, Vic, Australia
[2] Al Imam Muhammad Bin Saud Islamic Univ IMSIU, Riyadh, Saudi Arabia
[3] Albaha Univ, Albaha, Saudi Arabia
关键词
D O I
10.1007/978-3-319-93037-4_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Streaming features applications pose challenges for feature selection. For such dynamic features applications: (a) features are sequentially generated and are processed one by one upon their arrival while the number of instances/ points remains fixed; and (b) the complete feature space is not known in advance. Existing approaches require class labels as a guide to select the representative features. However, in real-world applications most data are not labeled and, moreover, manual labeling is costly. A new algorithm, called Unsupervised Feature Selection for Streaming Features (UFSSF), is proposed in this paper to select representative features in streaming features applications without the need to know the features or class labels in advance. UFSSF extends the k-mean clustering algorithm to include linearly dependent similarity measures so as to incrementally decide whether to add the newly arrived feature to the existing set of representative features. Those features that are not representative are discarded. Experimental results indicates that UFSSF significantly has a better prediction accuracy and running time compared to the baseline approaches.
引用
收藏
页码:493 / 505
页数:13
相关论文
共 50 条
  • [1] An Online Unsupervised Streaming Features Selection Through Dynamic Feature Clustering
    Yan X.
    Homaifar A.
    Sarkar M.
    Lartey B.
    Gupta K.D.
    [J]. IEEE Transactions on Artificial Intelligence, 2023, 4 (05): : 1281 - 1292
  • [2] Group Feature Selection with Streaming Features
    Li, Haiguang
    Wu, Xindong
    Li, Zhao
    Ding, Wei
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 1109 - 1114
  • [3] Online Feature Selection with Streaming Features
    Wu, Xindong
    Yu, Kui
    Ding, Wei
    Wang, Hao
    Zhu, Xingquan
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (05) : 1178 - 1192
  • [4] An efficient framework for unsupervised feature selection
    Zhang, Han
    Zhang, Rui
    Nie, Feiping
    Li, Xuelong
    [J]. NEUROCOMPUTING, 2019, 366 : 194 - 207
  • [5] A survey on online feature selection with streaming features
    Hu, Xuegang
    Zhou, Peng
    Li, Peipei
    Wang, Jing
    Wu, Xindong
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2018, 12 (03) : 479 - 493
  • [6] Online Causal Feature Selection for Streaming Features
    You, Dianlong
    Li, Ruiqi
    Liang, Shunpan
    Sun, Miaomiao
    Ou, Xinju
    Yuan, Fuyong
    Shen, Limin
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (03) : 1563 - 1577
  • [7] A survey on online feature selection with streaming features
    Xuegang Hu
    Peng Zhou
    Peipei Li
    Jing Wang
    Xindong Wu
    [J]. Frontiers of Computer Science, 2018, 12 : 479 - 493
  • [8] Efficient greedy feature selection for unsupervised learning
    Ahmed K. Farahat
    Ali Ghodsi
    Mohamed S. Kamel
    [J]. Knowledge and Information Systems, 2013, 35 : 285 - 310
  • [9] Efficient greedy feature selection for unsupervised learning
    Farahat, Ahmed K.
    Ghodsi, Ali
    Kamel, Mohamed S.
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 35 (02) : 285 - 310
  • [10] An efficient unsupervised feature selection procedure through feature clustering
    Yan, Xuyang
    Nazmi, Shabnam
    Erol, Berat A.
    Homaifar, Abdollah
    Gebru, Biniam
    Tunstel, Edward
    [J]. PATTERN RECOGNITION LETTERS, 2020, 131 : 277 - 284