Online streaming feature selection based on neighborhood rough set

被引:12
|
作者
Li, Shuangjie [1 ]
Zhang, Kaixiang [1 ]
Li, Yali [1 ]
Wang, Shuqin [1 ]
Zhang, Shaoqiang [1 ]
机构
[1] Tianjin Normal Univ, Coll Comp & Informat Engn, Tianjin 300387, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Online streaming feature selection; Neighborhood rough set; K-nearest neighborhood; Gap neighborhood; ATTRIBUTE REDUCTION; INFORMATION; FRAMEWORK;
D O I
10.1016/j.asoc.2021.108025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is considered as a necessary and significant pre-processing step in many fields, especially in machine learning. However, in some real problems, in which features flow one by one, many existing approaches do not work well on the online streaming features, and most online streaming feature selection (OSFS) methods face the challenge of requiring domain knowledge before setting optimal parameters in advance. Therefore, an effective feature selection method for online streaming features, named OFS-Gapknn, is proposed in this paper. A new neighborhood rough set relation is firstly defined, which combines the advantages of both the k-nearest and the Gap neighborhood. The proposed neighborhood relation can not only work well on the unevenly distributed sample space, but also need not any parameters and domain knowledge. Then, the relevance and redundancy features are analyzed by using the dependency based on the neighborhood rough set. Finally, one of the optimal feature subsets is obtained. To validate the effectiveness of the proposed algorithm, four traditional methods and three OSFS methods are compared with it on 11 datasets. Experimental results indicate the dominance and significance of the proposed method. (C) 2021 Published by Elsevier B.V.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Online hierarchical streaming feature selection based on adaptive neighborhood rough set
    Shu, Tongxin
    Lin, Yaojin
    Guo, Lei
    [J]. APPLIED SOFT COMPUTING, 2024, 152
  • [2] Online streaming feature selection using adapted Neighborhood Rough Set
    Zhou, Peng
    Hu, Xuegang
    Li, Peipei
    Wu, Xindong
    [J]. INFORMATION SCIENCES, 2019, 481 : 258 - 279
  • [3] New Online Streaming Feature Selection Based on Neighborhood Rough Set for Medical Data
    Lei, Dingfei
    Liang, Pei
    Hu, Junhua
    Yuan, Yuan
    [J]. SYMMETRY-BASEL, 2020, 12 (10): : 1 - 31
  • [4] Online multi-label streaming feature selection based on neighborhood rough set
    Liu, Jinghua
    Lin, Yaojin
    Li, Yuwen
    Weng, Wei
    Wu, Shunxiang
    [J]. PATTERN RECOGNITION, 2018, 84 : 273 - 287
  • [5] Large-Scale Hierarchical Classification Online Streaming Feature Selection Based on Neighborhood Rough Set
    Bai, Shengxing
    Lin, Yaojin
    Wang, Chenxi
    Chen, Shengyu
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (09): : 811 - 820
  • [6] Online early terminated streaming feature selection based on Rough Set theory
    Zhou, Peng
    Li, Peipei
    Zhao, Shu
    Zhang, Yanping
    [J]. APPLIED SOFT COMPUTING, 2021, 113
  • [7] Online Streaming Feature Selection for High-Dimensional and Class-Imbalanced Data Based on Neighborhood Rough Set
    Chen, Xiangyan
    Lin, Yaojin
    Wang, Chenxi
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (08): : 726 - 735
  • [8] Online group streaming feature selection based on fuzzy neighborhood granular ball rough sets
    Sun, Yuanhao
    Zhu, Ping
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [9] A New Online Feature Selection Method Using Neighborhood Rough Set
    Zhou, Peng
    Hu, Xuegang
    Li, Peipei
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (IEEE ICBK 2017), 2017, : 135 - 142
  • [10] ASFS: A novel streaming feature selection for multi-label data based on neighborhood rough set
    Jinghua Liu
    Yaojin Lin
    Jixiang Du
    Hongbo Zhang
    Ziyi Chen
    Jia Zhang
    [J]. Applied Intelligence, 2023, 53 : 1707 - 1724