Scaling Up Feature Selection: A Distributed Filter Approach

被引:0
|
作者
Bolon-Canedo, Veronica [1 ]
Sanchez-Marono, Noelia [1 ]
Cervino-Rabunal, Joana [1 ]
机构
[1] Univ A Coruna, Dept Comp Sci, Lab Res & Dev Artificial Intelligence LIDIA, La Coruna 15071, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditionally, feature selection has been required as a preliminary step for many pattern recognition problems. In recent years, distributed learning has been the focus of much attention, due to the proliferation of big databases, in some cases distributed across different nodes. However, most of the existing feature selection algorithms were designed for working in a centralized manner, i.e. using the whole dataset at once. In this research, a new approach for using filter methods in a distributed manner is presented. The approach splits the data horizontally, i.e., by samples. A filter is applied at each partition performing several rounds to obtain a stable set of features. Later, a merging procedure is performed in order to combine the results into a single subset of relevant features. Five of the most well-known filters were used to test the approach. The experimental results on six representative datasets show that the execution time is shortened whereas the performance is maintained or even improved compared to the standard algorithms applied to the non-partitioned datasets.
引用
收藏
页码:121 / 130
页数:10
相关论文
共 50 条
  • [21] Feature Subset Selection: A Correlation-Based SVM Filter Approach
    Li, Boyang
    Wang, Qiangwei
    Hu, Jinglu
    [J]. IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2011, 6 (02) : 173 - 179
  • [22] RISC: A new filter approach for feature selection from proteomic data
    Vu, Trung-Nghia
    Ohn, Syng-Yup
    Kim, Chul-Woo
    [J]. MEDICAL BIOMETRICS, PROCEEDINGS, 2007, 4901 : 17 - +
  • [23] Binary Harris Hawks Optimisation Filter Based Approach for Feature Selection
    Abu Khurma, Ruba
    Awadallah, Mohammed A.
    Aljarah, Ibrahim
    [J]. 2021 PALESTINIAN INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (PICICT 2021), 2021, : 59 - 64
  • [24] A HYBRID FILTER-WRAPPER FEATURE SELECTION APPROACH FOR AUTHORSHIP ATTRIBUTION
    Ma, Jianbin
    Xue, Bing
    Zhang, Mengjie
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2019, 15 (05): : 1989 - 2006
  • [25] A Filter-APOSD approach for feature selection and linguistic knowledge discovery
    Yu, Jianping
    Yuan, Laidi
    Zhang, Tao
    Fu, Jilin
    Cao, Yuyang
    Li, Shaoxiong
    Xu, Xueping
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (03) : 4013 - 4028
  • [26] A Framework for Distributed Feature Selection
    Sharifnezhad, Mona
    Rahmani, Mohsen
    Ghaffarian, Hossein
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (07)
  • [27] A filter-based feature construction and feature selection approach for classification using Genetic Programming
    Ma, Jianbin
    Gao, Xiaoying
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 196
  • [28] An Unsupervised Approach for Selection of Candidate Feature Set Using Filter Based Techniques
    Potharaju, Sai Prasad
    Sreedevi, Marriboyina
    [J]. GAZI UNIVERSITY JOURNAL OF SCIENCE, 2018, 31 (03): : 789 - 799
  • [29] Speech Feature Selection of Normal and Autistic children using Filter and Wrapper Approach
    Akhtar, Muhammed Ali
    Ali, Syed Abbas
    Siddiqui, Maria Andleeb
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (05): : 129 - 132
  • [30] An unsupervised approach for selection of candidate feature set using filter based techniques
    [J]. Potharaju, Sai Prasad (psaiprasadcse@gmail.com), 2018, Gazi Universitesi (31):