Scaling Up Feature Selection: A Distributed Filter Approach

被引:0
|
作者
Bolon-Canedo, Veronica [1 ]
Sanchez-Marono, Noelia [1 ]
Cervino-Rabunal, Joana [1 ]
机构
[1] Univ A Coruna, Dept Comp Sci, Lab Res & Dev Artificial Intelligence LIDIA, La Coruna 15071, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditionally, feature selection has been required as a preliminary step for many pattern recognition problems. In recent years, distributed learning has been the focus of much attention, due to the proliferation of big databases, in some cases distributed across different nodes. However, most of the existing feature selection algorithms were designed for working in a centralized manner, i.e. using the whole dataset at once. In this research, a new approach for using filter methods in a distributed manner is presented. The approach splits the data horizontally, i.e., by samples. A filter is applied at each partition performing several rounds to obtain a stable set of features. Later, a merging procedure is performed in order to combine the results into a single subset of relevant features. Five of the most well-known filters were used to test the approach. The experimental results on six representative datasets show that the execution time is shortened whereas the performance is maintained or even improved compared to the standard algorithms applied to the non-partitioned datasets.
引用
收藏
页码:121 / 130
页数:10
相关论文
共 50 条
  • [1] Sequential Learning Approach for Scaling Up Filter-Based Feature Subset Selection
    Ditzler, Gregory
    Polikar, Robi
    Rosen, Gail
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2530 - 2544
  • [2] Scaling Up Feature Selection by Means of Democratization
    de Haro-Garcia, Aida
    Garcia-Pedrajas, Nicolas
    [J]. TRENDS IN APPLIED INTELLIGENT SYSTEMS, PT II, PROCEEDINGS, 2010, 6097 : 662 - 672
  • [3] A Cluster-Filter Feature Selection Approach
    Dubey, Vimal Kumar
    Saxena, Amit Kumar
    Shrivas, Madan Madhaw
    [J]. PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON ICT IN BUSINESS INDUSTRY & GOVERNMENT (ICTBIG), 2016,
  • [4] Fast feature selection with genetic algorithms: A filter approach
    Lanzi, PL
    [J]. PROCEEDINGS OF 1997 IEEE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION (ICEC '97), 1997, : 537 - 540
  • [5] A COMBINED APPROACH FOR FILTER FEATURE SELECTION IN DOCUMENT CLASSIFICATION
    Le Nguyen Hoai Nam
    Ho Bao Quoc
    [J]. 2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 317 - 324
  • [6] Feature subset selection: A correlation based filter approach
    Hall, MA
    Smith, LA
    [J]. PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998, : 855 - 858
  • [7] A filter approach to feature selection based on mutual information
    Huang, Jinjie
    Cai, Yunze
    Xu, Xiaoming
    [J]. PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 84 - 89
  • [8] A Distributed Feature Selection Approach Based on a Complexity Measure
    Bolon-Canedo, Veronica
    Sanchez-Marono, Noelia
    Alonso-Betanzos, Amparo
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT II, 2015, 9095 : 15 - 28
  • [9] Filter-Wrapper Approach to Feature Selection of GPCR Protein
    Kamal, Nor Ashikin Mohamad
    Abu Bakar, Azuraliza
    Zainudin, Suhaila
    [J]. 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS 2015, 2015, : 693 - 698
  • [10] A filter-based feature selection approach in multilabel classification
    Shaikh, Rafia
    Rafi, Muhammad
    Mahoto, Naeem Ahmed
    Sulaiman, Adel
    Shaikh, Asadullah
    [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2023, 4 (04):