mRMRe: an R package for parallelized mRMR ensemble feature selection

被引:173
|
作者
De Jay, Nicolas [1 ]
Papillon-Cavanagh, Simon [1 ]
Olsen, Catharina [2 ]
El-Hachem, Nehme [1 ]
Bontempi, Gianluca [2 ]
Haibe-Kains, Benjamin [1 ]
机构
[1] Inst Rech Clin Montreal, Bioinformat & Computat Biol Lab, Integrat Syst Biol Axis, Montreal, PQ H2W 1R7, Canada
[2] Univ Libre Bruxelles, Machine Learning Grp, Dept Comp Sci, B-1050 Brussels, Belgium
关键词
D O I
10.1093/bioinformatics/btt383
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Feature selection is one of the main challenges in analyzing high- throughput genomic data. Minimum redundancy maximum relevance (mRMR) is a particularly fast feature selection method for finding a set of both relevant and complementary features. Here we describe the mRMRe R package, in which the mRMR technique is extended by using an ensemble approach to better explore the feature space and build more robust predictors. To deal with the computational complexity of the ensemble approach, the main functions of the package are implemented and parallelized in C using the openMP Application Programming Interface. Results: Our ensemble mRMR implementations outperform the classical mRMR approach in terms of prediction accuracy. They identify genes more relevant to the biological context and may lead to richer biological interpretations. The parallelized functions included in the package show significant gains in terms of run-time speed when compared with previously released packages.
引用
收藏
页码:2365 / 2368
页数:4
相关论文
共 50 条
  • [1] GPU-Accelerated CatBoost-Forest for Hyperspectral Image Classification Via Parallelized mRMR Ensemble Subspace Feature Selection
    Samat, Alim
    Li, Erzhu
    Du, Peijun
    Liu, Sicong
    Xia, Junshi
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 3200 - 3214
  • [2] EFS: an ensemble feature selection tool implemented as R-package and web-application
    Neumann, Ursula
    Genze, Nikita
    Heider, Dominik
    [J]. BIODATA MINING, 2017, 10
  • [3] EFS: an ensemble feature selection tool implemented as R-package and web-application
    Ursula Neumann
    Nikita Genze
    Dominik Heider
    [J]. BioData Mining, 10
  • [4] GameRank: R package for feature selection and construction
    Henneges, Carsten
    Paulson, Joseph N.
    [J]. BIOINFORMATICS, 2022, 38 (20) : 4840 - 4842
  • [5] Feature Selection Methods in the Framework of mRMR
    Wang, Xiujuan
    Tao, Yuanrui
    Zheng, Kangfeng
    [J]. 2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 1490 - 1495
  • [6] Intelligent churn prediction in telecom: employing mRMR feature selection and RotBoost based ensemble classification
    Idris, Adnan
    Khan, Asifullah
    Lee, Yeon Soo
    [J]. APPLIED INTELLIGENCE, 2013, 39 (03) : 659 - 672
  • [7] Intelligent churn prediction in telecom: employing mRMR feature selection and RotBoost based ensemble classification
    Adnan Idris
    Asifullah Khan
    Yeon Soo Lee
    [J]. Applied Intelligence, 2013, 39 : 659 - 672
  • [8] Adapt the mRMR Criterion for Unsupervised Feature Selection
    Xu, Junling
    [J]. ADVANCED DATA MINING AND APPLICATIONS (ADMA 2010), PT II, 2010, 6441 : 111 - 121
  • [9] Stable Feature Selection Using MRMR Algorithm
    Gulgezen, Gokhan
    Cataltepe, Zehra
    Yu, Lei
    [J]. 2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 818 - +
  • [10] Feature Selection with the R Package MXM: Discovering Statistically Equivalent Feature Subsets
    Lagani, Vincenzo
    Athineou, Giorgos
    Farcomeni, Alessio
    Tsagris, Michail
    Tsamardinos, Ioannis
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2017, 80 (07):