Ensemble of sparse classifiers for high-dimensional biological data

Cited by: 7
Authors
Kim, Sunghan [1, 2]
Scalzo, Fabien [2]
Telesca, Donatello [3]
Hu, Xiao [2]
Affiliations
[1] E Carolina Univ, Coll Technol & Comp Sci, Dept Engn, Greenville, NC 27858 USA
[2] Univ Calif Los Angeles, David Geffen Sch Med, Dept Neurosurg, Neural Syst & Dynam Lab, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Sch Publ Hlth, Dept Biostat, Los Angeles, CA 90095 USA
Keywords
ensemble sparse classifier; l(0)-norm solution; feature selection; mass spectrometry; sparse solvers; OVARIAN-CANCER IDENTIFICATION; PROTEOMIC PATTERNS; SELECTION; SERUM; RECONSTRUCTION;
DOI
10.1504/IJDMB.2015.069416
Chinese Library Classification number
Q [Biological Sciences];
Subject classification code
07 ; 0710 ; 09 ;
Abstract
Biological data are often high in dimension while the number of samples is small. In such cases, the performance of classification can be improved by reducing the dimension of the data, which is referred to as feature selection. Recently, a novel feature selection method has been proposed that exploits the sparsity of high-dimensional biological data, in which a small subset of features accounts for most of the variance in the dataset. In this study we propose a new classification method for high-dimensional biological data, which performs both feature selection and classification within a single framework. Our proposed method utilises a sparse linear solution technique and the bootstrap aggregating algorithm. We tested its performance on four public mass spectrometry cancer datasets against two conventional classification techniques, Support Vector Machines and Adaptive Boosting. The results demonstrate that our proposed method classifies more accurately across the cancer datasets than these conventional techniques.
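The combination named in the abstract, a sparse linear solver wrapped in bootstrap aggregating, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the solver, so an l1-penalised least-squares fit by coordinate descent is swapped in here as a stand-in for the sparse solution step, and every name below (`fit_sparse_linear`, `fit_bagged_ensemble`, `selection_frequency`, `make_toy_data`) as well as the synthetic data and parameter values are assumptions made for illustration only.

```python
import random


def soft_threshold(v, t):
    """Proximal operator of the l1 penalty: shrink v toward zero by t."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def fit_sparse_linear(X, y, lam=0.2, n_sweeps=50):
    """Cyclic coordinate descent for (1/2n)*||y - Xw||^2 + lam*||w||_1.

    Stand-in sparse solver (an assumption, not the paper's solver).
    """
    n, p = len(X), len(X[0])
    w = [0.0] * p
    resid = list(y)  # residual y - Xw; equals y while w is all zeros
    col_sq = [sum(row[j] ** 2 for row in X) / n for j in range(p)]
    for _ in range(n_sweeps):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Correlation of feature j with the residual that excludes feature j.
            rho = sum(X[i][j] * (resid[i] + X[i][j] * w[j]) for i in range(n)) / n
            new_wj = soft_threshold(rho, lam) / col_sq[j]
            delta = new_wj - w[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                w[j] = new_wj
    return w


def fit_bagged_ensemble(X, y, n_models=15, lam=0.2, seed=0):
    """Fit one sparse model per bootstrap resample (sampling with replacement)."""
    rng = random.Random(seed)
    n = len(X)
    models = []
    for _ in range(n_models):
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(fit_sparse_linear([X[i] for i in idx], [y[i] for i in idx], lam))
    return models


def predict(models, x):
    """Majority vote over the signs of the individual linear scores."""
    votes = 0
    for w in models:
        score = sum(wj * xj for wj, xj in zip(w, x))
        votes += 1 if score >= 0 else -1
    return 1 if votes >= 0 else -1


def selection_frequency(models):
    """Fraction of ensemble members giving each feature a nonzero weight."""
    p = len(models[0])
    return [sum(1 for w in models if w[j] != 0.0) / len(models) for j in range(p)]


def make_toy_data(n=40, p=20, shift=1.5, seed=1):
    """Synthetic data (hypothetical): only features 0 and 1 carry class signal."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = 1 if i % 2 == 0 else -1
        row = [rng.gauss(0.0, 1.0) for _ in range(p)]
        row[0] += shift * label
        row[1] -= shift * label
        X.append(row)
        y.append(label)
    return X, y


if __name__ == "__main__":
    X, y = make_toy_data()
    models = fit_bagged_ensemble(X, y)
    accuracy = sum(predict(models, x) == t for x, t in zip(X, y)) / len(y)
    print("training accuracy:", accuracy)
    print("selection frequency per feature:", selection_frequency(models))
```

In this sketch the same fitted weights serve both purposes the abstract mentions: the vote over the ensemble classifies a sample, and the per-feature selection frequency acts as a rough feature-selection signal. The accuracy printed is on the training data only, so it says nothing about generalisation.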
Pages: 167-183
Page count: 17
Related papers
50 in total
  • [21] HyperSurface classifiers ensemble for high dimensional data sets
    Zhao, Xiu-Rong
    He, Qing
    Shi, Zhong-Zhi
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 1299 - 1304
  • [22] Group Learning for High-Dimensional Sparse Data
    Cherkassky, Vladimir
    Chen, Hsiang-Han
    Shiao, Han-Tai
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [23] Sparse PCA for High-Dimensional Data With Outliers
    Hubert, Mia
    Reynkens, Tom
    Schmitt, Eric
    Verdonck, Tim
    TECHNOMETRICS, 2016, 58 (04) : 424 - 434
  • [24] Similarity Learning for High-Dimensional Sparse Data
    Liu, Kuan
    Bellet, Aurelien
    Sha, Fei
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 653 - 662
  • [25] A method for learning a sparse classifier in the presence of missing data for high-dimensional biological datasets
    Severson, Kristen A.
    Monian, Brinda
    Love, J. Christopher
    Braatz, Richard D.
    BIOINFORMATICS, 2017, 33 (18) : 2897 - 2905
  • [26] Learning classifiers for high-dimensional micro-array data
    Bosin, Andrea
    Dessi, Nicoletta
    Pes, Barbara
    APPLIED ARTIFICIAL INTELLIGENCE, 2006, : 593 - +
  • [27] Boosting threshold classifiers for high-dimensional data in functional genomics
    Lausser, Ludwig
    Buchholz, Malte
    Kestler, Hans A.
    ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, PROCEEDINGS, 2008, 5064 : 147 - +
  • [28] Principal component analysis for sparse high-dimensional data
    Raiko, Tapani
    Ilin, Alexander
    Karhunen, Juha
    NEURAL INFORMATION PROCESSING, PART I, 2008, 4984 : 566 - 575
  • [29] Sparse kernel methods for high-dimensional survival data
    Evers, Ludger
    Messow, Claudia-Martina
    BIOINFORMATICS, 2008, 24 (14) : 1632 - 1638
  • [30] Sparse meta-analysis with high-dimensional data
    He, Qianchuan
    Zhang, Hao Helen
    Avery, Christy L.
    Lin, D. Y.
    BIOSTATISTICS, 2016, 17 (02) : 205 - 220