Dataset complexity in gene expression based cancer classification using ensembles of k-nearest neighbors

Cited by: 23
Authors
Okun, Oleg [1 ]
Priisalu, Helen [2 ]
Affiliations
[1] Univ Oulu, Elect & Informat Engn Dept, Oulu 90014, Finland
[2] Tallinn Univ Technol, Inst Cybernet, EE-12618 Tallinn, Estonia
Keywords
Pattern recognition; Gene expression; Cancer classification; k-nearest neighbors; Ensemble of classifiers; FEATURE-SELECTION; MICROARRAY DATA; DNA; PREDICTION; CLASSIFIERS; TUMOR;
DOI
10.1016/j.artmed.2008.08.004
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Objective: We explore the link between dataset complexity, which determines how difficult a dataset is to classify, and classification performance, measured by the low-variance, low-bias bolstered resubstitution error of k-nearest neighbor classifiers. Methods and material: Gene expression based cancer classification is used as the task in this study. Six gene expression datasets covering different types of cancer constitute the test data. Results: Through extensive simulation coupled with the copula method for analysis of association in bivariate data, we show that dataset complexity and bolstered resubstitution error are associated in terms of dependence. As a result, we propose a new scheme for generating ensembles of classifiers that selects low-complexity feature subsets for ensemble members; according to the dependence relation found, such subsets yield accurate members. Conclusion: Experiments with six gene expression datasets demonstrate that our ensemble generating scheme, based on the dependence between dataset complexity and classification error, is superior to the single best classifier in the ensemble and to the traditional ensemble construction scheme that ignores dataset complexity. (c) 2008 Elsevier B.V. All rights reserved.
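The ensemble scheme summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the complexity score (one over one plus the largest per-feature Fisher discriminant ratio), the plain majority vote, the toy data, and every function name below are assumptions chosen for illustration, and the bolstered resubstitution error and copula analysis are not reproduced. The sketch draws random feature subsets, keeps the ones with the lowest complexity, and lets one k-nearest neighbor classifier per kept subset vote.

```python
import random
from collections import Counter


def complexity(X, y, feats):
    """Illustrative complexity score (an assumption, not the paper's measure):
    1 / (1 + largest per-feature Fisher ratio) over the subset.
    Assumes exactly two classes labelled 0 and 1; lower means easier."""
    best = 0.0
    for f in feats:
        a = [row[f] for row, c in zip(X, y) if c == 0]
        b = [row[f] for row, c in zip(X, y) if c == 1]
        ma, mb = sum(a) / len(a), sum(b) / len(b)
        va = sum((v - ma) ** 2 for v in a) / len(a)
        vb = sum((v - mb) ** 2 for v in b) / len(b)
        # The small constant guards against division by zero.
        best = max(best, (ma - mb) ** 2 / (va + vb + 1e-12))
    return 1.0 / (1.0 + best)


def knn_predict(X, y, x, feats, k=3):
    """Majority vote among the k training rows nearest to x,
    using squared Euclidean distance restricted to feats."""
    dists = sorted(
        (sum((row[f] - x[f]) ** 2 for f in feats), c) for row, c in zip(X, y)
    )
    return Counter(c for _, c in dists[:k]).most_common(1)[0][0]


def build_ensemble(X, y, n_candidates, subset_size, n_members, rng):
    """Draw random feature subsets, drop duplicates, and keep the
    n_members subsets with the lowest complexity score."""
    n_features = len(X[0])
    candidates = {
        tuple(sorted(rng.sample(range(n_features), subset_size)))
        for _ in range(n_candidates)
    }
    scored = sorted((complexity(X, y, s), s) for s in candidates)
    return scored[:n_members]


def ensemble_predict(X, y, x, members, k=3):
    """Majority vote over one k-NN classifier per selected subset."""
    votes = [knn_predict(X, y, x, feats, k) for _, feats in members]
    return Counter(votes).most_common(1)[0][0]


# Toy data (hypothetical): feature 0 separates the classes, features 1-5 are noise.
rng = random.Random(0)
X, y = [], []
for label in (0, 1):
    for _ in range(10):
        X.append([10 * label + rng.random()] + [rng.random() for _ in range(5)])
        y.append(label)

members = build_ensemble(X, y, n_candidates=60, subset_size=2, n_members=3, rng=rng)
print(members)
print(ensemble_predict(X, y, [0.5] * 6, members))
```

On this toy data the selected subsets should be those containing the informative feature, which is the behavior the abstract attributes to low-complexity selection. A traditional scheme, by contrast, would keep randomly drawn subsets without ranking them.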
Pages: 151-162
Number of pages: 12