Active Learning Using Fuzzy-Rough Nearest Neighbor Classifier for Cancer Prediction from Microarray Gene Expression Data

被引:3
|
作者
Kumar, Ansuman [1 ]
Halder, Anindya [1 ]
机构
[1] North Eastern Hill Univ, Dept Comp Applicat, Tura Campus, Shillong 794002, Meghalaya, India
关键词
Active learning; cancer prediction; microarray gene expression data; fuzzy set; rough set; TUMOR CLASSIFICATION; CLUSTER-ANALYSIS; ALGORITHM;
D O I
10.1142/S0218001420570013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer prediction from gene expression data is a very challenging area of research in the field of computational biology and bioinformatics. Conventional classifiers are often unable to achieve desired accuracy due to the lack of 'sufficient' training patterns in terms of clinically labeled samples. Active learning technique, in this respect, can be useful as it automatically finds only few most informative (or confusing) samples to get their class labels from the experts and those are added to the training set, which can improve the accuracy of the prediction consequently. A novel active learning technique using fuzzy-rough nearest neighbor classifier (ALFRNN) is proposed in this paper for cancer classification from microarray gene expression data. The proposed ALFRNN method is capable of dealing with the uncertainty, overlapping and indiscernibility often present in cancer subtypes (classes) of the gene expression data. The performance of the proposed method is tested using different real-life microarray gene expression cancer datasets and its performance is compared with five other state-of-the-art techniques (out of which three are active learning-based and two are traditional classification methods) in terms of percentage accuracy, precision, recall, F-1-measures and kappa. Superiority of the proposed method over the other counterpart algorithms is established from experimental results for cancer prediction and results of the paired t-test confirm statistical significance of the results in favor of the proposed method for almost all the datasets.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] Active learning using rough fuzzy classifier for cancer prediction from microarray gene expression data
    Halder, Anindya
    Kumar, Ansuman
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 92
  • [2] Dataset condensation using OWA fuzzy-rough set-based nearest neighbor classifier
    Amiri, Mehran
    Jensen, Richard
    Eftekhari, Mahdi
    Mac Parthalain, Neil
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2016, : 1934 - 1941
  • [3] Fuzzy-Rough Nearest Neighbour Classifier for Person Authentication using EEG Signaals
    Liew, Siaw-Hong
    Choo, Yun-Huoy
    Low, Yin Fen
    [J]. 2013 INTERNATIONAL CONFERENCE ON FUZZY THEORY AND ITS APPLICATIONS (IFUZZY 2013), 2013, : 316 - 321
  • [4] Prediction of moving objects' k-nearest neighbor based on fuzzy-rough sets theory
    Hong, Xiaoguang
    Yuan, Yan
    Hu, Xinglei
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2007, : 407 - 411
  • [5] Active Learning Using Fuzzy k-NN for Cancer Classification from Microarray Gene Expression Data
    Halder, Anindya
    Dey, Samrat
    Kumar, Ansuman
    [J]. ADVANCES IN COMMUNICATION AND COMPUTING, 2015, 347 : 103 - 113
  • [6] A fuzzy-rough nearest neighbor classifier combined with consistency-based subset evaluation and instance selection for automated diagnosis of breast cancer
    Onan, Aytug
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (20) : 6844 - 6852
  • [7] Genetic diagnosis of cancer by fuzzy-rough gene selection and the complementary hierarchical fuzzy classifier
    Shaeiri, Zahra
    Ghaderi, Reza
    [J]. BIO-MEDICAL MATERIALS AND ENGINEERING, 2011, 21 (01) : 37 - 52
  • [8] Simultaneous Gene Selection and Weighting in Nearest Neighbor Classifier for Gene Expression Data
    Alarcon-Paredes, Antonio
    Adolfo Alonso, Gustavo
    Cabrera, Eduardo
    Cuevas-Valencia, Rene
    [J]. BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2017, PT II, 2017, 10209 : 372 - 381
  • [9] Microarray Data Classification using Fuzzy K-Nearest Neighbor
    Kumar, Mukesh
    Rath, Santanu Ku
    [J]. 2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 1032 - 1038
  • [10] Learning of a Robusted Nearest Neighbor Classifier Using Multiple Training Data
    Malach, Tobias
    Pomenkova, Jitka
    [J]. PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, (IWSSIP 2016), 2016, : 47 - 50