DNA microarray data analysis: Effective feature selection for accurate cancer classification

Cited by: 1
Authors
Patra, Jagdish C. [1]
Lim, Goh P. [1]
Meher, Pramod K. [1]
Ang, Ee Luang [1]
Affiliations
[1] Nanyang Technological University, School of Computer Engineering, Singapore, Singapore
DOI
10.1109/IJCNN.2007.4370965
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Accurate classification of DNA microarray data is vital for cancer diagnosis and treatment. For greater accuracy, a preferable strategy is to make a decision based on the result of a single classifier that is trained with various aspects of the data space. Creating an optimal classifier for DNA analysis is difficult because the data contain only a few samples with a large number of features. Usually, different feature sets are provided for classifiers to learn from. If the feature sets provide similar information, the classifiers trained on them cannot improve performance, because they make the same errors and cannot compensate for one another. In this paper, we adopt correlation analysis of feature selection methods as a guideline for selecting the features that the classifiers learn from. We use a negative correlation method to generate feature sets that are mutually exclusive. Each classifier is trained on a different feature set, chosen by correlation analysis, to classify cancer precisely. We evaluated the performance of this approach on two benchmark datasets. Experimental results show that classifiers trained on feature sets that are negatively correlated with each other produce the best recognition rates on both datasets.
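The correlation-analysis idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the feature selection measures, so the three scoring functions below (Pearson correlation, signal-to-noise ratio, negative Euclidean distance), the helper names, and the synthetic data are all assumptions chosen for illustration. The sketch scores every gene with each measure, correlates the resulting score vectors across measures, and picks the pair of measures whose scores are least correlated, so that classifiers trained on their top-ranked genes see different information.

```python
import numpy as np

# NOTE: the scoring measures and all names here are illustrative assumptions,
# not the measures used in the paper.

def pearson_score(X, y):
    """Pearson correlation of each gene (column of X) with the class label."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    return (Xc * yc[:, None]).sum(axis=0) / denom

def signal_to_noise_score(X, y):
    """Difference of class means divided by the sum of class standard deviations."""
    pos, neg = X[y == 1], X[y == 0]
    return (pos.mean(axis=0) - neg.mean(axis=0)) / (pos.std(axis=0) + neg.std(axis=0))

def euclidean_score(X, y):
    """Negative Euclidean distance between each gene profile and the label vector."""
    return -np.sqrt(((X - y[:, None]) ** 2).sum(axis=0))

def least_correlated_pair(scores):
    """Return the two measure names whose gene-score vectors correlate least."""
    names = list(scores)
    corr = np.corrcoef(np.vstack([scores[n] for n in names]))
    rows, cols = np.triu_indices(len(names), k=1)  # each unordered pair once
    best = int(np.argmin(corr[rows, cols]))
    i, j = rows[best], cols[best]
    return names[i], names[j], float(corr[i, j])

def top_k(score, k):
    """Indices of the k highest-scoring genes."""
    return set(np.argsort(score)[::-1][:k].tolist())

# Synthetic stand-in for microarray data: few samples, many genes.
rng = np.random.default_rng(0)
y = np.repeat([0, 1], 15)            # 30 samples, two classes
X = rng.normal(size=(30, 200))       # 200 hypothetical genes
X[y == 1, :10] += 1.5                # make the first 10 genes informative

scores = {
    "pearson": pearson_score(X, y),
    "snr": signal_to_noise_score(X, y),
    "euclidean": euclidean_score(X, y),
}
first, second, r = least_correlated_pair(scores)
set_a = top_k(scores[first], 25)
set_b = top_k(scores[second], 25)
print(first, second, round(r, 3), "shared genes:", len(set_a & set_b))
```

In this sketch each of the two selected gene sets would then be handed to its own classifier. Note that low correlation between score vectors does not by itself make the two top-25 sets mutually exclusive; removing shared genes from one of the sets would be an additional step.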
Pages: 260-265
Number of pages: 6
Related papers
50 in total
  • [41] Microarray classification with hierarchical data representation and novel feature selection criteria
    Bosio, Mattia
    Bellot, Pau
    Salembier, Philippe
    Oliveras Verges, Albert
    IEEE 12TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS & BIOENGINEERING, 2012, : 344 - 349
  • [42] An Approach Based on Resampling and Feature Selection to Improve the Classification of Microarray Data
    Soleymani, Nafiseh
    Moattar, Mohammad Hussein
    2018 6TH IRANIAN JOINT CONGRESS ON FUZZY AND INTELLIGENT SYSTEMS (CFIS), 2018, : 61 - 64
  • [43] Linear regression-based feature selection for microarray data classification
    Hasan, Md Abid
    Hasan, Md Kamrul
    Mottalib, M. Abdul
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 11 (02) : 167 - 179
  • [44] Feature Genes Selection and Classification with SVM for Microarray Data of Lung Tissue
    Du, Si-Hao
    Jeng, Jin-Tsong
    Su, Shun-Feng
    Hsiao, Chih-Ching
    2014 JOINT 7TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 15TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2014, : 1054 - 1058
  • [45] Iterative ensemble feature selection for multiclass classification of imbalanced microarray data
    Yang, Junshan
    Zhou, Jiarui
    Zhu, Zexuan
    Ma, Xiaoliang
    Ji, Zhen
    JOURNAL OF BIOLOGICAL RESEARCH-THESSALONIKI, 2016, 23
  • [46] An ensemble approach to variable selection for classification of DNA microarray data
    Masulli, F
    Rovetta, S
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 3089 - 3094
  • [47] Feature selection, mutual information, and the classification of high-dimensional patterns: Applications to image classification and microarray data analysis
    Boyan Bonev
    Francisco Escolano
    Miguel Cazorla
    Pattern Analysis and Applications, 2008, 11 : 309 - 319
  • [48] L1-Regulated Feature Selection and Classification of Microarray Cancer Data Using Deep Learning
    Shekar, B. H.
    Dagnew, Guesh
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE PROCESSING, CVIP 2018, VOL 2, 2020, 1024 : 227 - 242
  • [49] Genetic algorithm-based feature selection with manifold learning for cancer classification using microarray data
    Wang, Zixuan
    Zhou, Yi
    Takagi, Tatsuya
    Song, Jiangning
    Tian, Yu-Shi
    Shibuya, Tetsuo
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [50] Feature Selection and Classification of Microarray Data for Cancer Prediction Using MapReduce Implementation of Random Forest Algorithm
    Dhanalakshmi, R.
    Khaire, Utkarsh M.
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2019, 78 (03) : 158 - 161