Feature selection and gene clustering from gene expression data

被引:4
|
作者
Mitra, P [1 ]
Majumder, DD [1 ]
机构
[1] Indian Stat Inst, Machine Intelligent Unit, Kolkata 700108, W Bengal, India
关键词
microarray; maximal information compression index; cancer classification; representation entropy; data mining;
D O I
10.1109/ICPR.2004.1334213
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article we describe an algorithm for feature selection and gene clustering from high dimensional gene expression data. The method is based on measuring similarity between feat\ures/genes whereby redundancy therein is removed. This does not need any search and therefore is fast. A novel feature similarity measure, called maximum information compression index, is used. The feature selection algorithm also obtains gene clusters in a multiscale fashion. The superiority of the algorithm, in terms of speed and performance, is established on a real life molecular cancer classification dataset.
引用
收藏
页码:343 / 346
页数:4
相关论文
共 50 条
  • [21] Gene encoder: a feature selection technique through unsupervised deep learning-based clustering for large gene expression data
    Feras Uzma
    Abdallah Al-Obeidat
    Babar Tubaishat
    Zahid Shah
    [J]. Neural Computing and Applications, 2022, 34 : 8309 - 8331
  • [22] Gene encoder: a feature selection technique through unsupervised deep learning-based clustering for large gene expression data
    Uzma
    Al-Obeidat, Feras
    Tubaishat, Abdallah
    Shah, Babar
    Halim, Zahid
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (11): : 8309 - 8331
  • [23] Gene Selection for Cancer Clustering Analysis Based on Expression Data
    Xu, Taosheng
    Su, Ning
    Wang, Rujing
    Song, Liangtu
    [J]. PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 516 - 519
  • [24] Attribute clustering for grouping, selection, and classification of gene expression data
    Au, WH
    Chan, KCC
    Wong, AKC
    Wang, Y
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2005, 2 (02) : 83 - 101
  • [25] Feature extraction and clustering tools for analysing gene expression data from DNA microarrays
    Nick Sampas
    Zohar Yakhini
    Glenda Delenstarr
    Cynthia Enderwick
    Amir Ben-Dor
    Stuart Kim
    Ron Shamir
    [J]. Nature Genetics, 1999, 23 (Suppl 3) : 71 - 72
  • [26] The γ-OMP Algorithm for Feature Selection With Application to Gene Expression Data
    Tsagris, Michail
    Papadovasilakis, Zacharias
    Lakiotaki, Kleanthi
    Tsamardinos, Ioannis
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (02) : 1214 - 1224
  • [27] A hybrid feature selection algorithm for gene expression data classification
    Lu, Huijuan
    Chen, Junying
    Yan, Ke
    Jin, Qun
    Xue, Yu
    Gao, Zhigang
    [J]. NEUROCOMPUTING, 2017, 256 : 56 - 62
  • [28] A hybrid feature selection approach for microarray gene expression data
    Tan, Feng
    Fu, Xuezheng
    Wang, Hao
    Zhang, Yanqing
    Bourgeois, Anu
    [J]. COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 678 - 685
  • [29] Feature selection as a preprocessing step for classification in gene expression data
    Borges, Helyane Bronoski
    Nievola, Julio Cesar
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2007, : 157 - +
  • [30] A Hybrid Feature Selection Method Using Gene Expression Data
    Chuang, Li-Yeh
    Wu, Kuo-Chuan
    Yang, Cheng-Hong
    [J]. 2009 9TH IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING, 2009, : 100 - +