Feature selection and gene clustering from gene expression data

被引:4
|
作者
Mitra, P [1 ]
Majumder, DD [1 ]
机构
[1] Indian Stat Inst, Machine Intelligent Unit, Kolkata 700108, W Bengal, India
关键词
microarray; maximal information compression index; cancer classification; representation entropy; data mining;
D O I
10.1109/ICPR.2004.1334213
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article we describe an algorithm for feature selection and gene clustering from high dimensional gene expression data. The method is based on measuring similarity between feat\ures/genes whereby redundancy therein is removed. This does not need any search and therefore is fast. A novel feature similarity measure, called maximum information compression index, is used. The feature selection algorithm also obtains gene clusters in a multiscale fashion. The superiority of the algorithm, in terms of speed and performance, is established on a real life molecular cancer classification dataset.
引用
收藏
页码:343 / 346
页数:4
相关论文
共 50 条
  • [1] Informative Feature Clustering and Selection for Gene Expression Data
    Yang, Yuqi
    Yin, Pengshuai
    Luo, Zhihang
    Gu, Wenwen
    Chen, Renjie
    Wu, Qingyao
    [J]. IEEE ACCESS, 2019, 7 : 169174 - 169184
  • [2] PSO Based Feature Selection for Clustering Gene Expression Data
    Deepthi, P. S.
    Thampi, Sabu M.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [3] Gene ontology driven feature selection from microarray gene expression data
    Qi, Jianlong
    Tang, Jian
    [J]. PROCEEDINGS OF THE 2006 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2006, : 428 - +
  • [4] Simultaneous Feature Selection and Semi-supervised Clustering for Gene-Expression Data
    Alok, Abhay Kumar
    Saha, Sriparna
    Ekbal, Asif
    Kanekar, Neha
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [5] CGUFS: A clustering-guided unsupervised feature selection algorithm for gene expression data
    Xu, Zhaozhao
    Yang, Fangyuan
    Wang, Hong
    Sun, Junding
    Zhu, Hengde
    Wang, Shuihua
    Zhang, Yudong
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (09)
  • [6] Minimum redundancy feature selection from microarray gene expression data
    Ding, C
    Peng, HC
    [J]. PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, : 523 - 528
  • [7] On the selection of appropriate distances for gene expression data clustering
    Pablo A Jaskowiak
    Ricardo JGB Campello
    Ivan G Costa
    [J]. BMC Bioinformatics, 15
  • [8] On the selection of appropriate distances for gene expression data clustering
    Jaskowiak, Pablo A.
    Campello, Ricardo J. G. B.
    Costa, Ivan G.
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [9] Data mining for feature selection in gene expression autism data
    Latkowski, Tomasz
    Osowski, Stanislaw
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (02) : 864 - 872
  • [10] Bi-dimensional principal gene feature selection from big gene expression data
    Hou, Xiaoqian
    Hou, Jingyu
    Huang, Guangyan
    [J]. PLOS ONE, 2022, 17 (12):