Clustering analysis of SAGE data using a Poisson approach

被引:0
|
作者
Li Cai
Haiyan Huang
Seth Blackshaw
Jun S Liu
Connie Cepko
Wing H Wong
机构
[1] Dana-Farber Cancer Institute,Department of Research Computing
[2] Harvard School of Public Health,Department of Biostatistics
[3] Harvard Medical School,Department of Genetics
[4] Harvard University,Department of Statistics
[5] Science Center,Department of Statistics
[6] University of California,Department of Neuroscience
[7] Johns Hopkins University School of Medicine,undefined
来源
关键词
Additional Data File; Massive Parallel Signature Sequencing; Joint Likelihood; Unknown Biological Function; RIKEN cDNAs;
D O I
暂无
中图分类号
学科分类号
摘要
Serial analysis of gene expression (SAGE) data have been poorly exploited by clustering analysis owing to the lack of appropriate statistical methods that consider their specific properties. We modeled SAGE data by Poisson statistics and developed two Poisson-based distances. Their application to simulated and experimental mouse retina data show that the Poisson-based distances are more appropriate and reliable for analyzing SAGE data compared to other commonly used distances or similarity measures such as Pearson correlation or Euclidean distance.
引用
收藏
相关论文
共 50 条
  • [41] Clustering the Patent Data Using K-Means Approach
    Anuranjana
    Mittas, Nisha
    Mehrotra, Deepti
    SOFTWARE ENGINEERING (CSI 2015), 2019, 731 : 639 - 645
  • [42] A semi-supervised clustering approach using labeled data
    Taghizabet, A.
    Tanha, J.
    Amini, A.
    Mohammadzadeh, J.
    SCIENTIA IRANICA, 2023, 30 (01) : 104 - 115
  • [43] NEW CLUSTERING APPROACH TO CHIP FLOORPLAN USING FUNCTIONAL DATA
    HARADA, I
    ADACHI, T
    ELECTRONICS LETTERS, 1987, 23 (17) : 900 - 902
  • [44] A new approach to clustering biological data using message passing
    Geng, HM
    Bastola, D
    Ali, H
    2004 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, 2004, : 493 - 494
  • [45] A Clustering Approach using PSO Optimization Technique for Data Mining
    Dagde, Rashmi
    Radke, Dipeeka
    Lokhande, Ashwini
    PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2019, : 427 - 431
  • [46] Dynamic clustering of gene expression data using a fuzzy approach
    Sirbu, Adela-Maria
    Czibula, Gabriela
    Bocicor, Maria-Iuliana
    16TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2014), 2014, : 220 - 227
  • [47] Two Way Clustering of Microarray Data Using a Hybrid Approach
    Malutan, Raul
    Belean, Bogdan
    Gomez Vilda, Pedro
    Borda, Monica
    2011 34TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2011, : 417 - 420
  • [48] A Data Clustering Approach Using Bees Algorithm with a Memory Scheme
    Nemmich, Mohamed Amine
    Debbat, Fatima
    Slimane, Mohamed
    ADVANCES IN COMPUTING SYSTEMS AND APPLICATIONS, 2019, 50 : 261 - 270
  • [49] A Novel Approach for Clustering Data Streams Using Granularity Technique
    Kaneriya, Ankur
    Shukla, Madhu
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND APPLICATIONS (ICACEA), 2015, : 586 - 590
  • [50] Clustering High-Dimensional Stock Data using Data Mining Approach
    Indriyanti, Dhea
    Dhini, Arian
    2019 16TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM2019), 2019,