Automated multidimensional phenotypic profiling using large public microarray repositories

被引:20
|
作者
Xu, Min [1 ]
Li, Wenyuan [1 ]
James, Gareth M. [2 ]
Mehan, Michael R. [1 ]
Zhou, Xianghong Jasmine [1 ]
机构
[1] Univ So Calif, Dept Biol Sci, Los Angeles, CA 90089 USA
[2] Univ So Calif, Marshall Sch Business, Los Angeles, CA 90089 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
genotype-phenotype association; phenotype prediction; phenotype profiling; REFRACTORY-ANEMIA; PHENOME; LEUKEMIA; NETWORK;
D O I
10.1073/pnas.0900883106
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Phenotypes are complex, and difficult to quantify in a high-throughput fashion. The lack of comprehensive phenotype data can prevent or distort genotype-phenotype mapping. Here, we describe "PhenoProfiler,'' a computational method that enables in silico phenotype profiling. Drawing on the principle that similar gene expression patterns are likely to be associated with similar phenotype patterns, PhenoProfiler supplements the missing quantitative phenotype information for a given microarray dataset based on other well-characterized microarray datasets. We applied our method to 587 human microarray datasets covering >14,000 samples, and confirmed that the predicted phenotype profiles are highly consistent with true phenotype descriptions. PhenoProfiler offers several unique capabilities: (i) automated, multidimensional phenotype profiling, facilitating the analysis and treatment design of complex diseases; (ii) the extrapolation of phenotype profiles beyond provided classes; and (iii) the detection of confounding phenotype factors that could otherwise bias biological inferences. Finally, because no direct comparisons are made between gene expression values from different datasets, the method can use the entire body of cross-platform microarray data. This work has produced a compendium of phenotype profiles for the National Center for Biotechnology Information GEO datasets, which can facilitate an unbiased understanding of the transcriptome-phenome mapping. The continued accumulation of microarray data will further increase the power of PhenoProfiler, by increasing the variety and the quality of phenotypes to be profiled.
引用
收藏
页码:12323 / 12328
页数:6
相关论文
共 50 条
  • [41] Expression profiling using a hexamer-based universal microarray
    Roth, ME
    Feng, L
    McConnell, KJ
    Schaffer, PJ
    Guerra, CE
    Affourtit, JP
    Piper, KR
    Guccione, L
    Hariharan, J
    Ford, MJ
    Powell, SW
    Krishnaswamy, H
    Lane, J
    Guccione, L
    Intrieri, G
    Merkel, JS
    Perbost, C
    Valerio, A
    Zolla, B
    Graham, CD
    Hnath, J
    Michaelson, C
    Wang, RX
    Ying, B
    Halling, C
    Parman, CE
    Raha, D
    Orr, B
    Jedrzkiewicz, B
    Liao, J
    Tevelev, A
    Mattessich, MJ
    Kranz, DM
    Lacey, M
    Kaufman, JC
    Kim, J
    Latimer, DR
    Lizardi, PM
    NATURE BIOTECHNOLOGY, 2004, 22 (04) : 418 - 426
  • [42] Microarray retriever: a web-based tool for searching and large scale retrieval of public microarray data
    Ivliev, Alexander E.
    't Hoen, Peter A. C.
    Villerius, Michel P.
    den Dunnen, Johan T.
    Brandt, Bernd W.
    NUCLEIC ACIDS RESEARCH, 2008, 36 : W327 - W331
  • [43] Automated cooking and frying control using a gas sensor microarray
    Ehrmann, S
    Jüngst, J
    Goschnick, J
    SENSORS AND ACTUATORS B-CHEMICAL, 2000, 66 (1-3) : 43 - 45
  • [44] Automated segmentation of microarray spots using fuzzy clustering approaches
    Wang, YP
    Gunampally
    Reddy, M
    Cai, WW
    2005 IEEE Workshop on Machine Learning for Signal Processing (MLSP), 2005, : 387 - 391
  • [45] Automated cooking and frying control using a gas sensor microarray
    Ehrmann, S
    Jüngst, J
    Goschnick, J
    TECHNICAL DIGEST OF THE SEVENTH INTERNATIONAL MEETING ON CHEMICAL SENSORS, 1998, : 861 - 863
  • [46] Development of a Learner Profiling System Using Multidimensional Characteristics Analysis
    Park, Kinam
    Ji, Hyesung
    Lim, Heuiseok
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [47] Software Development Using Context Aware Searching Of Components In Large Repositories
    Paul, Sayan
    Makkar, Tushar
    Chandrasekaran, K.
    2015 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION & AUTOMATION (ICCCA), 2015, : 765 - 772
  • [48] Using Language-Based Search in Mining Large Software Repositories
    Abu Bakar, Normi Sham Awang
    COMPUTATIONAL LINGUISTICS AND RELATED FIELDS, 2011, 27 : 160 - 168
  • [49] Rapid and automated multidimensional fluorescence microscopy profiling of 3D human breast cultures
    Park, Catherine C.
    Georgescu, Walter
    Polyzos, Aris
    Pham, Christopher
    Ahmed, Kazi M.
    Zhang, Hui
    Costes, Sylvain V.
    INTEGRATIVE BIOLOGY, 2013, 5 (04) : 681 - 691
  • [50] Phenotypic profiling of antifungal agents using multiparametric yeast signatures
    Gebre, Abraham Abera
    Okada, Hiroki
    Kim, Cholgwang
    Kubo, Karen
    Ohnuki, Shinsuke
    Ohya, Yoshikazu
    YEAST, 2015, 32 : S178 - S179