Annotating genes of known and unknown function by large-scale coexpression analysis

Cited by: 130
Authors
Horan, Kevin [1]
Jang, Charles [1]
Bailey-Serres, Julia [1]
Mittler, Ron [3, 4]
Shelton, Christian [2]
Harper, Jeff F. [3]
Zhu, Jian-Kang [1]
Cushman, John C. [3]
Gollery, Martin [5]
Girke, Thomas [1]
Affiliations
[1] Univ Calif Riverside, Dept Bot & Plant Sci, Riverside, CA 92521 USA
[2] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
[3] Univ Nevada, Dept Biochem & Mol Biol, Reno, NV 89557 USA
[4] Hebrew Univ Jerusalem, Dept Plant Sci, IL-91904 Jerusalem, Israel
[5] TimeLogic, Incline Village, NV 89451 USA
DOI
10.1104/pp.108.117366
Chinese Library Classification number
Q94 [Botany]
Discipline classification code
071001
Abstract
About 40% of the proteins encoded in eukaryotic genomes are proteins of unknown function (PUFs). Their functional characterization remains one of the main challenges in modern biology. In this study we identified the PUF-encoding genes from Arabidopsis (Arabidopsis thaliana) using a combination of sequence similarity, domain-based, and empirical approaches. Large-scale gene expression analyses of 1,310 publicly available Affymetrix chips were performed to associate the identified PUF genes with regulatory networks and biological processes of known function. To generate quality results, the study was restricted to expression sets with replicated samples. First, genome-wide clustering and gene function enrichment analysis of clusters allowed us to associate 1,541 PUF genes with tightly coexpressed genes for proteins of known function (PKFs). Over 70% of them could be assigned to more specific biological process annotations than the ones available in the current Gene Ontology release. The most highly overrepresented functional categories in the obtained clusters were ribosome assembly, photosynthesis, and cell wall pathways. Interestingly, the majority of the PUF genes appeared to be controlled by the same regulatory networks as most PKF genes, because clusters enriched in PUF genes were extremely rare. Second, large-scale analysis of differentially expressed genes was applied to identify a comprehensive set of abiotic stress-response genes. This analysis resulted in the identification of 269 PKF and 104 PUF genes that responded to a wide variety of abiotic stresses, whereas 608 PKF and 206 PUF genes responded predominantly to specific stress treatments. The provided coexpression and differentially expressed gene data represent an important resource for guiding future functional characterization experiments of PUF and PKF genes. Finally, the public Plant Gene Expression Database (http://bioweb.ucr.edu/PED) was developed as part of this project to provide efficient access and mining tools for the vast gene expression data of this study.
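The idea of associating a PUF gene with tightly coexpressed PKF genes can be illustrated with a minimal sketch. It assumes Pearson correlation as the similarity measure and a fixed cutoff; the abstract does not state which measure, clustering method, or threshold the study used, and the gene names, expression values, and `min_r` value below are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        # A flat profile carries no coexpression signal.
        return 0.0
    return cov / (sd_x * sd_y)


def coexpressed_known_genes(profiles, known_annotations, puf_gene, min_r=0.9):
    """Return annotated genes whose profile correlates with puf_gene at r >= min_r.

    min_r is an illustrative cutoff, not a value taken from the study.
    """
    target = profiles[puf_gene]
    hits = []
    for gene, annotation in known_annotations.items():
        r = pearson(target, profiles[gene])
        if r >= min_r:
            hits.append((gene, annotation, r))
    # Strongest correlation first.
    return sorted(hits, key=lambda hit: hit[2], reverse=True)


# Hypothetical expression values across four samples.
profiles = {
    "PUF_1": [1.0, 2.0, 3.0, 4.0],
    "PKF_A": [2.0, 4.1, 6.0, 8.2],
    "PKF_B": [4.0, 3.0, 2.0, 1.0],
    "PKF_C": [1.0, 1.0, 1.0, 1.0],
}
known_annotations = {
    "PKF_A": "photosynthesis",
    "PKF_B": "cell wall",
    "PKF_C": "ribosome assembly",
}

hits = coexpressed_known_genes(profiles, known_annotations, "PUF_1")
for gene, annotation, r in hits:
    print(f"PUF_1 is coexpressed with {gene} ({annotation}), r = {r:.3f}")
```

In this toy data only `PKF_A` passes the cutoff, so `PUF_1` would tentatively inherit a photosynthesis-related annotation. A real analysis would add a statistical enrichment test over whole clusters rather than rely on single gene pairs.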
Pages: 41-57
Page count: 17