Annotating genes of known and unknown function by large-scale coexpression analysis

Cited by: 130
Authors
Horan, Kevin [1]
Jang, Charles [1]
Bailey-Serres, Julia [1]
Mittler, Ron [3, 4]
Shelton, Christian [2]
Harper, Jeff F. [3]
Zhu, Jian-Kang [1]
Cushman, John C. [3]
Gollery, Martin [5]
Girke, Thomas [1]
Affiliations
[1] Univ Calif Riverside, Dept Bot & Plant Sci, Riverside, CA 92521 USA
[2] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
[3] Univ Nevada, Dept Biochem & Mol Biol, Reno, NV 89557 USA
[4] Hebrew Univ Jerusalem, Dept Plant Sci, IL-91904 Jerusalem, Israel
[5] TimeLogic, Incline Village, NV 89451 USA
DOI
10.1104/pp.108.117366
Chinese Library Classification (CLC) number
Q94 [Botany]
Discipline classification code
071001
Abstract
About 40% of the proteins encoded in eukaryotic genomes are proteins of unknown function (PUFs). Their functional characterization remains one of the main challenges in modern biology. In this study, we identified the PUF-encoding genes from Arabidopsis (Arabidopsis thaliana) using a combination of sequence similarity, domain-based, and empirical approaches. Large-scale gene expression analyses of 1,310 publicly available Affymetrix chips were performed to associate the identified PUF genes with regulatory networks and biological processes of known function. To generate quality results, the study was restricted to expression sets with replicated samples. First, genome-wide clustering and gene function enrichment analysis of clusters allowed us to associate 1,541 PUF genes with tightly coexpressed genes for proteins of known function (PKFs). Over 70% of them could be assigned to more specific biological process annotations than the ones available in the current Gene Ontology release. The most highly overrepresented functional categories in the obtained clusters were ribosome assembly, photosynthesis, and cell wall pathways. Interestingly, the majority of the PUF genes appeared to be controlled by the same regulatory networks as most PKF genes, because clusters enriched in PUF genes were extremely rare. Second, large-scale analysis of differentially expressed genes was applied to identify a comprehensive set of abiotic stress-response genes. This analysis resulted in the identification of 269 PKF and 104 PUF genes that responded to a wide variety of abiotic stresses, whereas 608 PKF and 206 PUF genes responded predominantly to specific stress treatments. The provided coexpression and differentially expressed gene data represent an important resource for guiding future functional characterization experiments of PUF and PKF genes. Finally, the public Plant Gene Expression Database (http://bioweb.ucr.edu/PED) was developed as part of this project to provide efficient access and mining tools for the vast gene expression data of this study.
Pages: 41-57
Number of pages: 17