A semiautomated approach to gene discovery through expressed sequence tag data mining: Discovery of new human transporter genes

被引:13
|
作者
Brown, S
Chang, JL
Sadee, W
Babbitt, PC
机构
[1] Univ Calif San Francisco, Sch Pharm, Dept Pharmaceut Chem, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Sch Pharm, Dept Biopharmaceut Sci, San Francisco, CA 94143 USA
[3] MIT, Whitehead Inst, Ctr Genome Res, Cambridge, MA 02141 USA
[4] Ohio State Univ, Med Ctr, Columbus, OH 43210 USA
来源
AAPS PHARMSCI | 2003年 / 5卷 / 01期
关键词
major facilitator superfamily; transporters; superfamily analysis; expressed sequence tags; data mining;
D O I
10.1208/ps050101
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Identification and functional characterization of the genes in the human genome remain a major challenge. A principal source of publicly available information used for this purpose is the National Center for Biotechnology Information database of expressed sequence tags (dbEST), which contains over 4 million human ESTs. To extract the information buried in this data more effectively, we have developed a semiautomated method to mine dbEST for uncharacterized human genes. Starting with a single protein input sequence, a family of related proteins from all species is compiled. This entire family is then used to mine the human EST database for new gene candidates. Evaluation of putative new gene candidates in the context of a family of characterized proteins provides a framework for inference of the structure and function of the new genes. When applied to a test data set of 28 families within the major facilitator superfamily (MFS) of membrane transporters, our protocol found 73 previously characterized human MFS genes and 43 new MFS gene candidates. Development of this approach provided insights into the problems and pitfalls of automated data mining using public databases.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] A semiautomated approach to gene discovery through expressed sequence tag data mining: Discovery of new human transporter genes
    Shoshana Brown
    Jean l. Chang
    Wolfgang Sadee
    Patricia C. Babbitt
    AAPS PharmSci, 5
  • [2] Gene discovery through expressed sequence tag sequencing in Trypanosoma cruzi
    Verdun, RE
    Di Paolo, N
    Urmenyi, TP
    Rondinelli, E
    Frasch, ACC
    Sanchez, DO
    INFECTION AND IMMUNITY, 1998, 66 (11) : 5393 - 5398
  • [3] Gene discovery in Daphnia by expressed sequence tag sequencing
    Colbourne, JK
    Lindquist, E
    Bauer, D
    Brokstein, P
    Thomas, WK
    Andrews, J
    INTEGRATIVE AND COMPARATIVE BIOLOGY, 2005, 45 (06) : 979 - 979
  • [4] Update of the gene discovery program in Schistosoma mansoni with the expressed sequence tag approach
    Rabelo, EML
    Franco, GR
    Azevedo, VAC
    Pena, HB
    Santos, TM
    Meira, WSF
    Rodrigues, NA
    Ortega, JM
    Pena, SDJ
    MEMORIAS DO INSTITUTO OSWALDO CRUZ, 1997, 92 (05): : 625 - 629
  • [5] Discovery of three genes specifically expressed in human prostate by expressed sequence tag database analysis
    Vasmatzis, G
    Essand, M
    Brinkmann, U
    Lee, B
    Pastan, I
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (01) : 300 - 304
  • [6] An expressed sequence tag (EST) data mining strategy succeeding in the discovery of new G-protein coupled receptors
    Wittenberger, T
    Schaller, HC
    Hellebrand, S
    JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (03) : 799 - 813
  • [7] AU-rich transient response transcripts in the human genome: expressed sequence tag clustering and gene discovery approach
    Khabar, KSA
    Bakheet, T
    Williams, BRG
    GENOMICS, 2005, 85 (02) : 165 - 175
  • [8] Immune gene discovery by expressed sequence tag analysis of spleen in the duck (Anas platyrhynchos)
    Xia, Jianguo
    Radford, Cynthia
    Guo, Xiaoxin
    Magor, Katharine E.
    DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY, 2007, 31 (03): : 272 - 285
  • [9] Novel function discovery through sequence and structural data mining
    Lobb, Briallen
    Doxey, Andrew C.
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2016, 38 : 53 - 61
  • [10] Discovery of Phytophthora infestans Genes Expressed in Planta through Mining of cDNA Libraries
    Sierra, Roberto
    Rodriguez-R, Luis M.
    Chaves, Diego
    Pinzon, Andres
    Grajales, Alejandro
    Rojas, Alejandro
    Mutis, Gabriel
    Cardenas, Martha
    Burbano, Daniel
    Jimenez, Pedro
    Bernal, Adriana
    Restrepo, Silvia
    PLOS ONE, 2010, 5 (03):