Sequencing, Mapping, and Analysis of 27,455 Maize Full-Length cDNAs

Cited by: 121
Authors
Soderlund, Carol [1]
Descour, Anne [1]
Kudrna, Dave [2]
Bomhoff, Matthew [1]
Boyd, Lomax [1]
Currie, Jennifer [2]
Angelova, Angelina [2]
Collura, Kristi [2]
Wissotski, Marina [2]
Ashley, Elizabeth [2]
Morrow, Darren [3]
Fernandes, John [3]
Walbot, Virginia [3]
Yu, Yeisoo [2]
Affiliations
[1] Univ Arizona, Inst BIO5, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Plant Sci, Arizona Genom Inst, Tucson, AZ 85721 USA
[3] Stanford Univ, Dept Biol, Stanford, CA 94305 USA
Source
PLOS GENETICS | 2009 / Volume 5 / Issue 11
Funding
National Science Foundation (USA);
Keywords
GENOME; GENE; TRANSCRIPTION; EXPRESSION; RESOURCE; SORGHUM; ANNOTATION; DUPLICATE; DIVERSITY; CYTOSCAPE;
DOI
10.1371/journal.pgen.1000740
Chinese Library Classification number
Q3 [Genetics];
Subject classification code
071007 ; 090102 ;
Abstract
Full-length cDNA (FLcDNA) sequencing establishes the precise primary structure of individual gene transcripts. From two libraries representing 27 B73 tissues and abiotic stress treatments, 27,455 high-quality FLcDNAs were sequenced. The average transcript length was 1.44 kb, including 218 bases and 321 bases of 5′ and 3′ UTR, respectively, with 8.6% of the FLcDNAs encoding predicted proteins of fewer than 100 amino acids. Approximately 94% of the FLcDNAs were stringently mapped to the maize genome. Although nearly two-thirds of this genome is composed of transposable elements (TEs), only 5.6% of the FLcDNAs contained TE sequences in coding or UTR regions. Approximately 7.2% of the FLcDNAs are putative transcription factors, suggesting that rare transcripts are well-enriched in our FLcDNA set. Protein similarity searching identified 1,737 maize transcripts not present in rice, sorghum, Arabidopsis, or poplar annotated genes. A strict FLcDNA assembly generated 24,467 non-redundant sequences, of which 88% have non-maize protein matches. The FLcDNAs were also assembled with 41,759 FLcDNAs in GenBank from other projects, where semi-strict parameters were used to identify 13,368 potentially unique non-redundant sequences from this project. The libraries, ESTs, and FLcDNA sequences produced from this project are publicly available. The annotated EST and FLcDNA assemblies are available through the maize FLcDNA web resource (www.maizecdna.org).
Pages: 13