Comprehensive annotation of the Chinese tree shrew genome by large-scale RNA sequencing and long-read isoform sequencing

被引:26
|
作者
Ye, Mao-Sen [1 ,2 ]
Zhang, Jin-Yan [1 ,2 ]
Yu, Dan-Dan [1 ,3 ]
Xu, Min [1 ,3 ]
Xu, Ling [1 ,3 ]
Lv, Long-Bao [3 ]
Zhu, Qi-Yun [4 ]
Fan, Yu [1 ,3 ]
Yao, Yong-Gang [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Key Lab Anim Models & Human Dis Mech, Kunming Inst Zool, KIZ CUHK Joint Lab Bioresources & Mol Res Common, Kunming 650204, Yunnan, Peoples R China
[2] Univ Chinese Acad Sci, Kunming Coll Life Sci, Kunming 650204, Yunnan, Peoples R China
[3] Chinese Acad Sci, Natl Resource Ctr Nonhuman Primates, Kunming Inst Zool, Natl Res Facil Phenotyp & Genet Anal Model Anim P, Kunming 650107, Yunnan, Peoples R China
[4] Chinese Acad Agr Sci, Lanzhou Vet Res Inst, State Key Lab Vet Etiol Biol, Lanzhou 730046, Gansu, Peoples R China
基金
中国国家自然科学基金;
关键词
Tree shrew; Genome annotation; Transcriptome; Gene family; Virus infection; TUPAIA-BELANGERI; ANIMAL-MODELS; INDUCED MYOPIA; GENE; PROTEIN; FAMILY; TRANSCRIPTOME; BIOGENESIS; GENERATION; PRIMATES;
D O I
10.24272/j.issn.2095-8137.2021.272
中图分类号
Q95 [动物学];
学科分类号
071002 ;
摘要
The Chinese tree shrew (Tupaia belangeri chinensis) is emerging as an important experimental animal in multiple fields of biomedical research. Comprehensive reference genome annotation for both mRNA and long non-coding RNA (lncRNA) is crucial for developing animal models using this species. In the current study, we collected a total of 234 high-quality RNA sequencing (RNA-seq) datasets and two long-read isoform sequencing (ISO-seq) datasets and improved the annotation of our previously assembled high-quality chromosome-level tree shrew genome. We obtained a total of 3 514 newly annotated coding genes and 50 576 lncRNA genes. We also characterized the tissue-specific expression patterns and alternative splicing patterns of mRNAs and lncRNAs and mapped the orthologous relationships among 11 mammalian species using the current annotated genome. We identified 144 tree shrew-specific gene families, including interleukin 6 (IL6) and STT3 oligosaccharyltransferase complex catalytic subunit B (STT3B), which underwent significant changes in size. Comparison of the overall expression patterns in tissues and pathways across four species (human, rhesus monkey, tree shrew, and mouse) indicated that tree shrews are more similar to primates than to mice at the tissue-transcriptome level. Notably, the newly annotated purine rich element binding protein A (PURA) gene and the STT3B gene family showed dysregulation upon viral infection. The updated version of the tree shrew genome annotation (KIZ version 3: TS_3.0) is available at http://www. treeshrewdb.org and provides an essential reference for basic and biomedical studies using tree shrew animal models.
引用
收藏
页码:692 / 709
页数:18
相关论文
共 50 条
  • [21] The Research of a Large-Scale Analysis Platform for MNS Blood Group Identification Based on Long-Read Sequencing
    Xu, Hua
    Su, Xiaomin
    Zuo, Qinqin
    Zhang, Liangzi
    Chu, Xiaoyue
    TRANSFUSION MEDICINE REVIEWS, 2024, 38 (04)
  • [22] High resolution annotation of zebrafish transcriptome using long-read sequencing
    Nudelman, German
    Frasca, Antonio
    Kent, Brandon
    Sadler, Kirsten C.
    Sealfon, Stuart C.
    Walsh, Martin J.
    Zaslavsky, Elena
    GENOME RESEARCH, 2018, 28 (09) : 1415 - 1425
  • [23] Unraveling metagenomics through long-read sequencing: a comprehensive review
    Kim, Chankyung
    Pongpanich, Monnat
    Porntaveetus, Thantrira
    JOURNAL OF TRANSLATIONAL MEDICINE, 2024, 22 (01)
  • [24] Unraveling metagenomics through long-read sequencing: a comprehensive review
    Chankyung Kim
    Monnat Pongpanich
    Thantrira Porntaveetus
    Journal of Translational Medicine, 22
  • [25] Comparison of long-read methods for sequencing and assembly of a plant genome
    Murigneux, Valentine
    Rai, Subash Kumar
    Furtado, Agnelo
    Bruxner, Timothy J. C.
    Tian, Wei
    Harliwong, Ivon
    Wei, Hanmin
    Yang, Bicheng
    Ye, Qianyu
    Anderson, Ellis
    Mao, Qing
    Drmanac, Radoje
    Wang, Ou
    Peters, Brock A.
    Xu, Mengyang
    Wu, Pei
    Topp, Bruce
    Coin, Lachlan J. M.
    Henry, Robert J.
    GIGASCIENCE, 2020, 9 (12):
  • [26] Long-read sequencing of the zebrafish genome reorganizes genomic architecture
    Chernyavskaya, Yelena
    Zhang, Xiaofei
    Liu, Jinze
    Blackburn, Jessica
    BMC GENOMICS, 2022, 23 (01)
  • [27] Expanding an expanded genome: long-read sequencing of Trypanosoma cruzi
    Berna, Luisa
    Rodriguez, Matias
    Laura Chiribao, Maria
    Parodi-Talice, Adriana
    Pita, Sebastian
    Rijo, Gaston
    Alvarez-Valin, Fernando
    Robello, Carlos
    MICROBIAL GENOMICS, 2018, 4 (05):
  • [28] Long-read genome sequencing for the molecular diagnosis of neurodevelopmental disorders
    Hiatt, Susan M.
    Lawlor, James M. J.
    Handley, Lori H.
    Ramaker, Ryne C.
    Rogers, Brianne B.
    Partridge, E. Christopher
    Boston, Lori Beth
    Williams, Melissa
    Plott, Christopher B.
    Jenkins, Jerry
    Gray, David E.
    Holt, James M.
    Bowling, Kevin M.
    Bebin, E. Martina
    Grimwood, Jane
    Schmutz, Jeremy
    Cooper, Gregory M.
    HUMAN GENETICS AND GENOMICS ADVANCES, 2021, 2 (02):
  • [29] Long-read sequencing of the zebrafish genome reorganizes genomic architecture
    Yelena Chernyavskaya
    Xiaofei Zhang
    Jinze Liu
    Jessica Blackburn
    BMC Genomics, 23
  • [30] Long-read sequencing to understand genome biology and cell function
    Kraft, Florian
    Kurth, Ingo
    INTERNATIONAL JOURNAL OF BIOCHEMISTRY & CELL BIOLOGY, 2020, 126