TrAnnoScope: A Modular Snakemake Pipeline for Full-Length Transcriptome Analysis and Functional Annotation

Authors
Pektas, Aysevil [1]
Panitz, Frank [1,2]
Thomsen, Bo [1]
Affiliations
[1] Aarhus Univ, Dept Mol Biol & Genet, DK-8000 Aarhus, Denmark
[2] Nat Resources Inst Finland Luke, Appl Stat Methods, Turku 20520, Finland
Keywords
RNA-Seq; reproducible pipeline; high-performance computing (HPC); transcriptome analysis; functional annotation; Iso-Seq; Snakemake; long-read sequencing; PROTEIN; DATABASE; MODEL
DOI
10.3390/genes15121547
Chinese Library Classification
Q3 [Genetics]
Subject classification codes
071007; 090102
Abstract
Background/Objectives: Transcriptome assembly and functional annotation are essential in understanding gene expression and biological function. Nevertheless, many existing pipelines lack the flexibility to integrate both short- and long-read sequencing data or fail to provide a complete, customizable workflow for transcriptome analysis, particularly for non-model organisms.
Methods: We present TrAnnoScope, a transcriptome analysis pipeline designed to process Illumina short-read and PacBio long-read data. The pipeline provides a complete, customizable workflow to generate high-quality, full-length (FL) transcripts with broad functional annotation. Its modular design allows users to adapt specific analysis steps for other sequencing platforms or data types. The pipeline encompasses steps from quality control to functional annotation, employing tools and established databases such as SwissProt, Pfam, Gene Ontology (GO), the Kyoto Encyclopedia of Genes and Genomes (KEGG), and Eukaryotic Orthologous Groups (KOG). As a case study, TrAnnoScope was applied to RNA-Seq and Iso-Seq data from zebra finch brain, ovary, and testis tissue.
Results: The zebra finch transcriptome generated by TrAnnoScope from the brain, ovary, and testis tissue demonstrated strong alignment with the reference genome (99.63%), and it was found that 93.95% of the matched protein sequences in the zebra finch proteome were captured as nearly complete. Functional annotation provided matches to known protein databases and assigned relevant functional terms to the majority of the transcripts.
Conclusions: TrAnnoScope successfully integrates short and long sequencing technologies to generate transcriptomes with minimal user input. Its modularity and ease of use make it a valuable tool for researchers analyzing complex datasets, particularly for non-model organisms.
Pages: 15