Evolinc: A Tool for the Identification and Evolutionary Comparison of Long Intergenic Non-coding RNAs

Cited by: 23
Authors
Nelson, Andrew D. L. [1]
Devisetty, Upendra K. [2]
Palos, Kyle [1]
Haug-Baltzell, Asher K. [3]
Lyons, Eric [2, 3]
Beilstein, Mark A. [1]
Affiliations
[1] Univ Arizona, Sch Plant Sci, Beilstein Lab, Tucson, AZ 85721 USA
[2] Univ Arizona, Bio5, CyVerse, Tucson, AZ USA
[3] Univ Arizona, Genet Grad Interdisciplinary Grp, Lyons Lab, Tucson, AZ USA
Funding
National Science Foundation (USA);
Keywords
lincRNAs; comparative genomics; pipeline; evolution; molecular; comparative transcriptomics; TRANSPOSABLE ELEMENTS; GENOME; REVEALS; ARCHITECTURE; ANNOTATION; LINCRNAS; PLATFORM; LESSONS; MAMMALS;
DOI
10.3389/fgene.2017.00052
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007; 090102;
Abstract
Long intergenic non-coding RNAs (lincRNAs) are an abundant and functionally diverse class of eukaryotic transcripts. Reported lincRNA repertoires in mammals vary, but are commonly in the thousands to tens of thousands of transcripts, covering ~90% of the genome. In addition to elucidating function, there is particular interest in understanding the origin and evolution of lincRNAs. Aside from mammals, lincRNA populations have been sparsely sampled, precluding evolutionary analyses focused on their emergence and persistence. Here we present Evolinc, a two-module pipeline designed to facilitate lincRNA discovery and characterize aspects of lincRNA evolution. The first module (Evolinc-I) is a lincRNA identification workflow that also facilitates downstream differential expression analysis and genome browser visualization of identified lincRNAs. The second module (Evolinc-II) is a genomic and transcriptomic comparative analysis workflow that determines the phylogenetic depth to which a lincRNA locus is conserved within a user-defined group of related species. Here we validate lincRNA catalogs generated with Evolinc-I against previously annotated Arabidopsis and human lincRNA data. Evolinc-I recapitulated earlier findings and uncovered an additional 70 Arabidopsis and 43 human lincRNAs. We demonstrate the usefulness of Evolinc-II by examining the evolutionary histories of a public dataset of 5,361 Arabidopsis lincRNAs. We used Evolinc-II to winnow this dataset to 40 lincRNAs conserved across species in Brassicaceae. Finally, we show how Evolinc-II can be used to recover the evolutionary history of a known lincRNA, the human telomerase RNA (TERC). These latter analyses revealed unexpected duplication events as well as the loss and subsequent acquisition of a novel TERC locus in the lineage leading to mice and rats. The Evolinc pipeline is currently integrated in CyVerse's Discovery Environment and is free for use by researchers.
Pages: 12