SOAPfuse: an algorithm for identifying fusion transcripts from paired-end RNA-Seq data

Cited by: 133
Authors
Jia, Wenlong [1, 2]
Qiu, Kunlong [1, 2]
He, Minghui [1, 2]
Song, Pengfei [2]
Zhou, Quan [1, 2, 3]
Zhou, Feng [2, 4]
Yu, Yuan [2]
Zhu, Dandan [2]
Nickerson, Michael L. [5]
Wan, Shengqing [1, 2]
Liao, Xiangke [6]
Zhu, Xiaoqian [6, 7]
Peng, Shaoliang [6, 7]
Li, Yingrui [1, 2]
Wang, Jun [1, 2, 8, 9]
Guo, Guangwu [1, 2]
Affiliations
[1] BGI Tech Solut Co Ltd, Shenzhen 518083, Peoples R China
[2] BGI Shenzhen, Shenzhen 518083, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Chengdu 610054, Peoples R China
[4] S China Univ Technol, Guangzhou Higher Educ Mega Ctr, Sch Biosci & Bioengn, Guangzhou 510006, Guangdong, Peoples R China
[5] NCI, Canc & Inflammat Program, NIH, Frederick, MD 21702 USA
[6] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Hunan, Peoples R China
[7] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Hunan, Peoples R China
[8] Univ Copenhagen, Novo Nordisk Fdn Ctr Basic Metab Res, DK-1165 Copenhagen, Denmark
[9] Univ Copenhagen, Dept Biol, DK-1165 Copenhagen, Denmark
Source
GENOME BIOLOGY | 2013 / Vol. 14 / Issue 02
Funding
National High Technology Research and Development Program of China (863 Program);
Keywords
GENE FUSIONS; BREAST-CANCER; IDENTIFICATION; ULTRAFAST; DISCOVERY; ALIGNMENT; TOOL;
DOI
10.1186/gb-2013-14-2-r12
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
We have developed a new method, SOAPfuse, to identify fusion transcripts from paired-end RNA-Seq data. SOAPfuse applies an improved partial exhaustion algorithm to construct a library of fusion junction sequences, which can be used to efficiently identify fusion events, and employs a series of filters to nominate high-confidence fusion transcripts. Compared with other released tools, SOAPfuse achieves higher detection efficiency and consumes fewer computing resources. We applied SOAPfuse to RNA-Seq data from two bladder cancer cell lines and confirmed 15 fusion transcripts, including several novel events common to both cell lines. SOAPfuse is available at http://soap.genomics.org.cn/soapfuse.html.
Pages: 15
Related papers
50 items in total
  • [31] deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data
    McPherson, Andrew
    Hormozdiari, Fereydoun
    Zayed, Abdalnasser
    Giuliany, Ryan
    Ha, Gavin
    Sun, Mark G. F.
    Griffith, Malachi
    Moussavi, Alireza Heravi
    Senz, Janine
    Melnyk, Nataliya
    Pacheco, Marina
    Marra, Marco A.
    Hirst, Martin
    Nielsen, Torsten O.
    Sahinalp, S. Cenk
    Huntsman, David
    Shah, Sohrab P.
    PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (05)
  • [32] QUANTIFYING ALTERNATIVE SPLICING FROM PAIRED-END RNA-SEQUENCING DATA
    Rossell, David
    Attolini, Camille Stephan-Otto
    Kroiss, Manuel
    Stoecker, Almond
    ANNALS OF APPLIED STATISTICS, 2014, 8 (01): 309 - 330
  • [33] DETECTION OF BACTERIAL SMALL TRANSCRIPTS FROM RNA-SEQ DATA: A COMPARATIVE ASSESSMENT
    Pena-Castillo, Lourdes
    Grull, Marc
    Mulligan, Martin E.
    Lang, Andrew S.
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2016, 2016: 456 - 467
  • [34] Identifying differential expression for RNA-seq data with no replication
    Gim, Jungsoo
    Park, Taesung
    2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [35] A Hybrid Clustering Algorithm for Identifying Cell Types from Single-Cell RNA-Seq Data
    Zhu, Xiaoshu
    Li, Hong-Dong
    Xu, Yunpei
    Guo, Lilu
    Wu, Fang-Xiang
    Duan, Guihua
    Wang, Jianxin
    GENES, 2019, 10 (02)
  • [36] An Efficient Algorithm for Sensitively Detecting Circular RNA from RNA-seq Data
    Zhang, Xuanping
    Wang, Yidan
    Zhao, Zhongmeng
    Wang, Jiayin
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2018, 19 (10)
  • [37] Bellerophontes: an RNA-Seq data analysis framework for chimeric transcripts discovery based on accurate fusion model
    Abate, Francesco
    Acquaviva, Andrea
    Paciello, Giulia
    Foti, Carmelo
    Ficarra, Elisa
    Ferrarini, Alberto
    Delledonne, Massimo
    Iacobucci, Ilaria
    Soverini, Simona
    Martinelli, Giovanni
    Macii, Enrico
    BIOINFORMATICS, 2012, 28 (16) : 2114 - 2121
  • [38] Identification of fusion genes in breast cancer by paired-end RNA-sequencing
    Henrik Edgren
    Astrid Murumagi
    Sara Kangaspeska
    Daniel Nicorici
    Vesa Hongisto
    Kristine Kleivi
    Inga H Rye
    Sandra Nyberg
    Maija Wolf
    Anne-Lise Borresen-Dale
    Olli Kallioniemi
    Genome Biology, 12
  • [39] Identification of fusion genes in breast cancer by paired-end RNA-sequencing
    Edgren, Henrik
    Murumagi, Astrid
    Kangaspeska, Sara
    Nicorici, Daniel
    Hongisto, Vesa
    Kleivi, Kristine
    Rye, Inga H.
    Nyberg, Sandra
    Wolf, Maija
    Borresen-Dale, Anne-Lise
    Kallioniemi, Olli
    GENOME BIOLOGY, 2011, 12 (01):
  • [40] Transcriptator: Computational Pipeline to Annotate Transcripts and Assembled Reads from RNA-Seq Data
    Tripathi, Kumar Parijat
    Evangelista, Daniela
    Cassandra, Raffaele
    Guarracino, Mario R.
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, CIBB 2014, 2015, 8623 : 156 - 169