Detecting Fusion Genes in Long-Read Transcriptome Sequencing Data with FUGAREC

Cited by: 2
Authors
Masuda K. [1]
Sota Y. [2]
Matsuda H. [1]
Affiliations
[1] Graduate School of Information Science and Technology, Osaka University, Osaka, Suita
[2] Graduate School of Medicine, Osaka University, Osaka, Suita
Funding
Japan Society for the Promotion of Science
Keywords
fusion gene; long-read sequencing; RNA sequencing
DOI
10.2197/ipsjtbio.17.1
Abstract
Fusion genes are important targets and biomarkers for cancer therapy. Methods of accurately detecting fusion genes are needed in clinical practice. RNA-seq is widely used to detect active fusion genes. Long-read RNA-seq can sequence the full length of mRNA, and long-read RNA-seq is expected to detect fusion genes that cannot be detected by short-read RNA-seq. However, long-read RNA-seq has high basecalling error rates, and gap sequences may occur near the breakpoints of long reads that are not aligned to the genome. When gap sequences occur, it is impossible to identify the correct fusion gene or breakpoint using existing methods. To address these challenges in fusion gene detection, we introduce a novel algorithm, FUGAREC (fusion detection with gap re-alignment and breakpoint clustering). FUGAREC uniquely combines gap sequence re-alignment with breakpoint clustering. This approach not only enhances the detection of previously undetectable fusion genes but also significantly reduces false positives. We demonstrate that FUGAREC has high fusion gene detection performance on both simulated data and sequenced data of a breast cancer cell line. © 2024 Information Processing Society of Japan.
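The abstract names breakpoint clustering as one of the two ingredients of FUGAREC but does not give its details. As an illustration of the general idea only, the following minimal sketch groups candidate breakpoints by gene pair, merges positions that lie within a small window of each other, and keeps clusters supported by several reads. The function name, the `window` and `min_support` parameters, the greedy chaining rule, and the input tuple layout are all assumptions made for this example; they are not taken from the paper and should not be read as FUGAREC's actual procedure.

```python
from collections import defaultdict


def cluster_breakpoints(candidates, window=20, min_support=2):
    """Cluster candidate fusion breakpoints (illustrative sketch, not FUGAREC).

    candidates: iterable of (gene5, gene3, pos5, pos3) tuples, one per read.
    window:     hypothetical tolerance in bases for merging nearby breakpoints.
    min_support: hypothetical minimum number of reads required to keep a cluster.
    """
    # Group per-read breakpoint estimates by the pair of partner genes.
    by_pair = defaultdict(list)
    for gene5, gene3, pos5, pos3 in candidates:
        by_pair[(gene5, gene3)].append((pos5, pos3))

    clusters = []
    for pair, points in by_pair.items():
        points.sort()
        current = [points[0]]
        for point in points[1:]:
            last = current[-1]
            # Greedy chaining: extend the cluster while both coordinates
            # stay within `window` of the previously added point.
            if abs(point[0] - last[0]) <= window and abs(point[1] - last[1]) <= window:
                current.append(point)
            else:
                clusters.append((pair, current))
                current = [point]
        clusters.append((pair, current))

    calls = []
    for pair, members in clusters:
        # Poorly supported clusters are treated as likely false positives.
        if len(members) < min_support:
            continue
        mid = len(members) // 2
        # Report the middle element of each sorted coordinate list as a
        # consensus breakpoint that tolerates a few noisy reads.
        consensus5 = sorted(m[0] for m in members)[mid]
        consensus3 = sorted(m[1] for m in members)[mid]
        calls.append({
            "genes": pair,
            "breakpoint": (consensus5, consensus3),
            "support": len(members),
        })
    return calls


if __name__ == "__main__":
    # Hypothetical gene names and coordinates, for illustration only.
    reads = [
        ("GENE_A", "GENE_B", 100, 500),
        ("GENE_A", "GENE_B", 103, 498),
        ("GENE_A", "GENE_B", 98, 502),
        ("GENE_C", "GENE_D", 10, 20),
    ]
    for call in cluster_breakpoints(reads):
        print(call)
```

In this sketch a single noisy read cannot produce a fusion call on its own, which is one simple way clustering can reduce false positives; how the published method weighs support and how it re-aligns gap sequences is described in the paper itself.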
Pages: 1-9
Page count: 8