Detecting Fusion Genes in Long-Read Transcriptome Sequencing Data with FUGAREC

被引:2
|
作者
Masuda K. [1 ]
Sota Y. [2 ]
Matsuda H. [1 ]
机构
[1] Graduate School of Information Science and Technology, Osaka University, Osaka, Suita
[2] Graduate School of Medicine, Osaka University, Osaka, Suita
基金
日本学术振兴会;
关键词
fusion gene; long-read sequencing; RNA sequencing;
D O I
10.2197/ipsjtbio.17.1
中图分类号
学科分类号
摘要
Fusion genes are important targets and biomarkers for cancer therapy. Methods of accurately detecting fusion genes are needed in clinical practice. RNA-seq is widely used to detect active fusion genes. Long-read RNA-seq can sequence the full length of mRNA, and long-read RNA-seq is expected to detect fusion genes that cannot be detected by short-read RNA-seq. However, long-read RNA-seq has high basecalling error rates, and gap sequences may occur near the breakpoints of long reads that are not aligned to the genome. When gap sequences occur, it is impossible to identify the correct fusion gene or breakpoint using existing methods. To address these challenges in fusion gene detection, we introduce a novel algorithm, FUGAREC (fusion detection with gap re-alignment and breakpoint clustering). FUGAREC uniquely combines gap sequence re-alignment with breakpoint clustering. This approach not only enhances the detection of previously undetectable fusion genes but also significantly reduces false positives. We demonstrate that FUGAREC has high fusion gene detection performance on both simulated data and sequenced data of a breast cancer cell line. © 2024 Information Processing Society of Japan.
引用
收藏
页码:1 / 9
页数:8
相关论文
共 50 条
  • [41] Single-molecule long-read sequencing facilitates shrimp transcriptome research
    Zeng, Digang
    Chen, Xiuli
    Peng, Jinxia
    Yang, Chunling
    Peng, Min
    Zhu, Weilin
    Xie, Daxiang
    He, Pingping
    Wei, Pinyuan
    Lin, Yong
    Zhao, Yongzhen
    Chen, Xiaohan
    SCIENTIFIC REPORTS, 2018, 8
  • [42] NanoGalaxy: Nanopore long-read sequencing data analysis in Galaxy
    de Koning, Willem
    Miladi, Milad
    Hiltemann, Saskia
    Heikema, Astrid
    Hays, John P.
    Flemming, Stephan
    van den Beek, Marius
    Mustafa, Dana A.
    Backofen, Rolf
    Gruening, Bjoern
    Stubbs, Andrew P.
    GIGASCIENCE, 2020, 9 (10):
  • [43] Genomics in the long-read sequencing era
    van Dijk, Erwin L.
    Naquin, Delphine
    Gorrichon, Kevin
    Jaszczyszyn, Yan
    Ouazahrou, Rania
    Thermes, Claude
    Hernandez, Celine
    TRENDS IN GENETICS, 2023, 39 (09) : 649 - 671
  • [44] Long-read sequencing for brain tumors
    Shelton, William J.
    Zandpazandi, Sara
    Nix, J. Stephen
    Gokden, Murat
    Bauer, Michael
    Ryan, Katie Rose
    Wardell, Christopher P.
    Vaske, Olena Morozova
    Rodriguez, Analiz
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [45] Long-read sequencing in human genetics
    Kraft, Florian
    Kurth, Ingo
    MEDIZINISCHE GENETIK, 2019, 31 (02) : 198 - 204
  • [46] Method of the year: long-read sequencing
    Marx, Vivien
    NATURE METHODS, 2023, 20 (01) : 6 - 11
  • [47] Reimagining Long-Read DNA Sequencing
    不详
    CHEMICAL ENGINEERING PROGRESS, 2017, 113 (10) : 28 - 28
  • [48] Long-read sequencing in fungal identification
    Hoang, Minh Thuy Vi
    Irinyi, Laszlo
    Meyer, Wieland
    MICROBIOLOGY AUSTRALIA, 2022, 43 (01) : 14 - 18
  • [49] Transcriptomics in the era of long-read sequencing
    Monzo, Carolina
    Liu, Tianyuan
    Conesa, Ana
    NATURE REVIEWS GENETICS, 2025,
  • [50] Long-read sequencing goes clinical
    Neveling, K.
    Derks, R.
    Kwint, M.
    van de Vorst, M.
    Gardeitchik, T.
    Nelen, M.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2019, 27 : 516 - 516