BatAlign: an incremental method for accurate alignment of sequencing reads

Cited by: 8
Authors
Lim, Jing-Quan [1, 2]
Tennakoon, Chandana [1, 3, 4, 5]
Guan, Peiyong [1]
Sung, Wing-Kin [1, 4]
Affiliations
[1] Natl Univ Singapore, Dept Comp Sci, Singapore 117417, Singapore
[2] Natl Canc Ctr Singapore, Div Med Sci, Lab Canc Epigenome, Singapore 169610, Singapore
[3] NUS Grad Sch Integrat Sci & Engn, CeLS, Singapore 117456, Singapore
[4] Genome Inst Singapore, Dept Computat & Syst Biol, Singapore 138672, Singapore
[5] UAE Univ, Al Ain, U Arab Emirates
Keywords
IDENTIFICATION; ALGORITHM; INSERTION; INDELS; FORMAT
DOI
10.1093/nar/gkv533
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification code
071010; 081704
Abstract
Structural variations (SVs) play a crucial role in genetic diversity. However, the alignment of reads near or across SVs is made inaccurate by the presence of polymorphisms. BatAlign is an algorithm that integrates two strategies, called 'Reverse-Alignment' and 'Deep-Scan', to improve the accuracy of read alignment. In our experiments, BatAlign obtained the highest F-measures in read alignment on mismatch-aberrant, indel-aberrant, concordantly/discordantly paired and SV-spanning data sets. On real data, the alignments of BatAlign recovered 4.3% more PCR-validated SVs with 73.3% fewer calls. These results suggest that BatAlign is effective in accurately detecting SVs and other polymorphic variants using high-throughput data.
Pages: 11