BatAlign: an incremental method for accurate alignment of sequencing reads

被引:8
|
作者
Lim, Jing-Quan [1 ,2 ]
Tennakoon, Chandana [1 ,3 ,4 ,5 ]
Guan, Peiyong [1 ]
Sung, Wing-Kin [1 ,4 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore 117417, Singapore
[2] Natl Canc Ctr Singapore, Div Med Sci, Lab Canc Epigenome, Singapore 169610, Singapore
[3] NUS Grad Sch Integrat Sci & Engn, CeLS, Singapore 117456, Singapore
[4] Genome Inst Singapore, Dept Computat & Syst Biol, Singapore 138672, Singapore
[5] UAE Univ, Al Ain, U Arab Emirates
关键词
IDENTIFICATION; ALGORITHM; INSERTION; INDELS; FORMAT;
D O I
10.1093/nar/gkv533
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Structural variations (SVs) play a crucial role in genetic diversity. However, the alignments of reads near/across SVs are made inaccurate by the presence of polymorphisms. BatAlign is an algorithm that integrated two strategies called 'Reverse-Alignment' and 'Deep-Scan' to improve the accuracy of read-alignment. In our experiments, BatAlign was able to obtain the highest F-measures in read-alignments on mismatch-aberrant, indel-aberrant, concordantly/discordantly paired and SV-spanning data sets. On real data, the alignments of BatAlign were able to recover 4.3% more PCR-validated SVs with 73.3% less callings. These suggest BatAlign to be effective in detecting SVs and other polymorphic-variants accurately using high-throughput data.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] An Incremental Method for Accurate Iris Segmentation
    Li, Peihua
    Liu, Xiaomin
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1118 - 1121
  • [22] Method for the Identification of Plant DNA in Food Using Alignment-Free Analysis of Sequencing Reads: A Case Study on Lupin
    Raime, Kairi
    Krjutskov, Kaarel
    Remm, Maido
    FRONTIERS IN PLANT SCIENCE, 2020, 11
  • [23] A Statistical Method for Alignment-Free Analysis of Sequencing Reads with Applications in Copy Number Determination and Plasmid Integration Detection
    Mols, Mart
    HUMAN HEREDITY, 2017, 83 (01) : 18 - 18
  • [24] INC-Seq: accurate single molecule reads using nanopore sequencing
    Li, Chenhao
    Chng, Kern Rei
    Boey, Esther Jia Hui
    Ng, Amanda Hui Qi
    Wilm, Andreas
    Nagarajan, Niranjan
    GIGASCIENCE, 2016, 5
  • [25] A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing
    Kim, Jeongkyu
    Ji, Mingeun
    Yi, Gangman
    IEEE ACCESS, 2020, 8 : 189811 - 189822
  • [26] Alignment-Free Sequence Comparison Based on Next-Generation Sequencing Reads
    Song, Kai
    Ren, Jie
    Zhai, Zhiyuan
    Liu, Xuemei
    Deng, Minghua
    Sun, Fengzhu
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2013, 20 (02) : 64 - 79
  • [27] Fast alignment filtering of nanopore sequencing reads using locality-sensitive hashing
    Wang, Jeremy R.
    Jones, Corbin D.
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 127 - 130
  • [28] GeneToCN: an alignment-free method for gene copy number estimation directly from next-generation sequencing reads
    Pajuste, Fanny-Dhelia
    Remm, Maido
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [29] GeneToCN: an alignment-free method for gene copy number estimation directly from next-generation sequencing reads
    Fanny-Dhelia Pajuste
    Maido Remm
    Scientific Reports, 13
  • [30] ResSeq: Enhancing Short-Read Sequencing Alignment By Rescuing Error-Containing Reads
    Feng, Weixing
    Sang, Peichao
    Lian, Deyuan
    Dong, Yansheng
    Song, Fengfei
    Li, Meng
    He, Bo
    Cao, Fenglin
    Liu, Yunlong
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (04) : 795 - 798