COSINE: non-seeding method for mapping long noisy sequences

被引:3
|
作者
Afshar, Pegah Tootoonchi [1 ]
Wong, Wing Hung [2 ,3 ]
机构
[1] Stanford Univ, Sch Engn, Dept Elect Engn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Biomed Data Sci, Stanford, CA 94305 USA
基金
美国国家卫生研究院;
关键词
FAST FOURIER-TRANSFORM; GENERATION;
D O I
10.1093/nar/gkx511
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Third generation sequencing (TGS) are highly promising technologies but the long and noisy reads from TGS are difficult to align using existing algorithms. Here, we present COSINE, a conceptually new method designed specifically for aligning long reads contaminated by a high level of errors. COSINE computes the context similarity of two stretches of nucleobases given the similarity over distributions of their short k-mers (k = 3-4) along the sequences. The results on simulated and real data show that COSINE achieves high sensitivity and specificity under a wide range of read accuracies. When the error rate is high, COSINE can offer substantial advantages over existing alignment methods.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Long-read mapping to repetitive reference sequences using Winnowmap2
    Chirag Jain
    Arang Rhie
    Nancy F. Hansen
    Sergey Koren
    Adam M. Phillippy
    Nature Methods, 2022, 19 : 705 - 710
  • [42] Improved Method of Detecting Bowel Sounds For Automatic Long Analysis Under Noisy Environments
    Yamada, Yoshiyuki
    Sakata, Osamu
    TENTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2019, 2019, 11071
  • [43] A Method for Localizing Non-Reference Sequences to the Human Genome
    Chrisman, Brianna Sierra
    Paskov, Kelley M.
    He, Chloe
    Jung, Jae-Yoon
    Stockham, Nate
    Washington, Peter Yigitcan
    Wall, Dennis Paul
    BIOCOMPUTING 2022, PSB 2022, 2022, : 313 - 324
  • [44] Strain mapping by non destructive method : Laue microdiffraction
    Geandier, G.
    Malard, B.
    Goudeau, Ph.
    Tamura, N.
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2007, 63 : S70 - S71
  • [45] A non-destructive method for mapping formation damage
    Khan, MA
    Jilani, SZ
    Menouar, H
    Al-Majed, AA
    ULTRASONICS, 2001, 39 (05) : 321 - 328
  • [46] Novel method for mapping non-detection zone
    Liu, Furong
    Kang, Yong
    Duan, Shanxu
    Wang, Hui
    Wang, Zhifeng
    Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2007, 22 (10): : 167 - 172
  • [47] Long Non-coding RNA in Plants in the Era of Reference Sequences
    Budak, Hikmet
    Kaya, Sezgi Biyiklioglu
    Cagirici, Halise Busra
    FRONTIERS IN PLANT SCIENCE, 2020, 11
  • [48] kngMap: Sensitive and Fast Mapping Algorithm for Noisy Long Reads Based on the K-Mer Neighborhood Graph
    Wei, Ze-Gang
    Fan, Xing-Guo
    Zhang, Hao
    Zhang, Xiao-Dan
    Liu, Fei
    Qian, Yu
    Zhang, Shao-Wu
    FRONTIERS IN GENETICS, 2022, 13
  • [49] Sensitivity of Uniformly Convergent Mapping Sequences in Non-Autonomous Discrete Dynamical Systems
    Jiang, Yongxi
    Yang, Xiaofang
    Lu, Tianxiu
    FRACTAL AND FRACTIONAL, 2022, 6 (06)
  • [50] Tabu search method for determining sequences of amino acids in long polypeptides
    Blazewicz, J
    Borowski, M
    Formanowicz, P
    Stobiecki, M
    APPLICATIONS OF EVOLUTIONARY COMPUTING, PROCEEDINGS, 2005, 3449 : 22 - 32