Analysis of nanopore data using hidden Markov models

被引:22
|
作者
Schreiber, Jacob [1 ]
Karplus, Kevin [1 ]
机构
[1] Univ Calif Santa Cruz, Dept Biomol Engn, Nanopore Grp, Santa Cruz, CA 95064 USA
关键词
DNA; MOLECULES;
D O I
10.1093/bioinformatics/btv046
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Nanopore-based sequencing techniques can reconstruct properties of biosequences by analyzing the sequence-dependent ionic current steps produced as biomolecules pass through a pore. Typically this involves alignment of new data to a reference, where both reference construction and alignment have been performed by hand. Results: We propose an automated method for aligning nanopore data to a reference through the use of hidden Markov models. Several features that arise from prior processing steps and from the class of enzyme used can be simply incorporated into the model. Previously, the M2MspA nanopore was shown to be sensitive enough to distinguish between cytosine, methylcytosine and hydroxymethylcytosine. We validated our automated methodology on a subset of that data by automatically calculating an error rate for the distinction between the three cytosine variants and show that the automated methodology produces a 2-3% error rate, lower than the 10% error rate from previous manual segmentation and alignment. .
引用
收藏
页码:1897 / 1903
页数:7
相关论文
共 50 条
  • [1] Automatic Analysis of Large-scale Nanopore Data Using Hidden Markov Models
    Zhang, Jianhua
    Liu, Xiuling
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 16759 - 16766
  • [2] Analysis of default data using hidden Markov models
    Giampieri, G
    Davis, M
    Crowder, M
    [J]. QUANTITATIVE FINANCE, 2005, 5 (01) : 27 - 34
  • [3] High-bandwidth nanopore data analysis by using a modified hidden Markov model
    Zhang, Jianhua
    Liu, Xiuling
    Ying, Yi-Lun
    Gu, Zhen
    Meng, Fu-Na
    Long, Yi-Tao
    [J]. NANOSCALE, 2017, 9 (10) : 3458 - 3465
  • [4] The analysis of hospital infection data using hidden Markov models
    Cooper, B
    Lipsitch, M
    [J]. BIOSTATISTICS, 2004, 5 (02) : 223 - 237
  • [5] Analysis of animal accelerometer data using hidden Markov models
    Leos-Barajas, Vianey
    Photopoulou, Theoni
    Langrock, Roland
    Patterson, Toby A.
    Watanabe, Yuuki Y.
    Murgatroyd, Megan
    Papastamatiou, Yannis P.
    [J]. METHODS IN ECOLOGY AND EVOLUTION, 2017, 8 (02): : 161 - 173
  • [6] Statistical analysis of uniparental disomy data using hidden Markov models
    Zhao, H
    Li, J
    Robinson, WP
    [J]. BIOMETRICS, 2001, 57 (04) : 1074 - 1079
  • [7] Hidden Markov models for the analysis and classification of ultrasound data
    Moldenhauer, J
    Beth, T
    Mende, U
    [J]. CARS 2004: COMPUTER ASSISTED RADIOLOGY AND SURGERY, PROCEEDINGS, 2004, 1268 : 231 - 236
  • [8] Tor traffic analysis using Hidden Markov Models
    Zhioua, Sami
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2013, 6 (09) : 1075 - 1086
  • [9] Analysis of swallowing sounds using hidden Markov models
    Mohammad Aboofazeli
    Zahra Moussavi
    [J]. Medical & Biological Engineering & Computing, 2008, 46 : 307 - 314
  • [10] Music analysis using hidden Markov mixture models
    Qi, Yuting
    Paisley, John William.
    Carin, Lawrence
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (11) : 5209 - 5224