PAGANtec: OpenMP Parallel Error Correction for Next-Generation Sequencing Data

被引:3
|
作者
Joppich, Markus [1 ,2 ,3 ]
Schmidl, Dirk [1 ]
Bolger, Anthony M. [2 ]
Kuhlen, Torsten [1 ]
Usadel, Bjoern [2 ]
机构
[1] Rhein Westfal TH Aachen, IT Ctr, JARA High Performance Comp, Aachen, Germany
[2] Rhein Westfal TH Aachen, Inst Bot & Mol Genet, Aachen, Germany
[3] Univ Munich, Inst Informat, D-80539 Munich, Germany
关键词
D O I
10.1007/978-3-319-24595-9_1
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Next-generation sequencing techniques reduced the cost of sequencing a genome rapidly, but came with a relatively high error rate. Therefore, error correction of this data is a necessary task before assembly can take place. Since the input data is huge and error correction is compute intensive, parallelizing this work on a modern shared-memory system can help to keep the runtime feasible. In this work we present PAGANtec, a tool for error correction of next-generation sequencing data, based on the novel PAGAN graph structure. PAGANtec was parallelized with OpenMP and a performance analysis and tuning was done. The analysis led to the awareness, that OpenMP tasks are a more suitable paradigm for this work than traditional work-sharing.
引用
收藏
页码:3 / 17
页数:15
相关论文
共 50 条
  • [1] MapReduce for accurate error correction of next-generation sequencing data
    Zhao, Liang
    Chen, Qingfeng
    Li, Wencui
    Jiang, Peng
    Wong, Limsoon
    Li, Jinyan
    [J]. BIOINFORMATICS, 2017, 33 (23) : 3844 - 3851
  • [2] Effects of error-correction of heterozygous next-generation sequencing data
    Fujimoto, M. Stanley
    Bodily, Paul M.
    Okuda, Nozomu
    Clement, Mark J.
    Snell, Quinn
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [3] Effects of error-correction of heterozygous next-generation sequencing data
    M Stanley Fujimoto
    Paul M Bodily
    Nozomu Okuda
    Mark J Clement
    Quinn Snell
    [J]. BMC Bioinformatics, 15
  • [4] Error correction of next-generation sequencing data and reliable estimation of HIV quasispecies
    Zagordi, Osvaldo
    Klein, Rolf
    Daeumer, Martin
    Beerenwinkel, Niko
    [J]. NUCLEIC ACIDS RESEARCH, 2010, 38 (21) : 7400 - 7409
  • [5] Benchmarking of computational error-correction methods for next-generation sequencing data
    Keith Mitchell
    Jaqueline J. Brito
    Igor Mandric
    Qiaozhen Wu
    Sergey Knyazev
    Sei Chang
    Lana S. Martin
    Aaron Karlsberg
    Ekaterina Gerasimov
    Russell Littman
    Brian L. Hill
    Nicholas C. Wu
    Harry Taegyun Yang
    Kevin Hsieh
    Linus Chen
    Eli Littman
    Taylor Shabani
    German Enik
    Douglas Yao
    Ren Sun
    Jan Schroeder
    Eleazar Eskin
    Alex Zelikovsky
    Pavel Skums
    Mihai Pop
    Serghei Mangul
    [J]. Genome Biology, 21
  • [6] Benchmarking of computational error-correction methods for next-generation sequencing data
    Mitchell, Keith
    Brito, Jaqueline J.
    Mandric, Igor
    Wu, Qiaozhen
    Knyazev, Sergey
    Chang, Sei
    Martin, Lana S.
    Karlsberg, Aaron
    Gerasimov, Ekaterina
    Littman, Russell
    Hill, Brian L.
    Wu, Nicholas C.
    Yang, Harry
    Hsieh, Kevin
    Chen, Linus
    Littman, Eli
    Shabani, Taylor
    Enik, German
    Yao, Douglas
    Sun, Ren
    Schroeder, Jan
    Eskin, Eleazar
    Zelikovsky, Alex
    Skums, Pavel
    Pop, Mihai
    Mangul, Serghei
    [J]. ACM-BCB 2020 - 11TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2020,
  • [7] Benchmarking of computational error-correction methods for next-generation sequencing data
    Mitchell, Keith
    Brito, Jaqueline J.
    Mandric, Igor
    Wu, Qiaozhen
    Knyazev, Sergey
    Chang, Sei
    Martin, Lana S.
    Karlsberg, Aaron
    Gerasimov, Ekaterina
    Littman, Russell
    Hill, Brian L.
    Wu, Nicholas C.
    Yang, Harry Taegyun
    Hsieh, Kevin
    Chen, Linus
    Littman, Eli
    Shabani, Taylor
    Enik, German
    Yao, Douglas
    Sun, Ren
    Schroeder, Jan
    Eskin, Eleazar
    Zelikovsky, Alex
    Skums, Pavel
    Pop, Mihai
    Mangul, Serghei
    [J]. GENOME BIOLOGY, 2020, 21 (01)
  • [8] Efficient error correction for next-generation sequencing of viral amplicons
    Skums, Pavel
    Dimitrova, Zoya
    Campo, David S.
    Vaughan, Gilberto
    Rossi, Livia
    Forbi, Joseph C.
    Yokosawa, Jonny
    Zelikovsky, Alex
    Khudyakov, Yury
    [J]. BMC BIOINFORMATICS, 2012, 13
  • [9] Efficient error correction for next-generation sequencing of viral amplicons
    Pavel Skums
    Zoya Dimitrova
    David S Campo
    Gilberto Vaughan
    Livia Rossi
    Joseph C Forbi
    Jonny Yokosawa
    Alex Zelikovsky
    Yury Khudyakov
    [J]. BMC Bioinformatics, 13
  • [10] A systematic comparison of error correction enzymes by next-generation sequencing
    Lubock, Nathan B.
    Zhang, Di
    Sidore, Angus M.
    Church, George M.
    Kosuri, Sriram
    [J]. NUCLEIC ACIDS RESEARCH, 2017, 45 (15) : 9206 - 9217