Improving read alignment through the generation of alternative reference via iterative strategy

被引:0
|
作者
Lina Bu
Qi Wang
Wenjin Gu
Ruifei Yang
Di Zhu
Zhuo Song
Xiaojun Liu
Yiqiang Zhao
机构
[1] China Agricultural University,State Key Laboratory of Agrobiotechnology, College of Biological Sciences
[2] Genetalks Biotech. Co.,College of Animal Science and Veterinary Medicine
[3] Ltd,undefined
[4] Henan Agricultural University,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
There is generally one standard reference sequence for each species. When extensive variations exist in other breeds of the species, it can lead to ambiguous alignment and inaccurate variant calling and, in turn, compromise the accuracy of downstream analysis. Here, with the help of the FPGA hardware platform, we present a method that generates an alternative reference via an iterative strategy to improve the read alignment for breeds that are genetically distant to the reference breed. Compared to the published reference genomes, by using the alternative reference sequences we built, the mapping rates of Chinese indigenous pigs and chickens were improved by 0.61–1.68% and 0.09–0.45%, respectively. These sequences also enable researchers to recover highly variable regions that could be missed using public reference sequences. We also determined that the optimal number of iterations needed to generate alternative reference sequences were seven and five for pigs and chickens, respectively. Our results show that, for genetically distant breeds, generating an alternative reference sequence can facilitate read alignment and variant calling and improve the accuracy of downstream analyses.
引用
收藏
相关论文
共 15 条
  • [1] Improving read alignment through the generation of alternative reference via iterative strategy
    Bu, Lina
    Wang, Qi
    Gu, Wenjin
    Yang, Ruifei
    Zhu, Di
    Song, Zhuo
    Liu, Xiaojun
    Zhao, Yiqiang
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [2] Improving the Accuracy of Industrial Robots via Iterative Reference Trajectory Modification
    Biagiotti, Luigi
    Moriello, Lorenzo
    Melchiorri, Claudio
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2020, 28 (03) : 831 - 843
  • [3] Improving Reference-driven Undersampled MRI Reconstruction via Iterative Data Correction
    Wang, Guisong
    Du, Xiaofeng
    He, Yifan
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [4] Enhancing strategy for Industry 4.0 implementation through maturity models and standard reference architectures alignment
    Angreani, Linda Salma
    Vijaya, Annas
    Wicaksono, Hendro
    JOURNAL OF MANUFACTURING TECHNOLOGY MANAGEMENT, 2024, 35 (04) : 848 - 873
  • [5] Cryptic peroxisomal targeting via alternative splicing and stop codon read-through in fungi
    Johannes Freitag
    Julia Ast
    Michael Bölker
    Nature, 2012, 485 : 522 - 525
  • [6] Cryptic peroxisomal targeting via alternative splicing and stop codon read-through in fungi
    Freitag, Johannes
    Ast, Julia
    Boelker, Michael
    NATURE, 2012, 485 (7399) : 522 - U135
  • [7] Lightweight Read Reference Voltage Calibration Strategy for Improving 3-D TLC NAND Flash Memory Reliability
    Feng, Hua
    Wei, Debao
    Wang, Yongchao
    Song, Yu
    Piao, Zhelong
    Qiao, Liyan
    IEEE TRANSACTIONS ON DEVICE AND MATERIALS RELIABILITY, 2023, 23 (03) : 370 - 379
  • [8] Fault-Ride Through Strategy for Islanded Microgrids Via Dynamically Reconfigurable Voltage Reference
    Shen, Xia
    Huang, Wen
    Shen, Chao
    Shen, Yang
    Shuai, Zhikang
    2022 IEEE ENERGY CONVERSION CONGRESS AND EXPOSITION (ECCE), 2022,
  • [9] Improved variant discovery through local re-alignment of short-read next-generation sequencing data using SRMA
    Nils Homer
    Stanley F Nelson
    Genome Biology, 11
  • [10] Improved variant discovery through local re-alignment of short-read next-generation sequencing data using SRMA
    Homer, Nils
    Nelson, Stanley F.
    GENOME BIOLOGY, 2010, 11 (10):