A structural variation genotyping algorithm enhanced by CNV quantitative transfer

被引:0
|
作者
Zheng, Tian [1 ,2 ]
Qian, Xinyang [1 ,2 ]
Wang, Jiayin [1 ,2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Dept Comp Sci & Technol, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Shaanxi Engn Res Ctr Med & Hlth Big Data, Inst Data Sci & Informat Qual, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
genotyping; copy number variations; transfer earning; COPY NUMBER;
D O I
10.1007/s11704-021-1177-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Genotyping of structural variations considering copy number variations (CNVs) is an infancy and challenging problem. CNVs, a prevalent form of critical genetic variations that cause abnormal copy numbers of large genomic regions in cells, often affect transcription and contribute to a variety of diseases. The characteristics of CNVs often lead to the ambiguity and confusion of existing genotyping features and algorithms, which may cause heterozygous variations to be erroneously genotyped as homozygous variations and seriously affect the accuracy of downstream analysis. As the allelic copy number increases, the error rate of genotyping increases sharply. Some instances with different copy numbers play an auxiliary role in the genotyping classification problem, but some will seriously interfere with the accuracy of the model. Motivated by these, we propose a transfer learning-based method to genotype structural variations accurately considering CNVs. The method first divides the instances with different allelic copy numbers and trains the basic machine learning framework with different genotype datasets. It maximizes the weights of the instances that contribute to classification and minimizes the weights of the instances that hinder correct genotyping. By adjusting the weights of the instances with different allelic copy numbers, the contribution of all the instances to genotyping can be maximized, and the genotyping errors of heterozygote variations caused by CNVs can be minimized. We applied the proposed method to both the simulated and real datasets, and compared it to some popular algorithms including GATK, Facets and Gindel. The experimental results demonstrate that the proposed method outperforms the others in terms of accuracy, stability and efficiency. The source codes have been uploaded at github/TrinaZ/CNVtransfer for academic use only.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] A structural variation genotyping algorithm enhanced by CNV quantitative transfer
    ZHENG Tian
    QIAN Xinyang
    WANG Jiayin
    Frontiers of Computer Science, 2022, 16 (06)
  • [2] A structural variation genotyping algorithm enhanced by CNV quantitative transfer
    Tian Zheng
    Xinyang Qian
    Jiayin Wang
    Frontiers of Computer Science, 2022, 16
  • [3] Integrated Genotyping of Structural Variation
    Fan, Xian
    Nakhleh, Luay
    Chen, Ken
    2013 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2013, : 47 - 48
  • [4] Improving the detection and genotyping of structural variation
    Kehr, B.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2020, 28 (SUPPL 1) : 16 - 16
  • [5] Genome structural variation discovery and genotyping
    Can Alkan
    Bradley P. Coe
    Evan E. Eichler
    Nature Reviews Genetics, 2011, 12 : 363 - 376
  • [6] Evaluation of computational genotyping of structural variation for clinical diagnoses
    Chander, Varuna
    Gibbs, Richard A.
    Sedlazeck, Fritz J.
    GIGASCIENCE, 2019, 8 (09):
  • [7] Technical considerations for genotyping multi-allelic copy number variation (CNV), in regions of segmental duplication
    Stuart Cantsilieris
    Patrick S Western
    Paul N Baird
    Stefan J White
    BMC Genomics, 15
  • [8] Technical considerations for genotyping multi-allelic copy number variation (CNV), in regions of segmental duplication
    Cantsilieris, Stuart
    Western, Patrick S.
    Baird, Paul N.
    White, Stefan J.
    BMC GENOMICS, 2014, 15
  • [9] DNA copy number and structural variation (CNV) contributions to adult and childhood obesity
    Phillips, Megan
    Babu, Jeganathan Ramesh
    Wang, Xu
    Geetha, Thangiah
    BIOCHEMICAL SOCIETY TRANSACTIONS, 2020, 48 (04) : 1819 - 1828
  • [10] High-throughput genotyping of intermediate-size structural variation
    Newman, TL
    Rieder, MJ
    Morrison, VA
    Sharp, AJ
    Smith, JD
    Sprague, LJ
    Kaul, R
    Carlson, CS
    Olson, MV
    Nickerson, DA
    Eichler, EE
    HUMAN MOLECULAR GENETICS, 2006, 15 (07) : 1159 - 1167