Fast and Efficient Estimation of Individual Ancestry Coefficients

被引:536
|
作者
Frichot, Eric [1 ]
Mathieu, Francois [1 ]
Trouillon, Theo [1 ,2 ]
Bouchard, Guillaume [2 ]
Francois, Olivier [1 ]
机构
[1] Univ Grenoble 1, Ctr Natl Rech Sci, Tech Ingn Med & Complexite Informat Math & Applic, Unite Mixte Rech 5525, F-38042 Grenoble, France
[2] Xerox Res Ctr Europe, F-38240 Meylan, France
关键词
inference of population structure; ancestry coefficients; nonnegative matrix factorization algorithms; NONNEGATIVE MATRIX FACTORIZATION; POPULATION-STRUCTURE; LEAST-SQUARES; COMPONENTS; ADMIXTURE; ASSOCIATION; ALGORITHM; NUMBER;
D O I
10.1534/genetics.113.160572
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Inference of individual ancestry coefficients, which is important for population genetic and association studies, is commonly performed using computer-intensive likelihood algorithms. With the availability of large population genomic data sets, fast versions of likelihood algorithms have attracted considerable attention. Reducing the computational burden of estimation algorithms remains, however, a major challenge. Here, we present a fast and efficient method for estimating individual ancestry coefficients based on sparse nonnegative matrix factorization algorithms. We implemented our method in the computer program sNMF and applied it to human and plant data sets. The performances of sNMF were then compared to the likelihood algorithm implemented in the computer program ADMIXTURE. Without loss of accuracy, sNMF computed estimates of ancestry coefficients with runtimes similar to 10-30 times shorter than those of ADMIXTURE.
引用
收藏
页码:973 / +
页数:19
相关论文
共 50 条
  • [1] Fast and Efficient Sinusoidal Frequency Estimation by Using the DFT Coefficients
    Serbes, Ahmet
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (03) : 2333 - 2342
  • [2] Enhancements to the ADMIXTURE algorithm for individual ancestry estimation
    David H Alexander
    Kenneth Lange
    [J]. BMC Bioinformatics, 12
  • [3] Genetic admixture and estimation of individual ancestry.
    Hua, T.
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2006, 163 (11) : S83 - S83
  • [4] Enhancements to the ADMIXTURE algorithm for individual ancestry estimation
    Alexander, David H.
    Lange, Kenneth
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [5] Fast and Effective Clustering Method for Ancestry Estimation
    Budiarto, Arif
    Mahesworo, Bharuno
    Baurley, James
    Suparyanto, Teddy
    Pardamean, Bens
    [J]. 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2019) : ENABLING COLLABORATION TO ESCALATE IMPACT OF RESEARCH RESULTS FOR SOCIETY, 2019, 157 : 306 - 312
  • [6] Fast model-based estimation of ancestry in unrelated individuals
    Alexander, David H.
    Novembre, John
    Lange, Kenneth
    [J]. GENOME RESEARCH, 2009, 19 (09) : 1655 - 1664
  • [7] Efficient Methods for the Estimation of Homogenized Coefficients
    Mourrat, J-C
    [J]. FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2019, 19 (02) : 435 - 483
  • [8] Efficient Methods for the Estimation of Homogenized Coefficients
    J.-C. Mourrat
    [J]. Foundations of Computational Mathematics, 2019, 19 : 435 - 483
  • [9] Fast and Efficient Estimation of Frequencies
    Serbes, Ahmet
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (06) : 4054 - 4066
  • [10] Efficient estimation for the Cox model with varying coefficients
    Chen, Kani
    Lin, Huazhen
    Zhou, Yong
    [J]. BIOMETRIKA, 2012, 99 (02) : 379 - 392