MLIP: using multiple processors to compute the posterior probability of linkage

被引:2
|
作者
Govil, Manika [1 ,2 ]
Segre, Alberto M. [3 ]
Vieland, Veronica J. [4 ,5 ]
机构
[1] Univ Pittsburgh, Sch Dent Med, Dept Oral Biol, Pittsburgh, PA 15260 USA
[2] Univ Pittsburgh, Sch Dent Med, Ctr Craniofacial & Dent Genet, Pittsburgh, PA USA
[3] Univ Iowa, Dept Comp Sci, Iowa City, IA 52242 USA
[4] Nationwide Childrens Hosp, Res Inst, Battelle Ctr Math Med, Columbus, OH USA
[5] Ohio State Univ, Columbus, OH 43210 USA
关键词
D O I
10.1186/1471-2105-9-S6-S2
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Localization of complex traits by genetic linkage analysis may involve exploration of a vast multidimensional parameter space. The posterior probability of linkage (PPL), a class of statistics for complex trait genetic mapping in humans, is designed to model the trait model complexity represented by the multidimensional parameter space in a mathematically rigorous fashion. However, the method requires the evaluation of integrals with no functional form, making it difficult to compute, and thus further test, develop and apply. This paper describes MLIP, a multiprocessor two-point genetic linkage analysis system that supports statistical calculations, such as the PPL, based on the full parameter space implicit in the linkage likelihood. Results: The fundamental question we address here is whether the use of additional processors effectively reduces total computation time for a PPL calculation. We use a variety of data-both simulated and real-to explore the question " how close can we get?" to linear speedup. Empirical results of our study show that MLIP does significantly speed up two-point log-likelihood ratio calculations over a grid space of model parameters. Conclusion: Observed performance of the program is dependent on characteristics of the data including granularity of the parameter grid space being explored and pedigree size and structure. While work continues to further optimize performance, the current version of the program can already be used to efficiently compute the PPL. Thanks to MLIP, full multidimensional genome scans are now routinely being completed at our centers with runtimes on the order of days, not months or years.
引用
收藏
页数:12
相关论文
共 50 条