Log-linear models for mutations in the HIV genome

被引:1
|
作者
Ahn, C.
Koch, G. G.
Paynter, L.
Preisser, J. S.
Seillier-Moiseiwitsch, F.
机构
[1] Univ N Carolina, Sch Publ Hlth, Dept Biostat, Chapel Hill, NC 27599 USA
[2] Georgetown Univ, Dept Biostat Bioinformat & Biomath, Washington, DC 20057 USA
基金
美国国家卫生研究院;
关键词
HIV genome; consensus; correlated mutation; collapsibility; conditional association; marginal association; log-linear models;
D O I
10.1016/j.jspi.2007.03.007
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We discuss a general application of categorical data analysis to mutations along the HIV genome. We consider a multidimensional table for several positions at the same time. Due to the complexity of the multidimensional table, we may collapse it by pooling some categories. However. the association between the remaining variables may not be the same as before collapsing. We discuss the collapsibility of tables and the change in the meaning of parameters after collapsing categories. We also address this problem with a log-linear model. We present a parameterization with the consensus output as the reference cell as is appropriate to explain genomic mutations in HIV. We also consider live null hypotheses and some classical methods to address them. We illustrate methods for six positions along the HIV genome, through consideration of all triples of positions. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:3227 / 3239
页数:13
相关论文
共 50 条