Bayesian and likelihood inference for 2x2 ecological tables: An incomplete-data approach

Cited by: 14
Authors
Imai, Kosuke [1]
Lu, Ying [2]
Strauss, Aaron [1]
Affiliations
[1] Princeton Univ, Dept Polit, Princeton, NJ 08544 USA
[2] Univ Colorado, Dept Sociol, Boulder, CO 80309 USA
Funding
National Science Foundation (USA)
DOI
10.1093/pan/mpm017
Chinese Library Classification number
D0 [Political science, political theory]
Discipline classification codes
0302; 030201
Abstract
Ecological inference is a statistical problem where aggregate-level data are used to make inferences about individual-level behavior. In this article, we conduct a theoretical and empirical study of Bayesian and likelihood inference for 2 x 2 ecological tables by applying the general statistical framework of incomplete data. We first show that the ecological inference problem can be decomposed into three factors: distributional effects, which address the possible misspecification of parametric modeling assumptions about the unknown distribution of missing data; contextual effects, which represent the possible correlation between missing data and observed variables; and aggregation effects, which are directly related to the loss of information caused by data aggregation. We then examine how these three factors affect inference and offer new statistical methods to address each of them. To deal with distributional effects, we propose a nonparametric Bayesian model based on a Dirichlet process prior, which relaxes common parametric assumptions. We also identify the statistical adjustments necessary to account for contextual effects. Finally, although little can be done to cope with aggregation effects, we offer a method to quantify the magnitude of such effects in order to formally assess their severity. We use simulated and real data sets to empirically investigate the consequences of these three factors and to evaluate the performance of our proposed methods. C code, along with an easy-to-use R interface, is publicly available for implementing our proposed methods (Imai, Lu, and Strauss, forthcoming).
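To make the information loss from aggregation concrete, here is a minimal Python sketch of the classical deterministic bounds for a single 2 x 2 ecological table (often attributed to Duncan and Davis). This is not the method proposed in the article. The function name `deterministic_bounds` and the symbols `x`, `y`, `w1`, `w2` are illustrative assumptions: `x` is the observed share of group 1 in a unit, `y` is the observed share with the outcome, and `w1`, `w2` are the unobserved outcome rates within group 1 and group 2, linked by the accounting identity y = x * w1 + (1 - x) * w2.

```python
def deterministic_bounds(x, y):
    """Return ((w1_lo, w1_hi), (w2_lo, w2_hi)) for one areal unit.

    Assumed notation (not taken from the article):
      x  : observed share of group 1 in the unit, strictly between 0 and 1
      y  : observed share of the unit with the outcome, between 0 and 1
      w1 : unobserved outcome rate within group 1
      w2 : unobserved outcome rate within group 2
    The bounds follow from y = x * w1 + (1 - x) * w2 with w1, w2 in [0, 1].
    """
    if not 0.0 < x < 1.0:
        raise ValueError("x must lie strictly between 0 and 1")
    if not 0.0 <= y <= 1.0:
        raise ValueError("y must lie between 0 and 1")

    # Setting w2 to its extremes (1 and 0) gives the extremes for w1.
    w1_lo = max(0.0, (y - (1.0 - x)) / x)
    w1_hi = min(1.0, y / x)

    # Setting w1 to its extremes (1 and 0) gives the extremes for w2.
    w2_lo = max(0.0, (y - x) / (1.0 - x))
    w2_hi = min(1.0, y / (1.0 - x))

    return (w1_lo, w1_hi), (w2_lo, w2_hi)


if __name__ == "__main__":
    # Hypothetical unit: 60% of residents in group 1, 50% overall outcome rate.
    print(deterministic_bounds(0.6, 0.5))
```

The width of each interval gives a rough sense of how much the aggregate margins alone leave undetermined; model-based approaches such as those studied in the article add assumptions to narrow these intervals.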
Pages: 41-69
Page count: 29