Bayesian Semiparametric Density Deconvolution in the Presence of Conditionally Heteroscedastic Measurement Errors

被引:17
|
作者
Sarkar, Abhra [1 ]
Mallick, Bani K. [1 ]
Staudenmayer, John [2 ]
Pati, Debdeep [3 ]
Carroll, Raymond J. [1 ]
机构
[1] Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA
[2] Univ Massachusetts, Dept Math & Stat, Amherst, MA 01003 USA
[3] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
基金
美国国家科学基金会;
关键词
Measurement errors; Skew-normal distribution; Conditional heteroscedasticity; Dirichlet process mixture models; B-spline; Variance function; REGRESSION CALIBRATION; VARIABLE SELECTION; VARIANCE FUNCTIONS; OPTIMAL RATES; MODELS; DISTRIBUTIONS; CONVERGENCE; VALIDATION; DESIGNS;
D O I
10.1080/10618600.2014.899237
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the problem of estimating the density of a random variable when precise measurements on the variable are not available, but replicated proxies contaminated with measurement error are available for sufficiently many subjects. Under the assumption of additive measurement errors this reduces to a problem of deconvolution of densities. Deconvolution methods often make restrictive and unrealistic assumptions about the density of interest and the distribution of measurement errors, for example, normality and homoscedasticity and thus independence from the variable of interest. This article relaxes these assumptions and introduces novel Bayesian semiparametric methodology based on Dirichlet process mixture models for robust deconvolution of densities in the presence of conditionally heteroscedastic measurement errors. In particular, the models can adapt to asymmetry, heavy tails, and multimodality. In simulation experiments, we show that our methods vastly outperform a recent Bayesian approach based on estimating the densities via mixtures of splines. We apply our methods to data from nutritional epidemiology. Even in the special case when the measurement errors are homoscedastic, our methodology is novel and dominates other methods that have been proposed previously. Additional simulation results, instructions on getting access to the dataset and R programs implementing our methods are included as part of online supplementary materials.
引用
收藏
页码:1101 / 1125
页数:25
相关论文
共 50 条