QUANTILE REGRESSION DECOMPOSITION ANALYSIS OF DISPARITY RESEARCH USING COMPLEX SURVEY DATA: APPLICATION TO DISPARITIES IN BMI AND TELOMERE LENGTH BETWEEN US MINORITY AND WHITE POPULATION GROUPS

Authors
Hong, Hyokyoung G. [1]
Graubard, Barry I. [1]
Gastwirth, Joseph L. [2]
Kim, Mi-Ok [3]
Affiliations
[1] NCI, Biostat Branch, Div Canc Epidemiol & Genet, NIH, Bethesda, MD 20892 USA
[2] George Washington Univ, Dept Stat, Washington, DC 20052 USA
[3] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA USA
Source
ANNALS OF APPLIED STATISTICS | 2024 / Vol. 18 / No. 3
Keywords
Complex survey data; disparity decomposition; perturbation-based variance estimation; Peters-Belson; quantile regression; PETERS-BELSON METHOD; HEALTH DISPARITIES
DOI
10.1214/23-AOAS1868
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
We develop a quantile regression decomposition (QRD) method for analyzing observed disparities (OD) between population groups in socioeconomic and health-related outcomes for complex survey data. Conventional decomposition approaches use conditional mean regression to decompose the disparity into two parts: the part explained by differences between the groups in the distributions of the explanatory covariates, and the remaining part, which is unexplained by the covariates. Many socioeconomic and health outcomes exhibit heteroscedastic distributions, where the magnitude of observed disparities varies across different quantiles of these outcomes. Thus, differences in the explanatory covariates may account for varying amounts of the OD across the quantiles of the outcome. The QRD can identify where in the outcome distribution the differences are greater, for example, at the 90th quantile, and how important the covariates are in explaining those differences. Much socioeconomic and health research relies on complex surveys, such as the National Health and Nutrition Examination Survey (NHANES), that oversample individuals from disadvantaged/minority population groups in order to provide improved precision. QRD has not been extended to the complex survey setting. We improve the QRD approach proposed in Machado and Mata (2005) to yield more reliable estimates at quantiles where the data are sparse, and extend it to the complex survey setting. We also propose a perturbation-based variance estimation method. Simulation studies indicate that the estimates of the unexplained portions of the OD across quantiles are unbiased and that the coverage of the confidence intervals is close to the nominal value. This methodology is used to study disparities in body mass index (BMI) and telomere length between race/ethnic groups estimated from the NHANES data.
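The conventional mean-based decomposition that the abstract contrasts with QRD can be illustrated with a minimal sketch. This is not the authors' QRD method or code; it is a Peters-Belson-style decomposition of a difference in weighted means, assuming a single covariate, survey weights supplied as plain numbers, and a linear model fitted in the reference group only. All function names and the toy data are hypothetical.

```python
def weighted_mean(values, weights):
    """Weighted mean of `values` using survey weights `weights`."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


def weighted_linear_fit(x, y, w):
    """Weighted least squares for y = intercept + slope * x (one covariate)."""
    xbar = weighted_mean(x, w)
    ybar = weighted_mean(y, w)
    sxx = sum(wi * (xi - xbar) ** 2 for xi, wi in zip(x, w))
    sxy = sum(wi * (xi - xbar) * (yi - ybar) for xi, yi, wi in zip(x, y, w))
    slope = sxy / sxx
    return ybar - slope * xbar, slope


def mean_decomposition(x_ref, y_ref, w_ref, x_cmp, y_cmp, w_cmp):
    """Split the observed mean disparity (reference minus comparison group)
    into a part explained by the covariate and an unexplained remainder."""
    intercept, slope = weighted_linear_fit(x_ref, y_ref, w_ref)
    mean_ref = weighted_mean(y_ref, w_ref)
    mean_cmp = weighted_mean(y_cmp, w_cmp)
    # Counterfactual: the comparison group's covariates pushed through
    # the model fitted in the reference group.
    counterfactual = weighted_mean(
        [intercept + slope * xi for xi in x_cmp], w_cmp
    )
    return {
        "observed": mean_ref - mean_cmp,
        "explained": mean_ref - counterfactual,
        "unexplained": counterfactual - mean_cmp,
    }


# Toy data (invented for illustration only).
result = mean_decomposition(
    x_ref=[0, 1, 2, 3], y_ref=[1, 3, 5, 7], w_ref=[1, 1, 1, 1],
    x_cmp=[0, 1, 2], y_cmp=[0, 2, 4], w_cmp=[2, 1, 1],
)
print(result)
```

By construction the explained and unexplained parts sum to the observed disparity. A quantile-based decomposition, as described in the abstract, would replace the mean comparison with comparisons at chosen quantiles of the outcome distribution; that step is not attempted here.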
Pages: 2012-2033
Page count: 22