QUANTILE REGRESSION DECOMPOSITION ANALYSIS OF DISPARITY RESEARCH USING COMPLEX SURVEY DATA: APPLICATION TO DISPARITIES IN BMI AND TELOMERE LENGTH BETWEEN US MINORITY AND WHITE POPULATION GROUPS
被引:0
|
作者:
Hong, Hyokyoung G.
论文数: 0引用数: 0
h-index: 0
机构:
NCI, Biosta Branch, Div Canc Epidemiol & Genet, NIH, Bethesda, MD 20892 USANCI, Biosta Branch, Div Canc Epidemiol & Genet, NIH, Bethesda, MD 20892 USA
Hong, Hyokyoung G.
[1
]
Graubard, Barry I.
论文数: 0引用数: 0
h-index: 0
机构:
NCI, Biosta Branch, Div Canc Epidemiol & Genet, NIH, Bethesda, MD 20892 USANCI, Biosta Branch, Div Canc Epidemiol & Genet, NIH, Bethesda, MD 20892 USA
Graubard, Barry I.
[1
]
Gastwirth, Joseph L.
论文数: 0引用数: 0
h-index: 0
机构:
George Washington Univ, Dept Stat, Washington, DC 20052 USANCI, Biosta Branch, Div Canc Epidemiol & Genet, NIH, Bethesda, MD 20892 USA
Gastwirth, Joseph L.
[2
]
Kim, Mi- Ok
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA USANCI, Biosta Branch, Div Canc Epidemiol & Genet, NIH, Bethesda, MD 20892 USA
Kim, Mi- Ok
[3
]
机构:
[1] NCI, Biosta Branch, Div Canc Epidemiol & Genet, NIH, Bethesda, MD 20892 USA
[2] George Washington Univ, Dept Stat, Washington, DC 20052 USA
[3] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA USA
We develop a quantile regression decomposition (QRD) method for analyzing observed disparities (OD) between population groups in socioeconomic and health-related outcomes for complex survey data. The conventional decomposition approaches use the conditional mean regression to decompose the disparity into two parts, the part explained by the difference arising from the different distributions in the explanatory covariates and the remaining part, which is unexplained by the covariates. Many socioeconomic and health outcomes exhibit heteroscedastic distributions, where the magnitude of observed disparities varies across different quantiles of these outcomes. Thus, differences in the explanatory covariates may account for varying differences in the OD across the quantiles of the outcome. The QRD can identify where there are greater differences in the outcome distribution, for example, 90th quantile, and how important the covariates are in explaining those differences. Much socioeconomic and health research relies on complex surveys, such as the National Health and Nutrition Examination Survey (NHANES), that oversample individuals from disadvantaged/minority population groups in order to provide improved precision. QRD has not been extended to the complex survey setting. We improve the QRD approach proposed in Machado and Mata (2005) to yield more reliable estimates at the quantiles, where the data are sparse, and extend it to the complex survey setting. We also propose a perturbation-based variance estimation method. Simulation studies indicate that the estimates of the unexplained portions of the OD across quantiles are unbiased and the coverage of the confidence intervals are close to nominal value. This methodology is used to study disparities in body mass index (BMI) and telomere length between race/ethnic groups estimated from the NHANES data.