Imputing Missing Race/Ethnicity in Pediatric Electronic Health Records: Reducing Bias with Use of US Census Location and Surname Data

被引:33
|
作者
Grundmeier, Robert W. [1 ]
Song, Lihai [1 ]
Ramos, Mark J. [1 ]
Fiks, Alexander G. [1 ]
Elliott, Marc N. [2 ]
Fremont, Allen [2 ]
Pace, Wilson [3 ]
Wasserman, Richard C. [4 ]
Localio, Russell [5 ]
机构
[1] Childrens Hosp Philadelphia, Philadelphia, PA 19104 USA
[2] RAND Corp, Santa Monica, CA USA
[3] Univ Colorado Denver, DART Net Inst, Aurora, CO USA
[4] Univ Vermont, Burlington, VT USA
[5] Univ Penn, Perelman Sch Med, Dept Biostat & Epidemiol, Philadelphia, PA 19104 USA
关键词
Multiple imputation; US Census location and surname data; race and ethnicity; health disparities; MULTIPLE IMPUTATION; DISPARITIES; CARE; RACE; REGRESSION; ETHNICITY; QUALITY;
D O I
10.1111/1475-6773.12295
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
ObjectiveTo assess the utility of imputing race/ethnicity using U.S. Census race/ethnicity, residential address, and surname information compared to standard missing data methods in a pediatric cohort. Data Sources/Study SettingElectronic health record data from 30 pediatric practices with known race/ethnicity. Study DesignIn a simulation experiment, we constructed dichotomous and continuous outcomes with pre-specified associations with known race/ethnicity. Bias was introduced by nonrandomly setting race/ethnicity to missing. We compared typical methods for handling missing race/ethnicity (multiple imputation alone with clinical factors, complete case analysis, indicator variables) to multiple imputation incorporating surname and address information. Principal FindingsImputation using U.S. Census information reduced bias for both continuous and dichotomous outcomes. ConclusionsThe new method reduces bias when race/ethnicity is partially, nonrandomly missing.
引用
收藏
页码:946 / 960
页数:15
相关论文
共 20 条
  • [1] Race and Ethnicity Data in Electronic Health Records-Striving for Clarity
    Yemane, Lahia
    Mateo, Camila M.
    Desai, Angel N.
    [J]. JAMA NETWORK OPEN, 2024, 7 (03)
  • [2] Use of name recognition software, census data and multiple imputation to predict missing data on ethnicity: application to cancer registry records
    Ronan Ryan
    Sally Vernon
    Gill Lawrence
    Sue Wilson
    [J]. BMC Medical Informatics and Decision Making, 12
  • [3] Use of name recognition software, census data and multiple imputation to predict missing data on ethnicity: application to cancer registry records
    Ryan, Ronan
    Vernon, Sally
    Lawrence, Gill
    Wilson, Sue
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2012, 12
  • [4] Health Equity Beyond Data Health Care Worker Perceptions of Race, Ethnicity, and Language Data Collection in Electronic Health Records
    Cruz, Taylor M.
    Smith, Sheridan A.
    [J]. MEDICAL CARE, 2021, 59 (05) : 379 - 385
  • [5] Adjusting for selection bias due to missing data in electronic health records-based research
    Peskoe, Sarah B.
    Arterburn, David
    Coleman, Karen J.
    Herrinton, Lisa J.
    Daniels, Michael J.
    Haneuse, Sebastien
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2021, 30 (10) : 2221 - 2238
  • [6] plasmode simulation to address confounding bias due to missing data in a large electronic health records dataset
    Puzhko, Svetlana
    Bartlett, Gillian
    Schuster, Tibor
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 : 388 - 388
  • [7] Addressing bias in preterm birth research: The role of advanced imputation techniques for missing race and ethnicity in perinatal health data
    Scroggins, Jihye Kim
    Hulchafo, Ismael Ibrahim
    Topaz, Maxim
    Cato, Kenrick
    Barcelona, Veronica
    [J]. ANNALS OF EPIDEMIOLOGY, 2024, 94 : 120 - 126
  • [8] Race and Ethnicity Data Quality and Imputation Using US Census Data in an Integrated Health System: The Kaiser Permanente Southern California Experience
    Derose, Stephen F.
    Contreras, Richard
    Coleman, Karen J.
    Koebnick, Corinna
    Jacobsen, Steven J.
    [J]. MEDICAL CARE RESEARCH AND REVIEW, 2013, 70 (03) : 330 - 345
  • [9] Investigating Bias from Missing Data in an Electronic Health Records-Based Study of Weight Loss After Bariatric Surgery
    Koffman, Lily
    Levis, Alexander W.
    Arterburn, David
    Coleman, Karen J.
    Herrinton, Lisa J.
    Cooper, Julie
    Ewing, John
    Fischer, Heidi
    Fraser, James R.
    Johnson, Eric
    Taylor, Brianna
    Theis, Mary Kay
    Liu, Liyan
    Courcoulas, Anita
    Li, Robert
    Fisher, David P.
    Amsden, Laura
    Haneuse, Sebastien
    [J]. OBESITY SURGERY, 2021, 31 (05) : 2125 - 2135
  • [10] Investigating Bias from Missing Data in an Electronic Health Records-Based Study of Weight Loss After Bariatric Surgery
    Lily Koffman
    Alexander W. Levis
    David Arterburn
    Karen J. Coleman
    Lisa J. Herrinton
    Julie Cooper
    John Ewing
    Heidi Fischer
    James R. Fraser
    Eric Johnson
    Brianna Taylor
    Mary Kay Theis
    Liyan Liu
    Anita Courcoulas
    Robert Li
    David P. Fisher
    Laura Amsden
    Sebastien Haneuse
    [J]. Obesity Surgery, 2021, 31 : 2125 - 2135