VOLUME 50 | NUMBER 4 | AUGUST 2015
Imputing Missing Race/Ethnicity in Pediatric Electronic Health Records: Reducing Bias with Use of U.S. Census Location and Surname Data
Keywords: Multiple imputation; U.S. Census location and surname data; race and ethnicity; health disparities
Objective: To assess the utility of imputing race/ethnicity using U.S. Census race/ethnicity, residential address, and surname information compared to standard missing data methods in a pediatric cohort.
Data Sources/Study Setting: Electronic health record data from 30 pediatric practices with known race/ethnicity.
Study Design: In a simulation experiment, we constructed dichotomous and continuous outcomes with pre-specified associations with known race/ethnicity. Bias was introduced by nonrandomly setting race/ethnicity to missing. We compared typical methods for handling missing race/ethnicity (multiple imputation alone with clinical factors, complete case analysis, indicator variables) to multiple imputation incorporating surname and address information.
Principal Findings: Imputation using U.S. Census information reduced bias for both continuous and dichotomous outcomes.
Conclusions: The new method reduces bias when race/ethnicity is partially, nonrandomly missing.
Copyright© 2017, Health Research & Educational Trust. All rights reserved. Content Disclaimer
Health Research & Educational Trust, 155 North Wacker, 4th Floor Chicago, IL 60606 (312) 422.2600