VOLUME 53 | NUMBER 6 | DECEMBER 2018
Matching and Regression to the Mean in DifferenceinDifferences Analysis
Objective: To demonstrate regression to the mean bias introduced by matching on preperiod variables in differenceindifferences studies.
Data Sources: Simulated data.
Study Design: We performed a Monte Carlo simulation to estimate the effect of a placebo intervention on simulated longitudinal data for units in treatment and control groups using unmatched and matched differenceindifferences analyses. We varied the preperiod level and trend differences between the treatment and control groups, and the serial correlation of the matching variables. We assessed estimator bias as the mean absolute deviation of estimated program effects from the true value of zero.
Principal Findings: When preperiod outcome level is correlated with treatment assignment, an unmatched analysis is unbiased, but matching units on preperiod outcome levels produces biased estimates. The bias increases with greater preperiod level differences and weaker serial correlation in the outcome. This problem extends to matching on preperiod level of a timevarying covariate. When treatment assignment is correlated with preperiod trend only, the unmatched analysis is biased, and matching units on preperiod level or trend does not introduce additional bias.
Conclusions: Researchers should be aware of the threat of regression to the mean when constructing matched samples for differenceindifferences. We provide guidance on when to incorporate matching in this study design.
Copyright© 2018, Health Research & Educational Trust. All rights reserved. Content Disclaimer
Health Research & Educational Trust, 155 North Wacker, 4th Floor Chicago, IL 60606 (312) 422.2600