CEM: Coarsened Exact Matching Software

Authors:  Stefano Iacus, Gary King, Giuseppe Porro

This program is designed to improve the estimation of causal effects via an extremely powerful method of matching that is widely applicable and exceptionally easy to understand and use (if you understand how to draw a histogram, you will understand this method). The program implements the Coarsened Exact Matching (CEM) algorithm described in:

Stefano M. Iacus, Gary King, and Giuseppe Porro, "Causal Inference Without Balance Checking: Coarsened Exact Matching" and "Multivariate Matching Methods That are Monotonic Imbalance Bounding".

Matching is a nonparametric method of preprocessing data to control for some or all of the potentially confounding influence of pretreatment control variables by reducing imbalance between the treated and control groups. After preprocessing in this way, any method of analysis that would have been used without matching can be applied to estimate causal effects, although some methods will have even better properties. CEM is a Monotonoic Imbalance Bounding (MIB) matching method --- which means that the balance between the treated and control groups is chosen by the user ex ante rather than discovered through the usual laborious process of checking after the fact and repeatedly reestimating, and so that adjusting the imbalance on one variable has no effect on the maximum imbalance of any other. CEM also strictly bounds through ex ante user choice both the degree of model dependence and the average treatment effect estimation error, eliminates the need for a separate procedure to restrict data to common empirical support, meets the congruence principle, is robust to measurement error, works well with multiple imputation methods for missing data, can be completely automated, and is extremely fast computationally even with very large data sets. After preprocessing data with CEM, the analyst may then use a simple difference in means or whatever statistical model they would have applied without matching. CEM also works well for multicategory treatments, determining blocks in experimental designs, and evaluating extreme counterfactuals.

  • CEM Package for R:
  • MatchIt Package for R: Most of the features of CEM are also available through the MatchIt Package front end.

  • CEM for Stata (version 10 or later):

    • To install, type:
      net from http://gking.harvard.edu/cem/
      net install cem
    • You can also install from the SSC:
      ssc install cem
    • For documentation, type "help cem" or download PDF (or the published version in The Stata Journal: PDF).

  • CEM for SPSS: Website

  • Please send ALL questions, bugs, and requests to our mailing List (you must subscribe to send): [Un]Subscribe, or Browse/Search Archives

Recommended Releases

Version Package Date
1.1.14 Download (1.91 MB) Release info Oct 1 2014

Recent Releases

Version Package Date
1.1.8 Download (673.86 KB) Release info Mar 19 2014
1.1.7 Download (673.84 KB) Release info Mar 18 2014
1.1.6 Download (710.09 KB) Release info Mar 18 2014
1.1.5 Download (1.64 MB) Release info Mar 12 2014
1.1.4 Download (868.78 KB) Release info Jun 11 2012