Ecological Regression With Partial Identification

Abstract
Ecological inference (EI) is the process of learning about individual behavior from aggregate data. We relax assumptions by allowing for “linear contextual effects,” which previous works have regarded as plausible but avoided due to non-identification, a problem we sidestep by deriving bounds instead of point estimates. In this way, we offer a conceptual framework to improve on the Duncan-Davis bound, derived more than sixty-five years ago. To study the effectiveness of our approach, we collect and analyze 8,430 2x2 EI datasets with known ground truth from several sources — thus bringing considerably more data to bear on the problem than the existing dozen or so datasets available in the literature for evaluating EI estimators. For the 88% of real data sets in our collection that fit a proposed rule, our approach reduces the width of the Duncan-Davis bound, on average, by about 44%, while still capturing the true district level parameter about 99% of the time. The remaining 12% revert to the Duncan-Davis bound.
Easy-to-use software is available that implements all the methods described in the paper.
See Also
- [Dataset] Replication Data for: Ecological Regression with Partial Identification
- [Book] A Solution to the Ecological Inference Problem: Reconstructing Individual Behavior from Aggregate Data (1997)
- [Paper] Aggregation Among Binary, Count, and Duration Models: Estimating the Same Quantities from Different Levels of Data (2001)
- [Paper] Bayesian and Frequentist Inference for Ecological Inference: The RxC Case (2001)
- [Paper] Binomial-Beta Hierarchical Models for Ecological Inference (1999)
- [Paper] Did Illegal Overseas Absentee Ballots Decide the 2000 U.S. Presidential Election? (2004)
- [Book] Ecological Inference (2006)
- [Book] Ecological Inference: New Methodological Strategies (2004)