Ecological Regression With Partial Identification

Abstract
Ecological inference (EI) is the process of learning about individual behavior from aggregate data. We relax assumptions by allowing for ``linear contextual effects,’’ which previous works have regarded as plausible but avoided due to non-identification, a problem we sidestep by deriving bounds instead of point estimates. In this way, we offer a conceptual framework to improve on the Duncan-Davis bound, derived more than sixty-five years ago. To study the effectiveness of our approach, we collect and analyze 8,430 2x2 EI datasets with known ground truth from several sources — thus bringing considerably more data to bear on the problem than the existing dozen or so datasets available in the literature for evaluating EI estimators. For the 88% of real data sets in our collection that fit a proposed rule, our approach reduces the width of the Duncan-Davis bound, on average, by about 44%, while still capturing the true district level parameter about 99% of the time. The remaining 12% revert to the Duncan-Davis bound.
Easy-to-use software is available that implements all the methods described in the paper.
See Also
- [Dataset] Replication Data for: Ecological Regression with Partial Identification
- [Book] A Solution to the Ecological Inference Problem: Reconstructing Individual Behavior from Aggregate Data (1997)
- [Paper] Aggregation Among Binary, Count, and Duration Models: Estimating the Same Quantities from Different Levels of Data (2001)
- [Paper] Bayesian and Frequentist Inference for Ecological Inference: The RxC Case (2001)
- [Paper] Binomial-Beta Hierarchical Models for Ecological Inference (1999)
- [Paper] Did Illegal Overseas Absentee Ballots Decide the 2000 U.S. Presidential Election? (2004)
- [Book] Ecological Inference (2006)
- [Book] Ecological Inference: New Methodological Strategies (2004)