Sampling‐Based versus Design‐Based Uncertainty in Regression Analysis
Consider a researcher estimating the parameters of a regression function based on data for all 50 states in the United States or on data for all visits to a website. What is the interpretation of the estimated parameters and the standard errors? In practice, researchers typically assume that the sample is randomly drawn from a large population of interest and report standard errors that are designed to capture sampling variation. This is common even in applications where it is difficult to articulate what that population of interest is, and how it differs from the sample. In this article, we explore an alternative approach to inference, which is partly design‐based. In a design‐based setting, the values of some of the regressors can be manipulated, perhaps through a policy intervention. Design‐based uncertainty emanates from lack of knowledge about the values that the regression outcome would have taken under alternative interventions. We derive standard errors that account for design‐based uncertainty instead of, or in addition to, sampling‐based uncertainty. We show that our standard errors in general are smaller than the usual infinite‐population sampling‐based standard errors and provide conditions under which they coincide.
Year of publication: |
2020
|
---|---|
Authors: | Abadie, Alberto ; Athey, Susan ; Imbens, Guido W. ; Wooldridge, Jeffrey M. |
Published in: |
Econometrica. - The Econometric Society, ISSN 0012-9682, ZDB-ID 1477253-X. - Vol. 88.2020, 1, p. 265-296
|
Publisher: |
The Econometric Society |
Saved in:
Online Resource
Saved in favorites
Similar items by person
-
When should you adjust standard errors for clustering?
Abadie, Alberto, (2023)
-
Finite population causal standard errors
Abadie, Alberto, (2014)
-
When should you adjust standard errors for clustering?
Abadie, Alberto, (2017)
- More ...