Featured Logic chart

Regression Discontinuity Designs: A Comprehensive Guide

Regression Discontinuity (RD) designs are powerful quasi-experimental methods used to estimate causal effects. They leverage arbitrary cutoffs where the probability of receiving treatment changes abruptly, allowing for a robust comparison of outcomes for units just above and below the threshold. This approach effectively isolates the local average treatment effect, providing strong causal inference when randomized controlled trials are impractical or unethical.

Key Takeaways

1

RD designs exploit sharp cutoffs for robust causal inference.

2

Sharp RD: Treatment assignment is deterministic at the threshold.

3

Fuzzy RD: Treatment probability jumps discontinuously at cutoff.

4

Validity relies on continuity of potential outcomes around the cutoff.

5

Local linear regression is a best practice for RD estimation.

Regression Discontinuity Designs: A Comprehensive Guide

What are Regression Discontinuity (RD) Designs?

Regression Discontinuity (RD) designs are quasi-experimental research methods that estimate causal effects by exploiting an arbitrary cutoff point, or threshold, where the probability of receiving a treatment or intervention changes abruptly. This technique allows researchers to compare outcomes for individuals or units just above and below the threshold, effectively mimicking a randomized experiment in a narrow window around the cutoff. The core idea is that units very close to the threshold are similar in all unobserved characteristics, differing only in their treatment status. RD designs are particularly valuable when true randomization is not feasible, providing a rigorous way to infer causality. They come in two main types: Sharp RD, where treatment is deterministically assigned based on the cutoff, and Fuzzy RD, where the probability of treatment changes discontinuously.

  • Definition: Probability of treatment changes discontinuously at a specific threshold.
  • Purpose: Exploits arbitrary rules for robust causal inference.
  • Sharp RD: Treatment is deterministic and discontinuous at the cutoff.
  • Fuzzy RD: Treatment is partly determined by the running variable, showing a partial jump in probability.
  • Continuity Assumption: Potential outcomes are continuous in the running variable around the cutoff.
  • Validity Checks: Include balance tests for pre-determined characteristics and no bunching tests for the running variable density.

How are Sharp Regression Discontinuity Designs Implemented?

Implementing Sharp Regression Discontinuity designs involves estimating the treatment effect at the cutoff point. This can be done using either parametric or non-parametric methods. Parametric RD assumes a known functional form for the relationship between the outcome and the running variable, often using polynomials, and estimates the discontinuity at the cutoff. However, non-parametric RD, particularly local linear regression, is generally considered best practice because it avoids strong functional form assumptions. Local linear regression minimizes a weighted sum of squared errors, focusing on data points close to the cutoff using a kernel function and a carefully selected bandwidth. This bandwidth determines the range of data used for estimation, balancing bias and variance. Accounting for potential asymptotic bias in optimal bandwidths often involves undersmoothing or higher-order approximations to ensure robust results.

  • Parametric RD: Assumes a known functional form for outcomes, typically using polynomials.
  • Method: Estimates Y_i = f(x_i) + ρD_i + η_i, where D_i is the treatment indicator.
  • Non-parametric RD: Aims for no functional form assumptions, focusing on local estimation.
  • Local Linear Regression: Best practice, minimizes weighted sum of squared errors using a kernel function.
  • Bandwidth Selection: Crucial for determining the range of data, often using MSE approximation methods.
  • Bias Accounting: Addresses asymptotic bias through undersmoothing or higher-order approximations for accuracy.
  • Robustness: Requires showing consistent results across different functional forms and bandwidths.

What Robustness Tests and Considerations are Important for RD Designs?

Ensuring the credibility of Regression Discontinuity (RD) findings requires rigorous robustness tests and careful consideration of potential critiques. Robustness checks typically involve demonstrating that the estimated treatment effect remains consistent across different estimation methods, functional forms, and bandwidth choices. It is also vital to confirm that pre-determined characteristics of the units do not exhibit discontinuities at the cutoff, which would violate the core continuity assumption. Furthermore, a 'no bunching' test, such as the McCrary test, verifies that individuals are not manipulating the running variable to sort themselves into or out of treatment. Critiques often revolve around potential unobserved changes or systematic manipulation of the running variable near the cutoff. While RD designs offer strong internal validity by providing clean identification of a causal effect, their external validity is typically limited to the local effect for 'compliers' at the threshold, meaning the findings may not generalize broadly to the entire population.

  • Robustness: Results should be unaffected by lottery fixed effects and similar across estimation methods.
  • Balance Test: Pre-determined characteristics must be smooth at the cutoff, indicating no pre-treatment differences.
  • No Bunching Test: Density of the running variable should be smooth, preventing manipulation concerns.
  • Reasons for Failure: Manipulable running variable or systematic cutoff can invalidate results.
  • Possible Critiques: Consider changes in population composition or behavioral responses like travel for visa collection.
  • Internal Validity: RD designs offer clean identification of causal effects around the cutoff.
  • External Validity: The estimated effect is local, applying specifically to compliers at the threshold.

When is a Difference-in-Discontinuity Design Used?

A Difference-in-Discontinuity (DiD-RD) design is employed when researchers need to account for time-varying factors that might confound the treatment effect estimated by a standard RD design. This advanced method combines the sharp cutoff variation of an RD design with variation over time, typically using panel data (observations before and after the cutoff or policy change). The core concept is to capture the treatment effect net of other changes occurring simultaneously. For instance, if a policy change creates a discontinuity at a specific threshold, and there are also general time trends affecting outcomes, DiD-RD can isolate the policy's impact by comparing the discontinuity in the treatment group to a control group or period where the discontinuity does not apply. An example is Pinotti (2017), which used panel data around 'click days' for immigrant legalization to assess its impact on crime rates, effectively controlling for other temporal shifts.

  • Concept: Combines variation at a cutoff with variation over time, often using panel data.
  • Example: Pinotti (2017) used panel data around 'click days' for immigrant legalization.
  • Purpose: Captures the treatment effect net of other changes or time trends.
  • Application: Useful when a policy or intervention creates a discontinuity that also interacts with time-dependent factors.

How is Fuzzy Regression Discontinuity Implemented and Estimated?

Fuzzy Regression Discontinuity (Fuzzy RD) is implemented when the treatment assignment is not perfectly determined by the cutoff but rather the probability of treatment changes discontinuously at the threshold. This scenario is typically analyzed using an Instrumental Variable (IV) approach, where the cutoff indicator (Z_i = 1 if x_i >= x_0) serves as the instrument for the actual treatment received (D_i). The causal effect estimated in Fuzzy RD is the Local Average Treatment Effect (LATE), which applies specifically to 'compliers'—individuals whose treatment status is influenced by the cutoff—around the threshold x_0. Estimation can be parametric, often using Two-Stage Least Squares (TSLS), where the first stage models treatment probability based on the instrument and running variable, and the second stage uses the predicted treatment. Non-parametric estimation, such as the Wald Estimator or local linear regression, directly calculates the ratio of the jump in the outcome to the jump in treatment probability at the cutoff, focusing on a restricted sample near the threshold.

  • Fuzzy RD as Instrumental Variable (IV): The cutoff (Z_i) acts as an instrument for the treatment (D_i).
  • LATE (Local Average Treatment Effect): Estimates E[Y_1i - Y_0i | D_1i > D_0i], local for compliers at x_0.
  • Parametric Estimation: Achieved via Two-Stage Least Squares (TSLS).
  • First Stage: D_i = π_0 + π_1 Z_i + π_x f(x_i) + ε_i, predicting treatment.
  • Second Stage: Y_i = f(x_i) + ρ D̂_i + η_i, estimating the outcome with predicted treatment.
  • Non-parametric Estimation (Wald Estimator): ρ = ΔE[Y_i] / ΔE[D_i], restricting sample to a small range.
  • Non-parametric (Local Linear Regression): Estimates separate local linear regressions for Y_i and D_i, then takes their ratio.

What are Practical Applications of Fuzzy Regression Discontinuity?

Fuzzy Regression Discontinuity designs are widely applied in economics and social sciences to address selection bias and estimate causal effects in real-world settings. One notable application is Bleemer & Mehta (2022), which investigated the wage returns to economics degrees. They exploited a GPA threshold (2.8) for declaring an economics major, using it as a fuzzy cutoff to estimate the causal impact of majoring in economics on future earnings, finding a significant increase of $22,123. Another compelling example is Pinotti (2017), who studied the effect of immigrant legalization on crime rates. This research utilized Italy's 'Click days' for residence permits, where the time of application served as the running variable, creating a fuzzy discontinuity in legal status. Pinotti found that obtaining legal status decreased the crime rate by -0.006, demonstrating the design's ability to isolate the causal impact of policy interventions on social outcomes.

  • Bleemer & Mehta (2022): Studied wage returns to economics degrees.
  • Design: Exploited a GPA threshold (2.8) for economics major declaration.
  • Finding: Majoring in economics causally increases earnings by $22,123.
  • Pinotti (2017): Investigated immigrant legalization's effect on crime.
  • Design: Utilized Italy's 'Click days' for residence permits, with application time as the running variable.
  • Finding: Legal status decreased the crime rate by -0.006, highlighting policy impact.

Frequently Asked Questions

Q

What is the main difference between Sharp and Fuzzy RD?

A

Sharp RD features a deterministic treatment assignment at the cutoff, meaning everyone above or below receives treatment. Fuzzy RD, however, involves a discontinuous jump in the probability of receiving treatment, not a certainty.

Q

Why is the 'continuity assumption' crucial in RD designs?

A

The continuity assumption posits that, in the absence of treatment, potential outcomes would be smooth across the cutoff. This allows researchers to attribute any observed abrupt change in outcomes precisely to the treatment effect at the threshold.

Q

What is the 'local average treatment effect' in RD?

A

The local average treatment effect (LATE) is the causal impact estimated specifically for individuals very close to the cutoff point. It represents the effect for those whose treatment status is most directly influenced by crossing the threshold.

Related Mind Maps

View All

No Related Mind Maps Found

We couldn't find any related mind maps at the moment. Check back later or explore our other content.

Explore Mind Maps

Browse Categories

All Categories

© 3axislabs, Inc 2025. All rights reserved.