Optimal Data-Driven Regression Discontinuity Plots

Cited by: 206
Authors
Calonico, Sebastian [1]
Cattaneo, Matias D. [2]
Titiunik, Rocio [3]
Affiliations
[1] Univ Miami, Dept Econ, Coral Gables, FL 33124 USA
[2] Univ Michigan, Dept Econ, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Dept Polit Sci, Ann Arbor, MI 48109 USA
Funding
National Science Foundation (USA)
Keywords
Binning; Partitioning; RD plots; Tuning parameter selection; Asymptotic normality; Convergence rates; Inference; Designs; Life
DOI
10.1080/01621459.2015.1017578
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Subject classification codes
020208; 070103; 0714
Abstract
Exploratory data analysis plays a central role in applied statistics and econometrics. In the popular regression-discontinuity (RD) design, the use of graphical analysis has been strongly advocated because it provides both easy presentation and transparent validation of the design. RD plots are nowadays widely used in applications, despite their formal properties being unknown: these plots are typically presented employing ad hoc choices of tuning parameters, which makes these procedures less automatic and more subjective. In this article, we formally study the most common RD plot based on an evenly spaced binning of the data, and propose several (optimal) data-driven choices for the number of bins depending on the goal of the researcher. These RD plots are constructed either to approximate the underlying unknown regression functions without imposing smoothness in the estimator, or to approximate the underlying variability of the raw data while smoothing out the otherwise uninformative scatterplot of the data. In addition, we introduce an alternative RD plot based on quantile spaced binning, study its formal properties, and propose similar (optimal) data-driven choices for the number of bins. The main proposed data-driven selectors employ spacings estimators, which are simple and easy to implement in applications because they do not require additional choices of tuning parameters. Altogether, our results offer an array of alternative RD plots that are objective and automatic when implemented, providing a reliable benchmark for graphical analysis in RD designs. We illustrate the performance of our automatic RD plots using several empirical examples and a Monte Carlo study. All results are readily available in R and STATA using the software packages described in Calonico, Cattaneo, and Titiunik. Supplementary materials for this article are available online.
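The binned sample means that an RD plot displays can be illustrated with a minimal sketch. It assumes the number of bins per side is supplied by the user; the data-driven selectors proposed in the article are not reproduced here. The function name `rd_bin_means` and its parameters are hypothetical and do not come from the authors' R or Stata packages.

```python
import numpy as np


def rd_bin_means(x, y, cutoff, n_bins, quantile_spaced=False):
    """Binned means of y on each side of the cutoff (illustrative only).

    n_bins is chosen by the caller; this is NOT the article's optimal selector.
    Returns {"left": (centers, means), "right": (centers, means)}.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    result = {}
    for side, mask in (("left", x < cutoff), ("right", x >= cutoff)):
        xs, ys = x[mask], y[mask]
        if quantile_spaced:
            # Bin edges at empirical quantiles of the running variable
            edges = np.quantile(xs, np.linspace(0.0, 1.0, n_bins + 1))
        elif side == "left":
            # Evenly spaced bins from the sample minimum up to the cutoff
            edges = np.linspace(xs.min(), cutoff, n_bins + 1)
        else:
            # Evenly spaced bins from the cutoff up to the sample maximum
            edges = np.linspace(cutoff, xs.max(), n_bins + 1)
        # Map each observation to a bin index in 0..n_bins-1
        idx = np.searchsorted(edges, xs, side="right") - 1
        idx = np.clip(idx, 0, n_bins - 1)
        counts = np.bincount(idx, minlength=n_bins)
        sums = np.bincount(idx, weights=ys, minlength=n_bins)
        keep = counts > 0  # drop empty bins to avoid division by zero
        centers = 0.5 * (edges[:-1] + edges[1:])
        result[side] = (centers[keep], sums[keep] / counts[keep])
    return result
```

Plotting the returned centers against the means, separately for each side, gives the dots of an RD plot. Setting `quantile_spaced=True` switches from evenly spaced to quantile spaced bins, so each bin holds roughly the same number of observations.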
Pages: 1753-1769
Number of pages: 17