Sample Out-of-Sample Inference Based on Wasserstein Distance

被引:3
|
作者
Blanchet, Jose [1 ]
Kang, Yang [2 ]
机构
[1] Stanford Univ, Dept Management Sci & Engn, Stanford, CA 94305 USA
[2] Columbia Univ, Dept Stat, New York, NY 10027 USA
关键词
nonparametric statistics; probability; distributionally robust optimization; optimal transport; Wasserstein distance; EMPIRICAL LIKELIHOOD; CONFIDENCE-INTERVALS; ESTIMATING EQUATIONS; RATIO; OPTIMIZATION; BANDS;
D O I
10.1287/opre.2020.2028
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We present a novel inference approach that we call sample out-of-sample inference. The approach can be used widely, ranging from semisupervised learning to stress testing, and it is fundamental in the application of data-driven distributionally robust optimization. Our method enables measuring the impact of plausible out-of-sample scenarios in a given performance measure of interest, such as a financial loss. The methodology is inspired by empirical likelihood (EL), but we optimize the empirical Wasserstein distance (instead of the empirical likelihood) induced by observations. From a methodological standpoint, our analysis of the asymptotic behavior of the induced Wasserstein-distance profile function shows dramatic qualitative differences relative to EL. For instance, in contrast to EL, which typically yields chi-squared weak convergence limits, our asymptotic distributions are often not chi-squared. Also, the rates of convergence that we obtain have some dependence on the dimension in a nontrivial way but remain controlled as the dimension increases.
引用
收藏
页码:985 / 1013
页数:29
相关论文
共 50 条
  • [41] Supplier satisfaction: Explanation and out-of-sample prediction
    Vos, Frederik G. S.
    Schiele, Holger
    Huttinger, Lisa
    JOURNAL OF BUSINESS RESEARCH, 2016, 69 (10) : 4613 - 4623
  • [42] Control and Out-of-Sample Validation of Dependent Risks
    Gourieroux, Christian
    Liu, Wei
    JOURNAL OF RISK AND INSURANCE, 2009, 76 (03) : 683 - 707
  • [43] Spectral Embedded Clustering: A Framework for In-Sample and Out-of-Sample Spectral Clustering
    Nie, Feiping
    Zeng, Zinan
    Tsang, Ivor W.
    Xu, Dong
    Zhang, Changshui
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (11): : 1796 - 1808
  • [44] Out-of-Sample Embedding of Spherical Manifold Based on Constrained Least Squares
    Zhang, Yongpeng
    Lin, Zenggang
    Yao, Rui
    Zhu, Yu
    Li, Haisen
    INTELLIGENT SCIENCE AND INTELLIGENT DATA ENGINEERING, ISCIDE 2011, 2012, 7202 : 562 - 570
  • [45] An algorithm for enhancing spreadsheet regression with out-of-sample statistics
    Landram, Frank
    Pavur, Robert J.
    Alidaee, Bahram
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2008, 37 (08) : 1578 - 1592
  • [46] DATA REVISIONS AND OUT-OF-SAMPLE STOCK RETURN PREDICTABILITY
    Guo, Hui
    ECONOMIC INQUIRY, 2009, 47 (01) : 81 - 97
  • [47] Optimal out-of-sample forecast evaluation under stationarity
    Stanek, Filip
    JOURNAL OF FORECASTING, 2023, 42 (08) : 2249 - 2279
  • [48] Out-of-sample forecasting of housing bubble tipping points
    Ardila, Diego
    Sanadgol, Dorsa
    Sornette, Didier
    QUANTITATIVE FINANCE AND ECONOMICS, 2018, 2 (04): : 904 - 930
  • [49] An out-of-sample perspective on the assessment of incremental predictive validity
    Pratiwi B.C.
    Dusseldorp E.
    de Rooij M.
    Behaviormetrika, 2024, 51 (2) : 539 - 566
  • [50] Out-of-sample extension of graph adjacency spectral embedding
    Levin, Keith
    Roosta-Khorasani, Farbod
    Mahoney, Michael W.
    Priebe, Carey E.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80