On Tail Decay Rate Estimation of Loss Function Distributions

Authors
Haxholli, Etrit [1]
Lorenzi, Marco [1]
Affiliations
[1] Univ Côte d'Azur, Epione Res Grp, Inria, 2004 Rte Lucioles, F-06902 Valbonne, France
Keywords
Extreme Value Theory; Tail Modelling; Peaks-Over-Threshold; Cross-Tail-Estimation; Model Ranking
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline Classification Code
0812
Abstract
The study of loss-function distributions is critical to characterize a model's behaviour on a given machine-learning problem. While model quality is commonly measured by the average loss assessed on a testing set, this quantity does not ascertain the existence of the mean of the loss distribution. Conversely, the existence of a distribution's statistical moments can be verified by examining the thickness of its tails. Cross-validation schemes determine a family of testing loss distributions conditioned on the training sets. By marginalizing across training sets, we can recover the overall (marginal) loss distribution, whose tail shape we aim to estimate. Small sample sizes diminish the reliability and efficiency of classical tail-estimation methods like Peaks-Over-Threshold, and we demonstrate that this effect is notably significant when estimating tails of marginal distributions composed of conditional distributions with substantial tail-location variability. We mitigate this problem by utilizing a result we prove: under certain conditions, the marginal distribution's tail-shape parameter is the maximum tail-shape parameter across the conditional distributions underlying the marginal. We label the resulting approach 'cross-tail estimation' (CTE). We test CTE in a series of experiments on simulated and real data, showing the improved robustness and quality of tail estimation as compared to classical approaches.
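The idea stated in the abstract (estimate a tail-shape parameter for each conditional loss distribution, then take the maximum) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names `pot_shape_pwm` and `cross_tail_estimate` are hypothetical, the threshold is a fixed empirical quantile, and the generalized Pareto shape is estimated with probability-weighted moments (valid only when the shape is below 1) rather than whatever estimator the authors use.

```python
import math
import random


def pot_shape_pwm(losses, quantile=0.9):
    """Peaks-over-threshold shape estimate for one sample of losses.

    Assumption: exceedances over the empirical `quantile` threshold follow a
    generalized Pareto distribution, and its shape xi (< 1) is estimated by
    probability-weighted moments. This is an illustrative choice only.
    """
    xs = sorted(losses)
    threshold = xs[int(quantile * len(xs))]
    exceedances = [x - threshold for x in xs if x > threshold]  # ascending
    k = len(exceedances)
    if k < 2:
        raise ValueError("not enough exceedances above the threshold")
    # a0 estimates E[Y]; a1 estimates E[Y * (1 - F(Y))],
    # using plotting positions (i - 0.35) / k for the i-th order statistic.
    a0 = sum(exceedances) / k
    a1 = sum((1.0 - (i + 0.65) / k) * y for i, y in enumerate(exceedances)) / k
    return 2.0 - a0 / (a0 - 2.0 * a1)


def cross_tail_estimate(conditional_losses, quantile=0.9):
    """Maximum per-sample shape estimate across conditional loss samples.

    `conditional_losses` is a list of loss samples, one per training set
    (for example, one per cross-validation fold).
    """
    return max(pot_shape_pwm(sample, quantile) for sample in conditional_losses)


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical folds: two light-tailed (exponential, shape 0) and
    # one heavy-tailed (Pareto with alpha = 2, shape 0.5).
    folds = [
        [rng.expovariate(1.0) for _ in range(5000)],
        [rng.expovariate(2.0) for _ in range(5000)],
        [rng.paretovariate(2.0) for _ in range(5000)],
    ]
    print("per-fold shapes:", [round(pot_shape_pwm(f), 3) for f in folds])
    print("cross-tail estimate:", round(cross_tail_estimate(folds), 3))
```

In this sketch each fold is fitted separately, so differences in tail location between folds do not blur the exceedances, which is the situation the abstract identifies as problematic for a single fit on the pooled sample.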
Pages: 47