Discriminant Analysis of Interval Data: An Assessment of Parametric and Distance-Based Approaches

Cited by: 20
Authors
Silva, A. Pedro Duarte [1, 2]
Brito, Paula [3, 4]
Affiliations
[1] Univ Catolica Portuguesa, Fac Econ & Gestao, Porto, Portugal
[2] Univ Catolica Portuguesa, CEGE, Porto, Portugal
[3] Univ Porto, Fac Econ, P-4100 Porto, Portugal
[4] Univ Porto, LIAAD INESC TEC, P-4100 Porto, Portugal
Keywords
Discriminant analysis; Interval data; Parametric modelling of interval data; Symbolic Data Analysis; STATISTICS;
DOI
10.1007/s00357-015-9189-8
Chinese Library Classification
O1 [Mathematics];
Discipline classification codes
0701 ; 070101 ;
Abstract
Building on probabilistic models for interval-valued variables, parametric classification rules, based on Normal or Skew-Normal distributions, are derived for interval data. The performance of such rules is then compared with distance-based methods previously investigated. The results show that Gaussian parametric approaches outperform Skew-Normal parametric and distance-based ones in most conditions analyzed. In particular, with heteroscedastic data a quadratic Gaussian rule always performs best. Moreover, restricted cases of the variance-covariance matrix lead to parsimonious rules which, for small training samples in heteroscedastic problems, can outperform unrestricted quadratic rules, even in some cases where the model assumed by these rules is not true. These restrictions take into account the particular nature of interval data, where observations are defined by both MidPoints and Ranges, which may or may not be correlated. Under homoscedastic conditions, linear Gaussian rules are often the best rules, but distance-based methods may perform better in very specific conditions.
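As a rough illustration of the kind of rule the abstract describes, the sketch below fits a quadratic Gaussian classification rule to interval data represented by MidPoints and Ranges. It is not the authors' implementation: the function names (`to_features`, `fit_quadratic_rule`, `predict`) are hypothetical, taking the logarithm of the Ranges is an assumption made here so that a Normal model is plausible, and the `block_diagonal` flag stands in for just one possible restricted covariance configuration (MidPoints uncorrelated with Ranges).

```python
import numpy as np

def to_features(lower, upper):
    """Map interval bounds of shape (n, p) to [MidPoints, log-Ranges] of shape (n, 2p)."""
    lower = np.asarray(lower, dtype=float)
    upper = np.asarray(upper, dtype=float)
    mid = (lower + upper) / 2.0
    log_rng = np.log(upper - lower)  # assumption: every range is strictly positive
    return np.hstack([mid, log_rng])

def fit_quadratic_rule(X, y, block_diagonal=False):
    """Estimate a prior, mean vector and covariance matrix for each class."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    p = X.shape[1] // 2  # first p columns are MidPoints, last p are log-Ranges
    params = {}
    for label in np.unique(y):
        Xk = X[y == label]
        cov = np.cov(Xk, rowvar=False)  # class-specific sample covariance
        if block_diagonal:
            # Restricted case: zero the MidPoint/log-Range cross-covariances.
            cov = cov.copy()
            cov[:p, p:] = 0.0
            cov[p:, :p] = 0.0
        params[label] = (len(Xk) / len(X), Xk.mean(axis=0), cov)
    return params

def predict(params, X):
    """Assign each row of X to the class with the largest quadratic Gaussian score."""
    X = np.asarray(X, dtype=float)
    labels = list(params)
    scores = []
    for label in labels:
        prior, mean, cov = params[label]
        diff = X - mean
        _, logdet = np.linalg.slogdet(cov)
        maha = np.einsum("ij,ij->i", diff @ np.linalg.inv(cov), diff)
        scores.append(np.log(prior) - 0.5 * logdet - 0.5 * maha)
    return np.array(labels)[np.argmax(np.array(scores), axis=0)]
```

Setting `block_diagonal=True` reduces the number of covariance parameters estimated per class, which is the sense in which restricted rules are more parsimonious when training samples are small.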
Pages: 516-541
Number of pages: 26
Related papers
50 in total
  • [21] Linear discriminant analysis for interval data
    António Pedro Duarte Silva
    Paula Brito
    Computational Statistics, 2006, 21 : 289 - 308
  • [22] Linear discriminant analysis for interval data
    Duarte Silva, Antonio Pedro
    Brito, Paula
    COMPUTATIONAL STATISTICS, 2006, 21 (02) : 289 - 308
  • [23] Distance-based ANOVA for functional data
    Pedott, Alexandre Homsi
    Fogliatto, Flavio Sanson
    EUROPEAN JOURNAL OF INDUSTRIAL ENGINEERING, 2016, 10 (06) : 760 - 776
  • [24] Non-parametric distance-based classification techniques and their applications
    Pla, Filiberto
    Radeva, Petia
    Vitria, Jordi
    PATTERN ANALYSIS AND APPLICATIONS, 2008, 11 (3-4) : 223 - 225
  • [25] Distance-Based Data Mining Over Encrypted Data
    Tex, Christine
    Schaeler, Martin
    Boehm, Klemens
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1264 - 1267
  • [26] Discriminant analysis of interval data using Monte Carlo method in assessment of overlap
    Jahanshahloo, G. R.
    Lotfi, F. Hosseinzadeh
    Balf, F. Rezai
    Rezai, H. Zhiani
    APPLIED MATHEMATICS AND COMPUTATION, 2007, 191 (02) : 521 - 532
  • [27] Non-parametric distance-based classification techniques and their applications
    Filiberto Pla
    Petia Radeva
    Jordi Vitrià
    Pattern Analysis and Applications, 2008, 11 : 223 - 225
  • [28] A distance-based statistical analysis of fuzzy number-valued data
    Sinova, B. (SMIRE@uniovi.es), 1600, Elsevier Inc. (55):
  • [29] Distance-based parametric bootstrap tests for clustering of species ranges
    Hennig, C
    Hausdorf, E
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 45 (04) : 875 - 895
  • [30] DADApy: Distance-based analysis of data-manifolds in Python
    Glielmo, Aldo
    Macocco, Iuri
    Doimo, Diego
    Carli, Matteo
    Zeni, Claudio
    Wild, Romina
    D'Errico, Maria
    Rodriguez, Alex
    Laio, Alessandro
    PATTERNS, 2022, 3 (10):