Predicting outliers

被引:0
|
作者
Torgo, L
Ribeiro, R
机构
[1] Univ Porto, LIACC, FEP, P-4150 Oporto, Portugal
[2] Univ Porto, LIACC, P-4150 Oporto, Portugal
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a method designed for data mining applications where the main goal is to predict extreme and rare values of a continuous target variable, as well as to understand under which conditions these values occur. Our objective is to induce models that are accurate at predicting these outliers but are also interpretable from the user perspective. We describe a new splitting criterion for regression trees that enables the induction of trees achieving these goals. We evaluate our proposal on several real world problems and contrast the obtained models with standard regression trees. The results of this evaluation show the clear advantage of our proposal in terms of the evaluation statistics that are relevant for these applications.
引用
收藏
页码:447 / 458
页数:12
相关论文
共 50 条
  • [41] M-estimator-based online sequential extreme learning machine for predicting chaotic time series with outliers
    Guo, Wei
    Xu, Tao
    Tang, Keming
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 (12): : 4093 - 4110
  • [42] M-estimator-based online sequential extreme learning machine for predicting chaotic time series with outliers
    Wei Guo
    Tao Xu
    Keming Tang
    Neural Computing and Applications, 2017, 28 : 4093 - 4110
  • [43] Check your outliers! An introduction to identifying statistical outliers in R with easystats
    Theriault, Remi
    Ben-Shachar, Mattan S.
    Patil, Indrajeet
    Luedecke, Daniel
    Wiernik, Brenton M.
    Makowski, Dominique
    BEHAVIOR RESEARCH METHODS, 2024, 56 (04) : 4162 - 4172
  • [44] Hiding outliers into crowd: Privacy-preserving data publishing with outliers
    Wang, Hui
    Liu, Ruilin
    DATA & KNOWLEDGE ENGINEERING, 2015, 100 : 94 - 115
  • [45] Outliers in SAR and QSAR: Is unusual binding mode a possible source of outliers?
    Ki Hwan Kim
    Journal of Computer-Aided Molecular Design, 2007, 21 : 63 - 86
  • [46] Variation in Caesarean Section Rates in the US: Outliers, Damned Outliers, and Statistics
    Smith, Gordon C. S.
    PLOS MEDICINE, 2014, 11 (10)
  • [47] Outliers in SAR and QSAR: Is unusual binding mode a possible source of outliers?
    Kim, Ki Hwan
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2007, 21 (1-3) : 63 - 86
  • [48] OUTLIERS The story of success
    Tavris, Carol
    TLS-THE TIMES LITERARY SUPPLEMENT, 2009, (5525): : 25 - 25
  • [49] Outliers: The Story of Success
    Hurteau, Bob
    MISSION STUDIES, 2010, 27 (02): : 239 - 240
  • [50] OUTLIERS: THE STORY OF SUCCESS
    Connolly, Terry
    JOURNAL OF BEHAVIORAL DECISION MAKING, 2010, 23 (03) : 331 - 333