Predicting outliers

被引:0
|
作者
Torgo, L
Ribeiro, R
机构
[1] Univ Porto, LIACC, FEP, P-4150 Oporto, Portugal
[2] Univ Porto, LIACC, P-4150 Oporto, Portugal
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a method designed for data mining applications where the main goal is to predict extreme and rare values of a continuous target variable, as well as to understand under which conditions these values occur. Our objective is to induce models that are accurate at predicting these outliers but are also interpretable from the user perspective. We describe a new splitting criterion for regression trees that enables the induction of trees achieving these goals. We evaluate our proposal on several real world problems and contrast the obtained models with standard regression trees. The results of this evaluation show the clear advantage of our proposal in terms of the evaluation statistics that are relevant for these applications.
引用
收藏
页码:447 / 458
页数:12
相关论文
共 50 条
  • [1] Predicting outliers in ensemble forecasts
    Siegert, Stefan
    Broecker, Jochen
    Kantz, Holger
    QUARTERLY JOURNAL OF THE ROYAL METEOROLOGICAL SOCIETY, 2011, 137 (660) : 1887 - 1897
  • [2] A METHOD FOR PREDICTING CHAOTIC TIME-SERIES WITH OUTLIERS
    ITOH, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 1995, 78 (05): : 44 - 53
  • [3] Why predicting outliers in software is a good thing to do!
    Schneidewind, Norm
    Hinchey, Mike
    ICECCS 2008: THIRTEENTH IEEE INTERNATIONAL CONFERENCE ON THE ENGINEERING OF COMPLEX COMPUTER SYSTEMS, PROCEEDINGS, 2008, : 91 - 97
  • [4] Effect of outliers in statistical modelling for predicting the outbreak of anthracnose in grapes (Vitis vinifera)
    Venugopalan, R.
    Rawal, R. D.
    INDIAN JOURNAL OF AGRICULTURAL SCIENCES, 2011, 81 (10): : 945 - 947
  • [5] Predicting Stock Return with Economic Constraint: Can Interquartile Range Truncate the Outliers?
    Dai, Zhifeng
    Chang, Xiaoming
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [6] A New Methodology Based on Imbalanced Classification for Predicting Outliers in Electricity Demand Time Series
    Javier Duque-Pintor, Francisco
    Jesus Fernandez-Gomez, Manuel
    Troncoso, Alicia
    Martinez-Alvarez, Francisco
    ENERGIES, 2016, 9 (09)
  • [7] The importance of climatic factors and outliers in predicting regional monthly campylobacteriosis risk in Georgia, USA
    J. Weisent
    W. Seaver
    A. Odoi
    B. Rohrbach
    International Journal of Biometeorology, 2014, 58 : 1865 - 1878
  • [8] The importance of climatic factors and outliers in predicting regional monthly campylobacteriosis risk in Georgia, USA
    Weisent, J.
    Seaver, W.
    Odoi, A.
    Rohrbach, B.
    INTERNATIONAL JOURNAL OF BIOMETEOROLOGY, 2014, 58 (09) : 1865 - 1878
  • [9] Outliers, part I: What are outliers?
    1600, Advanstar Communications Inc. (32):
  • [10] ON THE DETECTION OF MULTIVARIATE DATA OUTLIERS AND REGRESSION OUTLIERS
    LAZRAQ, A
    CLEROUX, R
    DATA ANALYSIS, LEARNING SYMBOLIC AND NUMERIC KNOWLEDGE, 1989, : 133 - 140