A study of re-sampling methods with regression modeling

被引:0
|
作者
Hossain, MA [1 ]
Woodburn, RL [1 ]
机构
[1] Blue Hawk, LLC, Dhaka, Bangladesh
来源
DATA MINING III | 2002年 / 6卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are an overwhelming number of applications of data mining that result in the use of regression models. For example, predicting the propensity of a customer to default on a credit card, or the likelihood that a prospect will respond to a direct marketing campaign. Unfortunately, the implementation constraints for many such useful applications restrict the type of predictive method used to simple linear or logistic regression. While the more sophisticated techniques (e.g Neural Nets [I]) have built in processes that make the resulting model the most predictive and robust, developing a robust linear/logistic regression model requires much care with an experienced hand. In business settings, most predictive models are built on a modeling data set and independently validated on a validation dataset. Often times, the modeling and validation data set have differences that cause the modeler to question whether the model will perform well in the future. This paper explores the use of resampling methods in the model building steps to help to build an optimal sample that not only fits both the modeling and validation sample well, but also holds up robustly. The resampling allows many more sample datasets to be considered and eliminates overfilling of the model sample.
引用
收藏
页码:83 / 91
页数:9
相关论文
共 50 条
  • [31] The Effect of Re-sampling on Incremental Nelder-Mead Simplex Algorithm: Distributed Regression in Wireless Sensor Networks
    Marandi, Parisa Jalili
    Mansooriazdeh, Muharram
    Charkari, Nasrollah Moghadam
    [J]. WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, PROCEEDINGS, 2008, 5258 : 420 - 431
  • [32] Generating Virtual Patients by Multivariate and Discrete Re-Sampling Techniques
    D. Teutonico
    F. Musuamba
    H. J. Maas
    A. Facius
    S. Yang
    M. Danhof
    O. Della Pasqua
    [J]. Pharmaceutical Research, 2015, 32 : 3228 - 3237
  • [33] How Re-sampling Helps for Long-Tail Learning?
    Shi, Jiang-Xin
    Wei, Tong
    Xiang, Yuke
    Li, Yu-Feng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [34] Decision Boundary Re-Sampling in Imbalanced Learning for Ulcer Detection
    Lee, Changhoo
    Shin, Dongwook
    Min, Junki
    Cha, Jaemyung
    Lee, Seungkyu
    [J]. IEEE ACCESS, 2020, 8 : 186274 - 186278
  • [35] Monitoring of a batch continuous process using mass re-sampling
    Martin, EB
    Bettoni, A
    Morris, AJ
    [J]. JOURNAL OF QUALITY TECHNOLOGY, 2002, 34 (02) : 171 - 186
  • [36] GENERATION OF HIGH RESOLUTION TEMPERATURE MAPS BY RE-SAMPLING TECHNIQUES
    Ignacio Bayala, Martin
    Eduardo Rivas, Raul
    Scavuzzo, Marcelo
    [J]. INTERCIENCIA, 2013, 38 (07) : 502 - 508
  • [37] Generating Virtual Patients by Multivariate and Discrete Re-Sampling Techniques
    Teutonico, D.
    Musuamba, F.
    Maas, H. J.
    Facius, A.
    Yang, S.
    Danhof, M.
    Della Pasqua, O.
    [J]. PHARMACEUTICAL RESEARCH, 2015, 32 (10) : 3228 - 3237
  • [38] Detection of multiple-strain carriers: The value of re-sampling
    Coen, PG
    Wilks, M
    Dall'Antonia, M
    Millar, M
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2006, 240 (01) : 98 - 103
  • [39] Re-sampling of continuous scanning LDV data for ODS extraction
    Castellini, P.
    Sopranzetti, F.
    Martarelli, M.
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2016, 72-73 : 667 - 677
  • [40] Remote Sensing Image Re-sampling Based on Object Detection
    Zhang, Yaping
    Chen, Xu
    Fug, Xiaoyong
    [J]. 2011 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND MULTIMEDIA COMMUNICATION, 2011, : 37 - 40