A cost-sensitive active learning algorithm: toward imbalanced time series forecasting

被引:0
|
作者
Jing Zhang
Qun Dai
机构
[1] Nanjing University of Aeronautics and Astronautics,College of Computer Science and Technology
来源
关键词
Time series forecasting (TSF); Data imbalance; Cost-sensitive learning (CSL); Active learning (AL);
D O I
暂无
中图分类号
学科分类号
摘要
Recently, many outstanding techniques for Time series forecasting (TSF) have been proposed. These techniques depend on necessary and sufficient data samples, which is the key to train a good predictor. Thus, an Active learning (AL) algorithmic framework based on Support vector regression (SVR) is designed for TSF, with the goal to choose the most valuable samples and reduce the complexity of the training set. To evaluate the quality of samples comprehensively, multiple essential criteria, such as informativeness, representativeness and diversity, are considered in a two clustering-based consecutive stages procedure. In addition, considering the imbalance of time series data, a range of values might be seriously under-represented but extremely important to the user. Thus, it is unreasonable to assign the same prediction cost to each sample. To address this imbalance problem, a multiple criteria cost-sensitive active learning algorithm in the virtue of weight SVR architecture, abbreviated as MAW-SVR, ad hoc for imbalanced TSF, is proposed. By introducing the cost-sensitive scheme, each sample is endowed with a penalty weight, which can be dynamically updated in the AL procedure. The experimental comparisons between MAW-SVR and the other six AL algorithms on a total of thirty time series datasets verify the effectiveness of the proposed algorithm.
引用
收藏
页码:6953 / 6972
页数:19
相关论文
共 50 条
  • [31] Cost-Sensitive Learning for Imbalanced Bad Debt Datasets in Healthcare Industry
    Shi, Donghui
    Guan, Jian
    Zurada, Jozef
    2015 ASIA-PACIFIC CONFERENCE ON COMPUTER-AIDED SYSTEM ENGINEERING - APCASE 2015, 2015, : 30 - 35
  • [32] Cost-Sensitive Latent Space Learning for Imbalanced PolSAR Image Classification
    Wu, Qian
    Hou, Biao
    Wen, Zaidao
    Ren, Zhongle
    Jiao, Licheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (06): : 4802 - 4817
  • [33] Resampling and Cost-Sensitive Methods for Imbalanced Multi-instance Learning
    Wang, Xiaoguang
    Liu, Xuan
    Japkowicz, Nathalie
    Matwin, Stan
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 808 - 816
  • [34] Cost-sensitive boosting for classification of imbalanced data
    Sun, Yamnin
    Kamel, Mohamed S.
    Wong, Andrew K. C.
    Wang, Yang
    PATTERN RECOGNITION, 2007, 40 (12) : 3358 - 3378
  • [35] Cost-Sensitive Learning
    Zhou, Zlii-Hua
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, MDAI 2011, 2011, 6820 : 17 - 18
  • [36] Cost-Sensitive Active Learning for Dialogue State Tracking
    Xie, Kaige
    Chang, Cheng
    Ren, Liliang
    Chen, Lu
    Yu, Kai
    19TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2018), 2018, : 209 - 213
  • [37] Annotation cost-sensitive active learning by tree sampling
    Tsou, Yu-Lin
    Lin, Hsuan-Tien
    MACHINE LEARNING, 2019, 108 (05) : 785 - 807
  • [38] A novel cost-sensitive algorithm and new evaluation strategies for regression in imbalanced domains
    Sadouk, Lamyaa
    Gadi, Taoufiq
    Essoufi, El Hassan
    EXPERT SYSTEMS, 2021, 38 (04)
  • [39] Cost-sensitive active learning through statistical methods
    Wang, Min
    Lin, Yao
    Min, Fan
    Liu, Dun
    INFORMATION SCIENCES, 2019, 501 : 460 - 482
  • [40] Cost-Sensitive Active Learning for Intracranial Hemorrhage Detection
    Kuo, Weicheng
    Hane, Christian
    Yuh, Esther
    Mukherjee, Pratik
    Malik, Jitendra
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, PT III, 2018, 11072 : 715 - 723