Class imbalance problem in short-term solar flare prediction

被引:9
|
作者
Wan, Jie [1 ,2 ]
Fu, Jun-Feng [2 ]
Liu, Jin-Fu [3 ]
Shi, Jia-Kui [2 ]
Jin, Cheng-Gang [1 ,2 ]
Zhang, Huai-Peng [3 ]
机构
[1] Harbin Inst Technol, Lab Space Environm & Phys Sci, Harbin 150001, Peoples R China
[2] Harbin Inst Technol, Sch Elect Engn & Automat, Harbin 150001, Peoples R China
[3] Harbin Inst Technol, Sch Energy Sci & Engn, Harbin 150001, Peoples R China
关键词
The Sun; Sun; X-rays; gamma rays; sunspots; magnetic fields; flares; methods; data analysis;
D O I
10.1088/1674-4527/21/9/237
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
Using data-driven algorithms to accurately forecast solar flares requires reliable data sets. The solar flare dataset is composed of many non-flaring samples with a small percentage of flaring samples. This is called the class imbalance problem in data mining tasks. The prediction model is sensitive to most classes of the original data set during training. Therefore, the class imbalance problem for building up the flare prediction model from observational data should be systematically discussed. Aiming at the problem of class imbalance, three strategies are proposed corresponding to the data set, loss function, and training process: Type I resamples the training samples, including oversampling for the minority class, undersampling, or mixed sampling for the majority class. Type II usually changes the decision-making boundary, assigning the majority and minority categories of prediction loss to different weights. Type III assigns different weights to the training samples, the majority categories are assigned smaller weights, and the minority categories are assigned larger weights to improve the training process of the prediction model. The main work of this paper compares these imbalance processing methods when building a flare prediction model and tries to find the optimal strategy. Our results show that among these strategies, the performance of oversampling and sample weighting is better than other strategies in most parameters, and the generality of resampling and changing the decision boundary is better.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Class imbalance problem in short-term solar flare prediction
    Jie Wan
    Jun-Feng Fu
    Jin-Fu Liu
    Jia-Kui Shi
    Cheng-Gang Jin
    Huai-Peng Zhang
    [J]. Research in Astronomy and Astrophysics, 2021, 21 (09) : 233 - 238
  • [2] SHORT-TERM SOLAR FLARE PREDICTION USING MULTIRESOLUTION PREDICTORS
    Yu, Daren
    Huang, Xin
    Hu, Qinghua
    Zhou, Rui
    Wang, Huaning
    Cui, Yanmei
    [J]. ASTROPHYSICAL JOURNAL, 2010, 709 (01): : 321 - 326
  • [3] Short-Term Solar Flare Prediction Using Predictor Teams
    Xin Huang
    Daren Yu
    Qinghua Hu
    Huaning Wang
    Yanmei Cui
    [J]. Solar Physics, 2010, 263 : 175 - 184
  • [4] Short-Term Solar Flare Prediction Using Predictor Teams
    Huang, Xin
    Yu, Daren
    Hu, Qinghua
    Wang, Huaning
    Cui, Yanmei
    [J]. SOLAR PHYSICS, 2010, 263 (1-2) : 175 - 184
  • [5] Short-Term Solar Flare Prediction Using a Sequential Supervised Learning Method
    Daren Yu
    Xin Huang
    Huaning Wang
    Yanmei Cui
    [J]. Solar Physics, 2009, 255 : 91 - 105
  • [6] Short-Term Solar Flare Prediction Using a Sequential Supervised Learning Method
    Yu, Daren
    Huang, Xin
    Wang, Huaning
    Cui, Yanmei
    [J]. SOLAR PHYSICS, 2009, 255 (01) : 91 - 105
  • [7] SHORT-TERM SOLAR FLARE LEVEL PREDICTION USING A BAYESIAN NETWORK APPROACH
    Yu, Daren
    Huang, Xin
    Wang, Huaning
    Cui, Yanmei
    Hu, Qinghua
    Zhou, Rui
    [J]. ASTROPHYSICAL JOURNAL, 2010, 710 (01): : 869 - 877
  • [8] Short-term solar flare prediction using image-case-based reasoning
    Jin-Fu Liu
    Fei Li
    Huai-Peng Zhang
    Da-Ren Yu
    [J]. Research in Astronomy and Astrophysics, 2017, 17 (11) : 73 - 86
  • [9] Automatic Short-Term Solar Flare Prediction Using Machine Learning and Sunspot Associations
    R. Qahwaji
    T. Colak
    [J]. Solar Physics, 2007, 241 : 195 - 211
  • [10] Automatic short-term solar flare prediction using machine learning and sunspot associations
    Qahwaji, R.
    Colak, T.
    [J]. SOLAR PHYSICS, 2007, 241 (01) : 195 - 211