Stable Learning via Sample Reweighting and Dual Classifiers

被引:0
|
作者
Yang S. [1 ,2 ]
Wang H. [1 ,2 ]
Yu K. [1 ,2 ]
Cao F.-Y. [3 ]
机构
[1] Key Laboratory of Knowledge Engineering with Big Data, Hefei University of Technology, Hefei
[2] School of Computer Science and Information Engineering, Hefei University of Technology, Hefei
[3] School of Computer and Information Technology, Shanxi University, Taiyuan
来源
Ruan Jian Xue Bao/Journal of Software | 2023年 / 34卷 / 07期
关键词
distribution shift; feature selection; sample reweighting; stable learning;
D O I
10.13328/j.cnki.jos.006511
中图分类号
学科分类号
摘要
Stable learning aims to leverage the knowledge obtained only from a single training data to learn a robust prediction model for accurately predicting label of the test data from a different but related distribution. To achieve promising performance on the test data with agnostic distributions, existing stable learning algorithms focus on eliminating the spurious correlations between the features and the class variable. However, these algorithms can only weaken part of the spurious correlations between the features and the class variable, but can not completely eliminate the spurious correlations. Furthermore, these algorithms may encounter the overfitting problem in learning the prediction model. To tackle these issues, this study proposes a sample reweighting and dual classifiers based stable learning algorithm, which jointly optimizes the weights of samples and the parameters of dual classifiers to learn a robust prediction model. Specifically, to estimate the effects of all features on classification, the proposed algorithm balances the distribution of confunders by learning global sample weights to remove the spurious correlations between the features and the class variable. In order to eliminate the spurious correlations between some irrelevant features and the class variable and weaken the influence of irrelevant features on the weighting process of samples, the proposed algorithm selects and removes some irrelevant features before sample reweighting. To further improve the generalization ability of the model, the algorithm constructs two classifiers and learns a prediction model with an optimal hyperplane by minimizing the parameter difference between the two classifiers during learning the prediction model. Using synthetic and real-world datasets, the experiments have validated the effectiveness of the proposed algorithm. © 2023 Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:3206 / 3225
页数:19
相关论文
共 23 条
  • [1] Fan CH, Yi JY, Tao JH, Tian ZK, Liu B, Wen ZQ., Gated recurrent fusion with joint training framework for robust end-to-end speech recognition, IEEE/ACM Trans. on Audio, Speech, and Language Processing, 29, pp. 198-209, (2021)
  • [2] Kumar Y, Sahrawat D, Maheshwari S, Mahata D, Stent A, Yin YF, Shah RR, Zimmermann R., Harnessing gans for zero-shot learning of new classes in visual speech recognition, Proc. of the 34th AAAI Conf. on Artificial Intelligence, pp. 2645-2652, (2020)
  • [3] Wang NY, Ye YX, Liu L, Feng LZ, Bao T, Peng T., Language models based on deep learning: A review, Ruan Jian Xue Bao/Journal of Software, 32, 4, pp. 1082-1115, (2021)
  • [4] Pei YT, Huang YP, Zou Q, Zhang XY, Wang S., Effects of image degradation and degradation removal to CNN-based image classification, IEEE Trans. on Pattern Analysis and Machine Intelligence, 43, 4, pp. 1239-1253, (2021)
  • [5] Liu CY, Li J, He L, Plaza A, Li ST, Li B., Naive gabor networks for hyperspectral image classification, IEEE Trans. on Neural Networks and Learning Systems, 32, 1, pp. 376-390, (2021)
  • [6] Cai Q, Pan YW, Wang Y, Liu JG, Yao T, Mei T., Learning a unified sample weighting network for object detection, Proc. of the 2020 IEEE/CVF Conf. on Computer Vision and Pattern Recognition, pp. 14161-14170, (2020)
  • [7] Wu Y, Chen YP, Yuan L, Liu ZC, Wang LJ, Li HZ, Fu Y., Rethinking classification and localization for object detection, Proc. of the 2020 IEEE/CVF Conf. on Computer Vision and Pattern Recognition, pp. 10183-10192, (2020)
  • [8] Qi L, Yu PZ, Gao Y., Research on weak-supervised person re-identification, Ruan Jian Xue Bao/Journal of Software, 31, 9, pp. 2883-2902, (2020)
  • [9] Zhuang FZ, Qi ZY, Duan KY, Xi DB, Zhu YC, Zhu HS, Xiong H, He Q., A comprehensive survey on transfer learning, Proc. of the IEEE, 109, 1, pp. 43-76, (2021)
  • [10] Cai RC, Chen W, Zhang K, Hao ZF., A survey on non-temporal series observational data based causal discovery, Chinese Journal of Computers, 40, 6, pp. 1470-1490, (2017)