A stochastic gradient descent algorithm for structural risk minimisation

Cited by: 0
Authors
Ratsaby, J [1]
Affiliation
[1] UCL, London WC1E 6BT, England
Source
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Structural risk minimisation (SRM) is a general complexity-regularisation method that automatically selects the model complexity which approximately minimises the misclassification error probability of the empirical risk minimiser. It does so by adding a complexity penalty term $\epsilon(m, k)$ to the empirical risk of the candidate hypotheses and then, for any fixed sample size $m$, minimising the sum with respect to the model complexity variable $k$. When learning multicategory classification there are $M$ subsamples, of sizes $m_i$, corresponding to the $M$ pattern classes with a priori probabilities $p_i$, $1 \le i \le M$. Using the usual representation of a multi-category classifier as $M$ individual boolean classifiers, the penalty becomes $\sum_{i=1}^{M} p_i \, \epsilon(m_i, k_i)$. If the $m_i$ are given, then standard SRM applies trivially by minimising the penalised empirical risk with respect to $k_i$, $i = 1, \dots, M$. However, in situations where the total sample size $\sum_{i=1}^{M} m_i$ needs to be minimal, one also needs to minimise the penalised empirical risk with respect to the variables $m_i$, $i = 1, \dots, M$. The obvious problem is that the empirical risk can only be defined after the subsamples (and hence their sizes) are known. Utilising an on-line stochastic gradient descent approach, this paper overcomes this difficulty and introduces a sample-querying algorithm which extends the standard SRM principle. It minimises the penalised empirical risk not only with respect to the $k_i$, as standard SRM does, but also with respect to the $m_i$, $i = 1, \dots, M$. The challenge here is in defining a stochastic empirical criterion which, when minimised, yields a sequence of subsample-size vectors that asymptotically achieve the Bayes' optimal error convergence rate.
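The standard SRM step described in the abstract (subsample sizes given, minimise over the complexities) can be illustrated with a minimal Python sketch. It is not the paper's algorithm and does not cover the sample-querying extension. Several things are assumptions made only for illustration: the penalty form sqrt(k ln m / m), the function names `penalty` and `srm_select`, the toy empirical-risk values, and the premise that the penalised risk separates across classes as a sum of p_i times (empirical risk plus penalty), so that each k_i can be chosen independently.

```python
import math


def penalty(m: int, k: int) -> float:
    """Hypothetical penalty epsilon(m, k); the form sqrt(k ln m / m) is an assumption."""
    return math.sqrt(k * math.log(m) / m)


def srm_select(emp_risk, m):
    """Pick, for each class i, the complexity k_i minimising
    emp_risk[i][k_i - 1] + penalty(m[i], k_i).

    emp_risk[i] lists the (assumed) empirical risk of class i's boolean
    classifier at complexities k = 1, 2, ...; m[i] is the given subsample size.
    """
    chosen = []
    for risks_i, m_i in zip(emp_risk, m):
        scores = {k: r + penalty(m_i, k) for k, r in enumerate(risks_i, start=1)}
        chosen.append(min(scores, key=scores.get))
    return chosen


def penalised_risk(emp_risk, m, p, ks):
    """Total criterion: sum_i p_i * (empirical risk + penalty) at the chosen k_i."""
    return sum(
        p_i * (risks_i[k - 1] + penalty(m_i, k))
        for risks_i, m_i, p_i, k in zip(emp_risk, m, p, ks)
    )


# Toy data (invented for illustration): two classes, three complexity levels each.
toy_risk = [[0.30, 0.10, 0.09], [0.20, 0.19, 0.18]]
toy_m = [100, 100]
toy_p = [0.5, 0.5]

ks = srm_select(toy_risk, toy_m)
print(ks)  # [2, 1]
print(round(penalised_risk(toy_risk, toy_m, toy_p, ks), 4))
```

In this toy setting the first class gains enough from the drop in empirical risk to justify k = 2, while for the second class the penalty outweighs the small improvement, so k = 1 is kept. The sample-querying extension would additionally treat the `toy_m` entries as variables, which this sketch does not attempt.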
Pages: 205-220
Number of pages: 16