Stability and optimization error of stochastic gradient descent for pairwise learning

被引:9
|
作者
Shen, Wei [1 ]
Yang, Zhenhuan [2 ]
Ying, Yiming [2 ]
Yuan, Xiaoming [3 ]
机构
[1] Hong Kong Baptist Univ, Dept Math, Kowloon Tong, Kowloon, Hong Kong, Peoples R China
[2] SUNY Albany, Dept Math & Stat, Albany, NY 12222 USA
[3] Univ Hong Kong, Dept Math, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
Stability; generalization; optimization error; stochastic gradient descent; pairwise learning; minimax statistical error; RANKING; BOUNDS; ALGORITHMS; AREA;
D O I
10.1142/S0219530519400062
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, we study the stability and its trade-off with optimization error for stochastic gradient descent (SOD) algorithms in the pairwise learning setting. Pairwise learning refers to a learning task which involves a loss function depending on pairs of instances among which notable examples are bipartite ranking, metric learning, area under ROC curve (AUC) maximization and minimum error entropy (MEE) principle. Our contribution is two-folded. Firstly, we establish the stability results for SGD for pairwise learning in the convex, strongly convex and non-convex settings, from which generalization errors can be naturally derived. Secondly, we establish the trade-off between stability and optimization error of SGD algorithms for pairwise learning. This is achieved by lower-bounding the sum of stability and optimization error by the minimax statistical error over a prescribed class of pairwise loss functions. From this fundamental trade-off, we obtain lower bounds for the optimization error of SGD algorithms and the excess expected risk over a class of pairwise losses. In addition, we illustrate our stability results by giving sonic specific examples of AUC maximization, metric learning and MEE.
引用
收藏
页码:887 / 927
页数:41
相关论文
共 50 条
  • [1] Simple Stochastic and Online Gradient Descent Algorithms for Pairwise Learning
    Yang, Zhenhuan
    Lei, Yunwen
    Wang, Puyu
    Yang, Tianbao
    Ying, Yiming
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [2] Stability and Differential Privacy of Stochastic Gradient Descent for Pairwise Learning with Non-Smooth Loss
    Yang, Zhenhuan
    Lei, Yunwen
    Lyu, Siwei
    Ying, Yiming
    [J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [3] Strong error analysis for stochastic gradient descent optimization algorithms
    Jentzen, Arnulf
    Kuckuck, Benno
    Neufeld, Ariel
    von Wurstemberger, Philippe
    [J]. IMA JOURNAL OF NUMERICAL ANALYSIS, 2021, 41 (01) : 455 - 492
  • [4] Error Analysis of Stochastic Gradient Descent Ranking
    Chen, Hong
    Tang, Yi
    Li, Luoqing
    Yuan, Yuan
    Li, Xuelong
    Tang, Yuanyan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (03) : 898 - 909
  • [5] Towards stability and optimality in stochastic gradient descent
    Toulis, Panos
    Tran, Dustin
    Airoldi, Edoardo M.
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 1290 - 1298
  • [6] Stability and Generalization of Decentralized Stochastic Gradient Descent
    Sun, Tao
    Li, Dongsheng
    Wang, Bao
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9756 - 9764
  • [7] Global Convergence and Stability of Stochastic Gradient Descent
    Patel, Vivak
    Zhang, Shushu
    Tian, Bowen
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [8] Stochastic gradient descent for optimization for nuclear systems
    Austin Williams
    Noah Walton
    Austin Maryanski
    Sandra Bogetic
    Wes Hines
    Vladimir Sobes
    [J]. Scientific Reports, 13
  • [9] Ant colony optimization and stochastic gradient descent
    Meuleau, N
    Dorigo, M
    [J]. ARTIFICIAL LIFE, 2002, 8 (02) : 103 - 121
  • [10] Stochastic Chebyshev Gradient Descent for Spectral Optimization
    Han, Insu
    Avron, Haim
    Shin, Jinwoo
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31