Incremental PID Controller-Based Learning Rate Scheduler for Stochastic Gradient Descent

Citations: 0
Authors
Wang, Zenghui [1]
Zhang, Jun [2]
Affiliations
[1] Anhui Univ, Sch Elect Engn & Automation, Key Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei 230601, Peoples R China
[2] Anhui Univ, Sch Artificial Intelligence, Hefei 230601, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Training; PD control; PI control; Feedback control; Convergence; Oscillators; Optimization; incremental proportional-integral-derivative (PID) controller; learning rate scheduler; stochastic gradient descent (SGD);
DOI
10.1109/TNNLS.2022.3213677
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The learning rate plays a vital role in deep neural network (DNN) training. This study introduces an incremental proportional-integral-derivative (PID) controller, widely used in automatic control, as a learning rate scheduler for stochastic gradient descent (SGD). To compute the current learning rate automatically, we use feedback control to relate training losses to learning rates. The resulting schedulers, named incremental PID learning rates, include PID-Base and PID-Warmup. The new schedulers reduce the dependence on the initial learning rate and achieve higher accuracy. Compared with multistep learning rates (MSLR), cyclical learning rates (CLR), and SGD with warm restarts (SGDR), incremental PID learning rates based on feedback control obtain higher accuracy on CIFAR-10, CIFAR-100, and Tiny-ImageNet-200. We believe that our methods can improve the performance of SGD.
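The abstract does not give the controller equations, so the following is only a minimal sketch of how a textbook incremental PID law could drive a learning rate from the training loss. It is not the paper's PID-Base or PID-Warmup. The class name, the gains `kp`, `ki`, `kd`, the `target_loss` setpoint, the sign of the error, and the clamping bounds are all assumptions made for illustration.

```python
class IncrementalPIDLearningRate:
    """Hypothetical sketch: update a learning rate with an incremental PID law.

    Textbook incremental form, with e_k the error at step k:
        delta_k = kp*(e_k - e_{k-1}) + ki*e_k + kd*(e_k - 2*e_{k-1} + e_{k-2})
        lr_k    = lr_{k-1} + delta_k
    """

    def __init__(self, lr, kp, ki, kd, target_loss=0.0, lr_min=1e-6, lr_max=1.0):
        self.lr = lr
        self.kp, self.ki, self.kd = kp, ki, kd
        self.target_loss = target_loss  # assumed setpoint, not from the paper
        self.lr_min, self.lr_max = lr_min, lr_max  # assumed safety bounds
        self.e1 = 0.0  # error at step k-1
        self.e2 = 0.0  # error at step k-2

    def step(self, loss):
        # Assumed error definition: how far the loss is above the setpoint.
        e = loss - self.target_loss
        delta = (self.kp * (e - self.e1)
                 + self.ki * e
                 + self.kd * (e - 2.0 * self.e1 + self.e2))
        # Apply the increment, then clamp to the assumed bounds.
        self.lr = min(self.lr_max, max(self.lr_min, self.lr + delta))
        # Shift the error history for the next call.
        self.e2, self.e1 = self.e1, e
        return self.lr
```

In a training loop, one would call `step(loss)` once per iteration or epoch and write the returned value into the optimizer's learning rate. Because the law outputs an increment rather than an absolute value, only the two previous errors need to be stored.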
Pages: 7060-7071
Page count: 12