Adaptive Negative Curvature Descent with Applications in Non-convex Optimization

被引:0
|
作者
Liu, Mingrui [1 ]
Li, Zhe [1 ]
Wang, Xiaoyu [2 ]
Yi, Jinfeng [3 ]
Yang, Tianbao [1 ]
机构
[1] Univ Iowa, Dept Comp Sci, Iowa City, IA 52242 USA
[2] Intellifusion, Parlin, NJ USA
[3] JD AI Res, Stanford, CA USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Negative curvature descent (NCD) method has been utilized to design deterministic or stochastic algorithms for non-convex optimization aiming at finding second-order stationary points or local minima In existing studies, NCD needs to approximate the smallest eigen-value of the Hessian matrix with a sufficient precision (e.g., epsilon(2) << 1) in order to achieve a sufficiently accurate second-order stationary solution (i.e., lambda(min)(del(2)f(x)) >= -epsilon(2)). One issue with this approach is that the target precision epsilon(2) is usually set to be very small in order to find a high quality solution, which increases the complexity for computing a negative curvature. To address this issue, we propose an adaptive NCD to allow an adaptive error dependent on the current gradient's magnitude in approximating the smallest eigen-value of the Hessian, and to encourage competition between a noisy NCD step and gradient descent step. We consider the applications of the proposed adaptive NCD for both deterministic and stochastic non-convex optimization, and demonstrate that it can help reduce the the overall complexity in computing the negative curvatures during the course of optimization without sacrificing the iteration complexity.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Adaptive Federated Learning for Non-Convex Optimization Problems in Edge Computing Environment
    Qiao, Dewen
    Liu, Guiyan
    Guo, Songtao
    He, Jing
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (05): : 3478 - 3491
  • [42] A new accelerating method for global non-convex quadratic optimization with non-convex quadratic constraints
    Wu, Huizhuo
    Zhang, KeCun
    APPLIED MATHEMATICS AND COMPUTATION, 2008, 197 (02) : 810 - 818
  • [43] Almost sure convergence of stochastic composite objective mirror descent for non-convex non-smooth optimization
    Liang, Yuqing
    Xu, Dongpo
    Zhang, Naimin
    Mandic, Danilo P.
    OPTIMIZATION LETTERS, 2024, 18 (09) : 2113 - 2131
  • [44] Non-Convex Projection Adaptive Hammerstein Filtering
    Liu, Zhaoting
    Bao, Huiming
    Yao, Yingbiao
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (04) : 1813 - 1820
  • [45] A method to construct a quasi-normal cone for non-convex and non-smooth set and its applications to non-convex and non-smooth optimization
    Li, Hongwei
    Zhou, Dequn
    Liu, Qinghuai
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 1585 - +
  • [46] A Non-Euclidean Gradient Descent Framework for Non-Convex Matrix Factorization
    Hsieh, Ya-Ping
    Kao, Yu-Chun
    Mahabadi, Rabeeh Karimi
    Yurtsever, Alp
    Kyrillidis, Anastasios
    Cevher, Volkan
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (22) : 5917 - 5926
  • [47] Escaping Saddle Points for Zeroth-order Non-convex Optimization using Estimated Gradient Descent
    Bai, Qinbo
    Agarwal, Mridul
    Aggarwal, Vaneet
    2020 54TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2020, : 132 - 137
  • [48] GloptiNets: Scalable Non-Convex Optimization with Certificates
    Beugnot, Gaspard
    Mairal, Julien
    Rudi, Alessandro
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [49] AN EFFICIENT ALGORITHM FOR NON-CONVEX SPARSE OPTIMIZATION
    Wang, Yong
    Liu, Wanquan
    Zhou, Guanglu
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2019, 15 (04) : 2009 - 2021
  • [50] STABILITY FOR A CLASS OF NON-CONVEX OPTIMIZATION PROBLEMS
    ZALINESCU, C
    COMPTES RENDUS DE L ACADEMIE DES SCIENCES SERIE I-MATHEMATIQUE, 1988, 307 (12): : 643 - 646