No Internal Regret with Non-convex Loss Functions

Cited by: 0
Author
Sharma, Dravyansh [1]
Affiliation
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Keywords
ALGORITHMS;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Internal regret is a performance measure for online learning algorithms: it quantifies the change in performance obtained by substituting every occurrence of a given action i with an alternative action j. Algorithms for minimizing internal regret are known for the finite experts setting, including a general reduction to the problem of minimizing external regret for this case. The reduction, however, crucially depends on the finiteness of the action space. In this work we approach the problem of minimizing internal regret for a continuous action space. For the full information setting, we show how to obtain $\tilde{O}(\sqrt{T})$ internal regret for the class of Lipschitz functions, as well as for non-Lipschitz dispersed functions, i.e. functions whose non-Lipschitzness may not concentrate in a small region of the action space. We also consider extensions to partial feedback settings, and again obtain sublinear internal regret. Finally, we discuss applications of internal regret minimization over continuous spaces to correlated equilibria in pricing problems and auction design, as well as to data-driven hyper-parameter tuning.
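As a rough illustration of the definition in the abstract, the sketch below computes internal regret in the finite experts setting only; it is not the paper's continuous-action algorithm. The function name `internal_regret`, the loss-matrix layout, and the toy data are assumptions introduced here for illustration.

```python
from typing import Sequence


def internal_regret(
    actions: Sequence[int],
    losses: Sequence[Sequence[float]],
    n_actions: int,
) -> float:
    """Largest total loss saved by swapping every play of action i for action j.

    actions[t] is the action played in round t (hypothetical encoding: 0..n_actions-1).
    losses[t][a] is the loss action a would have incurred in round t.
    """
    best = 0.0  # leaving the sequence unchanged always gives a gain of zero
    for i in range(n_actions):
        # Rounds in which the learner actually played action i.
        rounds = [t for t, a in enumerate(actions) if a == i]
        for j in range(n_actions):
            if j == i:
                continue
            # Loss saved by playing j instead of i on exactly those rounds.
            gain = sum(losses[t][i] - losses[t][j] for t in rounds)
            best = max(best, gain)
    return best


# Toy data (made up): 4 rounds, 3 actions.
actions = [0, 1, 0, 2]
losses = [
    [1.0, 0.0, 0.5],
    [0.2, 0.4, 0.9],
    [0.8, 0.1, 0.3],
    [0.5, 0.5, 0.0],
]
print(internal_regret(actions, losses, 3))
```

On this toy data the best swap replaces action 0 with action 1, for a gain of about 1.7. The double loop over ordered pairs (i, j) is what ties this computation to a finite action set, which is the dependence the abstract says breaks down for continuous action spaces.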
Pages: 14919-14927
Page count: 9