A Drifting-Games Analysis for Online Learning and Applications to Boosting

被引:0
|
作者
Luo, Haipeng [1 ]
Schapire, Robert E. [1 ,2 ]
机构
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08540 USA
[2] Microsoft Res, New York, NY USA
基金
美国国家科学基金会;
关键词
ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We provide a general mechanism to design online learning algorithms based on a minimax analysis within a drifting-games framework. Different online learning settings (Hedge, multi-armed bandit problems and online convex optimization) are studied by converting into various kinds of drifting games. The original minimax analysis for drifting games is then used and generalized by applying a series of relaxations, starting from choosing a convex surrogate of the 0-1 loss function. With different choices of surrogates, we not only recover existing algorithms, but also propose new algorithms that are totally parameter-free and enjoy other useful properties. Moreover, our drifting-games framework naturally allows us to study high probability bounds without resorting to any concentration results, and also a generalized notion of regret that measures how good the algorithm is compared to all but the top small fraction of candidates. Finally, we translate our new Hedge algorithm into a new adaptive boosting algorithm that is computationally faster as shown in experiments, since it ignores a large number of examples on each round.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Learning with continuous experts using drifting games
    Mukherjee, Indraneel
    Schapire, Robert E.
    [J]. THEORETICAL COMPUTER SCIENCE, 2010, 411 (29-30) : 2670 - 2683
  • [2] Learning with Continuous Experts Using Drifting Games
    Mukherjee, Indraneel
    Schapire, Robert E.
    [J]. ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2008, 5254 : 240 - 255
  • [3] Online Active Learning for Drifting Data Streams
    Liu, Sanmin
    Xue, Shan
    Wu, Jia
    Zhou, Chuan
    Yang, Jian
    Li, Zhao
    Cao, Jie
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) : 186 - 200
  • [4] Games and simulations in online learning
    Battersby, Diana
    [J]. BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2008, 39 (06) : 1136 - 1136
  • [5] Supporting Online learning with games
    Yao, JingTao
    Kim, DongWon
    Herbert, Joseph P.
    [J]. DATA MINING, INTRUSION DETECTION, INFORMATION ASSURANCE, AND DATA NETWORKS SECURITY 2007, 2007, 6570
  • [6] Games and simulations in online learning
    Deeson, Eric
    [J]. BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2007, 38 (03) : 544 - 545
  • [7] A Boosting Framework on Grounds of Online Learning
    Naghibi, Tofigh
    Pfister, Beat
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [8] Online Reinforcement Learning in Stochastic Games
    Wei, Chen-Yu
    Hong, Yi-Te
    Lu, Chi-Jen
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [9] Online Learning in Unknown Markov Games
    Tian, Yi
    Wang, Yuanhao
    Yu, Tiancheng
    Sra, Suvrit
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7290 - 7300
  • [10] Statistical mechanics of online learning of drifting concepts: A variational approach
    Vicente, R
    Kinouchi, O
    Caticha, N
    [J]. MACHINE LEARNING, 1998, 32 (02) : 179 - 201