A Drifting-Games Analysis for Online Learning and Applications to Boosting

Cited by: 0

Authors:

Luo, Haipeng [1]

Schapire, Robert E. [1,2]

Affiliations:

[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08540 USA

[2] Microsoft Res, New York, NY USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014) | 2014 / Vol. 27

Funding:

National Science Foundation (USA);

Keywords:

ALGORITHMS;

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We provide a general mechanism to design online learning algorithms based on a minimax analysis within a drifting-games framework. Different online learning settings (Hedge, multi-armed bandit problems and online convex optimization) are studied by converting into various kinds of drifting games. The original minimax analysis for drifting games is then used and generalized by applying a series of relaxations, starting from choosing a convex surrogate of the 0-1 loss function. With different choices of surrogates, we not only recover existing algorithms, but also propose new algorithms that are totally parameter-free and enjoy other useful properties. Moreover, our drifting-games framework naturally allows us to study high probability bounds without resorting to any concentration results, and also a generalized notion of regret that measures how good the algorithm is compared to all but the top small fraction of candidates. Finally, we translate our new Hedge algorithm into a new adaptive boosting algorithm that is computationally faster as shown in experiments, since it ignores a large number of examples on each round.
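For orientation, the Hedge setting named in the abstract can be illustrated with the classical exponential-weights learner. The sketch below is that standard baseline, not the parameter-free algorithm the paper proposes: it still requires a learning rate, and the names `hedge`, `loss_rounds`, and `eta` are assumptions introduced here for illustration. Losses are assumed to lie in [0, 1].

```python
import math


def hedge(loss_rounds, eta):
    """Classical exponential-weights Hedge (illustrative baseline only).

    loss_rounds: list of per-round loss vectors, one entry per expert, in [0, 1].
    eta: learning rate; a tuning parameter assumed for this sketch.
    Returns the last distribution played and the regret to the best expert.
    """
    n = len(loss_rounds[0])
    cum_loss = [0.0] * n      # cumulative loss of each expert
    learner_loss = 0.0        # cumulative expected loss of the learner
    p = [1.0 / n] * n
    for losses in loss_rounds:
        # Weights are proportional to exp(-eta * cumulative loss);
        # subtracting the minimum keeps the exponentials numerically stable.
        m = min(cum_loss)
        w = [math.exp(-eta * (c - m)) for c in cum_loss]
        total = sum(w)
        p = [x / total for x in w]
        # The learner suffers the expected loss under p, then observes all losses.
        learner_loss += sum(pi * li for pi, li in zip(p, losses))
        cum_loss = [c + l for c, l in zip(cum_loss, losses)]
    regret = learner_loss - min(cum_loss)
    return p, regret


# Hypothetical usage: expert 0 never errs, expert 1 always errs.
rounds = [[0.0, 1.0]] * 50
p, regret = hedge(rounds, eta=0.5)
```

With losses in [0, 1], the regret of this baseline is at most ln(N)/eta + eta*T/8, which is why eta normally has to be tuned to the horizon T. The abstract's claim of parameter-free algorithms refers to removing that dependence.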


Pages: 9

