Optimal Adversarial Strategies in Learning with Expert Advice

被引:0
|
作者
Truong, Anh [1 ]
Kiyavash, Negar [1 ]
机构
[1] Univ Illinois, Dept Ind & Enterprise Syst Engn, Champaign, IL 61820 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose an adversarial setting for the framework of learning with expert advice in which one of the experts has the intention to compromise the recommendation system by providing wrong recommendations. The problem is formulated as a Markov Decision Process (MDP) and solved by dynamic programming. Somewhat surprisingly, we prove that, in the case of logarithmic loss, the optimal strategy for the malicious expert is the greedy policy of lying at every step. Furthermore, a sufficient condition on the loss function is provided that guarantees the optimality of the greedy policy. Our experimental results, however, show that the condition is not necessary since the greedy policy is also optimal when the square loss is used, even though the square loss does not satisfy the condition. Moreover, the experimental results suggest that, for absolute loss, the optimal policy is a threshold one.
引用
收藏
页码:7315 / 7320
页数:6
相关论文
共 50 条
  • [1] Optimal Attack Strategies Against Predictors - Learning From Expert Advice
    Anh Truong
    Etesami, S. Rasoul
    Etesami, Jalal
    Kiyavash, Negar
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2018, 13 (01) : 6 - 19
  • [2] Learning with expert advice
    Molnar, Krisztina
    JOURNAL OF THE EUROPEAN ECONOMIC ASSOCIATION, 2007, 5 (2-3) : 420 - 432
  • [3] Optimal Adversarial Policies in the Multiplicative Learning System With a Malicious Expert
    Etesami, S. Rasoul
    Kiyavash, Negar
    Leon, Vincent
    Poor, H. Vincent
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 2276 - 2287
  • [4] Multitask learning with expert advice
    Abernethy, Jacob
    Bartlett, Peter
    Rakhlin, Alexander
    LEARNING THEORY, PROCEEDINGS, 2007, 4539 : 484 - +
  • [5] Optimal discovery with probabilistic expert advice
    Bubeck, Sebastien
    Ernst, Damien
    Garivier, Aurelien
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 6808 - 6812
  • [6] Learning the Learning Rate for Prediction with Expert Advice
    Koolen, Wouter M.
    van Erven, Tim
    Grunwald, Peter D.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [7] Selective Labeling in Learning with Expert Advice
    Truong, Anh
    Etesami, S. Rasoul
    Kiyavash, Negar
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 2537 - 2542
  • [8] Online Active Learning with Expert Advice
    Hao, Shuji
    Hu, Peiying
    Zhao, Peilin
    Hoi, Steven C. H.
    Miao, Chunyan
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2018, 12 (05)
  • [9] Query Strategies, Assemble! Active Learning with Expert Advice for Low-resource Natural Language Processing
    Mendonca, Vania
    Sardinha, Alberto
    Coheur, Luisa
    Santos, Ana Lucia
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [10] Optimal Prediction Using Expert Advice and Randomized Littlestone Dimension
    Filmus, Yuval
    Hanneke, Steve
    Mehalel, Idan
    Moran, Shay
    THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195 : 773 - 836