Easy Learning from Label Proportions

Authors
Busa-Fekete, Robert [1]
Choi, Heejin [2, 3]
Dick, Travis [1]
Gentile, Claudio [1]
Medina, Andres Munoz [1]
Affiliations
[1] Google Research, Mountain View, CA 94043 USA
[2] Coupang Inc, Seattle, WA USA
[3] Google, Mountain View, CA USA
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We consider the problem of Learning from Label Proportions (LLP), a weakly supervised classification setup in which instances are grouped into i.i.d. "bags" and only the frequency of class labels in each bag is available. The learner's objective, nevertheless, is to achieve low task loss at the level of individual instances. We propose EASYLLP, a flexible and simple-to-implement debiasing approach based on aggregate labels that works with arbitrary loss functions. The technique yields an accurate estimate of the expected loss of an arbitrary model at the individual-instance level. We explain how our method differs from standard methods based on label proportion matching, in terms of both applicability and optimality conditions. We demonstrate the flexibility of our approach relative to alternatives by applying it to popular learning frameworks, such as Empirical Risk Minimization (ERM) and Stochastic Gradient Descent (SGD), with provable guarantees on instance-level performance. Finally, we validate our theoretical results on multiple datasets, empirically illustrating the conditions under which our algorithm is expected to perform better or worse than previous LLP approaches.
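The debiasing idea in the abstract can be illustrated with a minimal sketch. It assumes binary labels, bags of fixed size k whose labels are drawn i.i.d., and a known class prior p. The surrogate label k(α − p) + p, where α is the bag's label proportion, is an assumption made here for illustration and is not quoted from the abstract. The function names are hypothetical. The helper `expected_soft_label_given_y0` enumerates the other k − 1 labels to check that the surrogate's expectation equals the instance's true label, which is what makes the resulting loss estimate unbiased for any loss.

```python
import itertools

import numpy as np


def soft_label(alpha, k, p):
    """Debiased surrogate for one instance's label (illustrative assumption).

    alpha: fraction of positive labels in the bag
    k:     bag size
    p:     class prior P(y = 1), assumed known
    """
    return k * (alpha - p) + p


def debiased_loss(loss_pos, loss_neg, alpha, k, p):
    """Per-instance loss estimate that uses only the bag proportion.

    loss_pos / loss_neg: the model's loss on the instance if its label
    were 1 / 0. Works for any loss because the estimate is linear in
    the surrogate label.
    """
    y_tilde = soft_label(alpha, k, p)
    return y_tilde * loss_pos + (1.0 - y_tilde) * loss_neg


def expected_soft_label_given_y0(y0, k, p):
    """Exact expectation of the surrogate given the instance's true label y0.

    Enumerates every assignment of the other k - 1 labels, each drawn
    independently from Bernoulli(p).
    """
    total = 0.0
    for rest in itertools.product([0, 1], repeat=k - 1):
        prob = np.prod([p if y == 1 else 1.0 - p for y in rest])
        alpha = (y0 + sum(rest)) / k
        total += prob * soft_label(alpha, k, p)
    return total
```

Under these assumptions, `expected_soft_label_given_y0(1, 4, 0.3)` and `expected_soft_label_given_y0(0, 4, 0.3)` evaluate to the true labels 1 and 0 up to floating-point error, so averaging `debiased_loss` over many bags estimates the instance-level expected loss. A single surrogate label can fall outside [0, 1]; only its expectation matches the true label.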
Pages: 12