On the Complexity of Learning from Label Proportions

被引:0
|
作者
Fish, Benjamin [1 ]
Reyzin, Lev [1 ]
机构
[1] Univ Illinois, Dept Math Stat & Comp Sci, Chicago, IL 60022 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the problem of learning with label proportions (also known as the problem of estimating class ratios), the training data is unlabeled, and only the proportions of examples receiving each label are given. The goal is to learn a hypothesis that predicts the proportions of labels on the distribution underlying the sample. This model of learning is useful in a wide variety of settings, including predicting the number of votes for candidates in political elections from polls. In this paper, we resolve foundational questions regarding the computational complexity of learning in this setting. We formalize a simple version of the setting, and we compare the computational complexity of learning in this model to classical PAC learning. Perhaps surprisingly, we show that what can be learned efficiently in this model is a strict subset of what may be leaned efficiently in PAC, under standard complexity assumptions. We give a characterization in terms of VC dimension, and we show that there are non-trivial problems in this model that can be efficiently learned. We also give an algorithm that demonstrates the feasibility of learning under well-behaved distributions.
引用
收藏
页码:1675 / 1681
页数:7
相关论文
共 50 条
  • [1] Learning from Label Proportions by Learning with Label Noise
    Zhang, Jianxin
    Wang, Yutong
    Scott, Clayton
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [2] Easy Learning from Label Proportions
    Busa-Fekete, Robert
    Choi, Heejin
    Dick, Travis
    Gentile, Claudio
    Medina, Andres Munoz
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] A framework for evaluation in learning from label proportions
    Jerónimo Hernández-González
    [J]. Progress in Artificial Intelligence, 2019, 8 : 359 - 373
  • [4] Learning from label proportions with pinball loss
    Yong Shi
    Limeng Cui
    Zhensong Chen
    Zhiquan Qi
    [J]. International Journal of Machine Learning and Cybernetics, 2019, 10 : 187 - 205
  • [5] Learning from label proportions with pinball loss
    Shi, Yong
    Cui, Limeng
    Chen, Zhensong
    Qi, Zhiquan
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (01) : 187 - 205
  • [6] Laplacian SVM for Learning from Label Proportions
    Cui, Limeng
    Chen, Zhensong
    Meng, Fan
    Shi, Yong
    [J]. 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 847 - 852
  • [7] Differentially Private Learning from Label Proportions
    Sachweh, Timon
    Boiar, Daniel
    Liebig, Thomas
    [J]. MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021, PT I, 2021, 1524 : 119 - 127
  • [8] A framework for evaluation in learning from label proportions
    Hernandez-Gonzalez, Jeronimo
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, 2019, 8 (03) : 359 - 373
  • [9] Learning a generative classifier from label proportions
    Fan, Kai
    Zhang, Hongyi
    Yan, Songbai
    Wang, Liwei
    Zhang, Wensheng
    Feng, Jufu
    [J]. NEUROCOMPUTING, 2014, 139 : 47 - 55
  • [10] LABEL PROPAGATION FOR LEARNING WITH LABEL PROPORTIONS
    Poyiadzi, Rafael
    Santos-Rodriguez, Raul
    Twomey, Niall
    [J]. 2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,