Learning from crowds with robust logistic regression

被引:1
|
作者
Li, Wenbin [1 ]
Li, Chaoqun [1 ,2 ]
Jiang, Liangxiao [3 ]
机构
[1] China Univ Geosci, Sch Math & Phys, Wuhan 430074, Peoples R China
[2] Minist Educ, Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[3] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Crowdsourcing learning; Ground truth inference; Logistic regression; Robust classifiers; TOOL;
D O I
10.1016/j.ins.2023.119010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Crowdsourcing systems provide an easy way to obtain labels for data. Each instance in data will usually be labeled by multiple crowd labelers who are not experts. Thus, it is very important to design considerate ground truth inference algorithms to infer integrated labels from multiple crowd labels. While almost all ground truth inference algorithms show good performance when the number of crowd labels is large, few algorithms can perform well with few crowd labels. This paper considers how to deal with noise in multiple crowd labels as a key to good ground truth inference. This paper solves ground truth inference using robust classifiers. This paper proposes two versions of ground truth inference algorithm based on robust logistic regression to solve the following two problems: (1) how to embed noise level into the loss function of logistic regression and (2) how to estimate the parameters that model noise level in the crowdsourcing scenario. We call our algorithms robust logistic regression inference (RLRI). By employing the idea of robust classifiers, RLRI can still perform well in the case of a small number of labels. We also theoretically compare the advantages and disadvantages of the two versions of RLRI. Finally, the performance of our algorithms is verified on benchmark and real-world datasets.
引用
下载
收藏
页数:15
相关论文
共 50 条
  • [31] Outliers and Robust Logistic Regression in Health Sciences
    Cutanda Henriquez, Francisco
    REVISTA ESPANOLA DE SALUD PUBLICA, 2008, 82 (06): : 617 - 625
  • [32] Robust Multinomial Logistic Regression Based on RPCA
    Yin, Ming
    Zeng, Deyu
    Gao, Junbin
    Wu, Zongze
    Xie, Shengli
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2018, 12 (06) : 1144 - 1154
  • [33] Logistic Regression for Transductive Transfer Learning from Multiple Sources
    Zhang, Yuhong
    Hu, Xuegang
    Fang, Yucheng
    ADVANCED DATA MINING AND APPLICATIONS (ADMA 2010), PT II, 2010, 6441 : 175 - 182
  • [34] TRANSFER LEARNING BASED ON LOGISTIC REGRESSION
    Paul, A.
    Rottensteiner, F.
    Heipke, C.
    ISPRS GEOSPATIAL WEEK 2015, 2015, 40-3 (W3): : 145 - 152
  • [35] Active learning for logistic regression: an evaluation
    Schein, Andrew I.
    Ungar, Lyle H.
    MACHINE LEARNING, 2007, 68 (03) : 235 - 265
  • [36] Active learning for logistic regression: an evaluation
    Andrew I. Schein
    Lyle H. Ungar
    Machine Learning, 2007, 68 : 235 - 265
  • [37] Robust Logistic Regression for Graduate's Employability from Public Universities in Malaysia
    Mohamed, Tengku Salbiah Tengku
    Lee, Muhammad Hisyam
    MATEMATIKA, 2021, 37 (01) : 33 - 43
  • [38] Adversarial Learning from Crowds
    Chen, Pengpeng
    Sun, Hailong
    Yang, Yongqiang
    Chen, Zhijun
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5304 - 5312
  • [39] Ensemble Learning from Crowds
    Zhang, Jing
    Wu, Ming
    Sheng, Victor S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (08) : 1506 - 1519
  • [40] Deep Learning from Crowds
    Rodrigues, Filipe
    Pereira, Francisco C.
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 1611 - 1618