Learning from crowds with robust logistic regression

被引：1

作者：

Li, Wenbin ^{[1
]}

Li, Chaoqun ^{[1
,2
]}

Jiang, Liangxiao ^{[3
]}

机构：

[1] China Univ Geosci, Sch Math & Phys, Wuhan 430074, Peoples R China

[2] Minist Educ, Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China

[3] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China

来源：

INFORMATION SCIENCES | 2023年 / 639卷

基金：

中国国家自然科学基金;

关键词：

Crowdsourcing learning; Ground truth inference; Logistic regression; Robust classifiers; TOOL;

D O I：

10.1016/j.ins.2023.119010

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Crowdsourcing systems provide an easy way to obtain labels for data. Each instance in data will usually be labeled by multiple crowd labelers who are not experts. Thus, it is very important to design considerate ground truth inference algorithms to infer integrated labels from multiple crowd labels. While almost all ground truth inference algorithms show good performance when the number of crowd labels is large, few algorithms can perform well with few crowd labels. This paper considers how to deal with noise in multiple crowd labels as a key to good ground truth inference. This paper solves ground truth inference using robust classifiers. This paper proposes two versions of ground truth inference algorithm based on robust logistic regression to solve the following two problems: (1) how to embed noise level into the loss function of logistic regression and (2) how to estimate the parameters that model noise level in the crowdsourcing scenario. We call our algorithms robust logistic regression inference (RLRI). By employing the idea of robust classifiers, RLRI can still perform well in the case of a small number of labels. We also theoretically compare the advantages and disadvantages of the two versions of RLRI. Finally, the performance of our algorithms is verified on benchmark and real-world datasets.

引用

下载

页数：15

共 50 条

[31] Outliers and Robust Logistic Regression in Health Sciences
Cutanda Henriquez, Francisco
REVISTA ESPANOLA DE SALUD PUBLICA, 2008, 82 (06): : 617 - 625
[32] Robust Multinomial Logistic Regression Based on RPCA
Yin, Ming
Zeng, Deyu
Gao, Junbin
Wu, Zongze
Xie, Shengli
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2018, 12 (06) : 1144 - 1154
[33] Logistic Regression for Transductive Transfer Learning from Multiple Sources
Zhang, Yuhong
Hu, Xuegang
Fang, Yucheng
ADVANCED DATA MINING AND APPLICATIONS (ADMA 2010), PT II, 2010, 6441 : 175 - 182
[34] TRANSFER LEARNING BASED ON LOGISTIC REGRESSION
Paul, A.
Rottensteiner, F.
Heipke, C.
ISPRS GEOSPATIAL WEEK 2015, 2015, 40-3 (W3): : 145 - 152
[35] Active learning for logistic regression: an evaluation
Schein, Andrew I.
Ungar, Lyle H.
MACHINE LEARNING, 2007, 68 (03) : 235 - 265
[36] Active learning for logistic regression: an evaluation
Andrew I. Schein
Lyle H. Ungar
Machine Learning, 2007, 68 : 235 - 265
[37] Robust Logistic Regression for Graduate's Employability from Public Universities in Malaysia
Mohamed, Tengku Salbiah Tengku
Lee, Muhammad Hisyam
MATEMATIKA, 2021, 37 (01) : 33 - 43
[38] Adversarial Learning from Crowds
Chen, Pengpeng
Sun, Hailong
Yang, Yongqiang
Chen, Zhijun
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5304 - 5312
[39] Ensemble Learning from Crowds
Zhang, Jing
Wu, Ming
Sheng, Victor S.
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (08) : 1506 - 1519
[40] Deep Learning from Crowds
Rodrigues, Filipe
Pereira, Francisco C.
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 1611 - 1618

← 1 2 3 4 5 →