Analysis of Minimax Error Rate for Crowdsourcing and Its Application to Worker Clustering Model

被引:0
|
作者
Imamura, Hideaki [1 ,2 ]
Sato, Issei [1 ,2 ]
Sugiyama, Masashi [1 ,2 ]
机构
[1] Univ Tokyo, Tokyo, Japan
[2] RIKEN, Tokyo, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While crowdsourcing has become an important means to label data, there is great interest in estimating the ground truth from unreliable labels produced by crowdworkers. The Dawid and Skene (DS) model is one of the most well-known models in the study of crowdsourcing. Despite its practical popularity, theoretical error analysis for the DS model has been conducted only under restrictive assumptions on class priors, confusion matrices, or the number of labels each worker provides. In this paper, we derive a minimax error rate under more practical setting for a broader class of crowdsourcing models including the DS model as a special case. We further propose the worker clustering model, which is more practical than the DS model under real crowdsourcing settings. The wide applicability of our theoretical analysis allows us to immediately investigate the behavior of this proposed model, which can not be analyzed by existing studies. Experimental results showed that there is a strong similarity between the lower bound of the minimax error rate derived by our theoretical analysis and the empirical error of the estimated value.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] A fuzzy minimax clustering model and its applications
    Li, Xiang
    Wong, Hau-San
    Wu, Si
    INFORMATION SCIENCES, 2012, 186 (01) : 114 - 125
  • [2] Error analysis of borehole gas flow model and its application
    Qin, Yueping
    Liu, Jia
    Gao, Yu
    Duan, Wenpeng
    Caikuang yu Anquan Gongcheng Xuebao/Journal of Mining and Safety Engineering, 2021, 38 (06): : 1259 - 1268
  • [3] Minimax estimation with thresholding and its application to wavelet analysis
    Zhou, HH
    Hwang, JTG
    ANNALS OF STATISTICS, 2005, 33 (01): : 101 - 125
  • [4] A clustering cure rate model with application to a sealantstudy
    Gallardo, Diego I.
    Bolfarine, Heleno
    Pedroso-de-Lima, Atonio Carlos
    JOURNAL OF APPLIED STATISTICS, 2017, 44 (16) : 2949 - 2962
  • [5] A Generative Clustering Ensemble Model and Its Application in IoT Data Analysis
    Du, Hangyuan
    Wang, Wenjian
    Bai, Liang
    Feng, Jinsong
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [6] Fuzzy Clustering Analysis Mathematical Model and Its Application in Teaching Evaluation
    Gao, Shanshan
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT AND INFORMATION TECHNOLOGY, 2015, 35 : 625 - 630
  • [7] Biased minimax probability model and its application in prediction of gasoline properties
    He K.-X.
    Liu J.-J.
    Wang X.-B.
    Su Z.-Y.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2020, 37 (08): : 1799 - 1807
  • [8] Combination clustering analysis method and its application
    Liu, Yang
    Li, Qin-Liang
    Dong, Li-Yuan
    Wen, Bang-Chun
    Journal of Applied Sciences, 2013, 13 (08) : 1251 - 1255
  • [9] Neural network model based on fuzzy clustering and its application in the analysis of seepage flow
    Zhang, Qianfei
    Xu, Hongzhong
    Wu, Zhongru
    Wang, Yuguo
    Gao, Mingjun
    Shuili Fadian Xuebao/Journal of Hydroelectric Engineering, 2002, (02):
  • [10] Model clustering and its application to water quality monitoring
    Zhu, Rong
    El-Shaarawi, Abdel H.
    ENVIRONMETRICS, 2009, 20 (02) : 190 - 205