Analysis of Minimax Error Rate for Crowdsourcing and Its Application to Worker Clustering Model

Authors
Imamura, Hideaki [1, 2]
Sato, Issei [1, 2]
Sugiyama, Masashi [1, 2]
Affiliations
[1] Univ Tokyo, Tokyo, Japan
[2] RIKEN, Tokyo, Japan
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While crowdsourcing has become an important means to label data, there is great interest in estimating the ground truth from unreliable labels produced by crowdworkers. The Dawid and Skene (DS) model is one of the most well-known models in the study of crowdsourcing. Despite its practical popularity, theoretical error analysis for the DS model has been conducted only under restrictive assumptions on class priors, confusion matrices, or the number of labels each worker provides. In this paper, we derive a minimax error rate under a more practical setting for a broader class of crowdsourcing models that includes the DS model as a special case. We further propose the worker clustering model, which is more practical than the DS model under real crowdsourcing settings. The wide applicability of our theoretical analysis allows us to immediately investigate the behavior of this proposed model, which cannot be analyzed by existing studies. Experimental results showed that there is a strong similarity between the lower bound of the minimax error rate derived by our theoretical analysis and the empirical error of the estimated value.
Pages: 10