Analysis of Minimax Error Rate for Crowdsourcing and Its Application to Worker Clustering Model

被引:0
|
作者
Imamura, Hideaki [1 ,2 ]
Sato, Issei [1 ,2 ]
Sugiyama, Masashi [1 ,2 ]
机构
[1] Univ Tokyo, Tokyo, Japan
[2] RIKEN, Tokyo, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While crowdsourcing has become an important means to label data, there is great interest in estimating the ground truth from unreliable labels produced by crowdworkers. The Dawid and Skene (DS) model is one of the most well-known models in the study of crowdsourcing. Despite its practical popularity, theoretical error analysis for the DS model has been conducted only under restrictive assumptions on class priors, confusion matrices, or the number of labels each worker provides. In this paper, we derive a minimax error rate under more practical setting for a broader class of crowdsourcing models including the DS model as a special case. We further propose the worker clustering model, which is more practical than the DS model under real crowdsourcing settings. The wide applicability of our theoretical analysis allows us to immediately investigate the behavior of this proposed model, which can not be analyzed by existing studies. Experimental results showed that there is a strong similarity between the lower bound of the minimax error rate derived by our theoretical analysis and the empirical error of the estimated value.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Asymmetric aggregation operator and its application to fuzzy clustering model
    Sato-Ilic, M
    Sato, Y
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2000, 32 (3-4) : 379 - 394
  • [22] General c-means clustering model and its application
    Yu, J
    2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2003, : 122 - 127
  • [23] Feature proposal model on multidimensional data clustering and its application
    Li, Xi
    Ma, Huimin
    Wang, Xiang
    PATTERN RECOGNITION LETTERS, 2018, 112 : 41 - 48
  • [24] Contour error vector model and its application to CNC systems
    Wang, Bao-Ren
    Wang, Jie
    Zhang, Cheng-Rui
    Wu, Hong-En
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2010, 16 (07): : 1401 - 1407
  • [25] Panel Data Clustering and its Application to Discount Rate of B Stock in China
    Zheng, Tao
    Zhu, Donghua
    Wang, Xuefeng
    Yu, Bo
    ICIC 2009: SECOND INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTING SCIENCE, VOL 1, PROCEEDINGS: COMPUTING SCIENCE AND ITS APPLICATION, 2009, : 163 - 166
  • [26] Application of Error Analysis Method in the Complex Vehicle Model
    Liao, Shui Rong
    Yang, Tao
    MANUFACTURING ENGINEERING AND AUTOMATION II, PTS 1-3, 2012, 591-593 : 584 - +
  • [27] REDUCED MODEL ERROR ANALYSIS APPLICATION TO SYNCHRONOUS MACHINES
    DERBEL, N
    KAMOUN, MBA
    POLOUJADOFF, M
    JOURNAL DE PHYSIQUE III, 1994, 4 (10): : 1999 - 2012
  • [28] The multiplicative model in time series and GARCH error amending model and its application
    Yang, S.-D. (Yangshangdong2011@126.com), 2013, Hunan University (40):
  • [29] A New Estimator of the Mahalanobis Distance and its Application to Classification Error Rate Estimation
    Gvardinskas, Mindaugas
    INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2016, 2016, 639 : 319 - 331
  • [30] Regularized matrix data clustering and its application to image analysis
    Gao, Xu
    Shen, Weining
    Zhang, Liwen
    Hu, Jianhua
    Fortin, Norbert J.
    Frostig, Ron D.
    Ombao, Hernando
    BIOMETRICS, 2021, 77 (03) : 890 - 902