A Crowdsourcing Worker Quality Evaluation Algorithm on MapReduce for Big Data Applications

被引:16
|
作者
Dang, Depeng [1 ]
Liu, Ying [1 ]
Zhang, Xiaoran [1 ]
Huang, Shihang [1 ]
机构
[1] Beijing Normal Univ, Coll Informat Sci & Technol, Beijing 100875, Peoples R China
基金
国家教育部科学基金资助; 中国国家自然科学基金;
关键词
Crowdsourcing systems; quality control; big data; mapreduce; hadoop; SYSTEMS;
D O I
10.1109/TPDS.2015.2457924
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Crowdsourcing is a new emerging distributed computing and business model on the backdrop of Internet blossoming. With the development of crowdsourcing systems, the data size of crowdsourcers, contractors and tasks grows rapidly. The worker quality evaluation based on big data analysis technology has become a critical challenge. This paper first proposes a general worker quality evaluation algorithm that is applied to any critical tasks such as tagging, matching, filtering, categorization and many other emerging applications, without wasting resources. Second, we realize the evaluation algorithm in the Hadoop platform using the MapReduce parallel programming model. Finally, to effectively verify the accuracy and the effectiveness of the algorithm in a wide variety of big data scenarios, we conduct a series of experiments. The experimental results demonstrate that the proposed algorithm is accurate and effective. It has high computing performance and horizontal scalability. And it is suitable for large-scale worker quality evaluations in a big data environment.
引用
收藏
页码:1879 / 1888
页数:10
相关论文
共 50 条
  • [21] A Performance Analysis of MapReduce Applications on Big Data in Cloud based Hadoop
    Gohil, Parth
    Garg, Dweepna
    Panchal, Bakul
    [J]. 2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014,
  • [22] Energy-Aware Scheduling of MapReduce Jobs for Big Data Applications
    Mashayekhy, Lena
    Nejad, Mahyar Movahed
    Grosu, Daniel
    Zhang, Quan
    Shi, Weisong
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (10) : 2720 - 2733
  • [23] On Traffic-Aware Partition and Aggregation in MapReduce for Big Data Applications
    Ke, Huan
    Li, Peng
    Guo, Song
    Guo, Minyi
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (03) : 818 - 828
  • [24] Data Quality Management for Big Data Applications
    Khaleel, Majida Yaseen
    Hamad, Murtadha M.
    [J]. 12TH INTERNATIONAL CONFERENCE ON THE DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2019), 2019, : 357 - 362
  • [25] Dache: A Data Aware Caching for Big-Data Applications Using The MapReduce Framework
    Zhao, Yaxiong
    Wu, Jie
    [J]. 2013 PROCEEDINGS IEEE INFOCOM, 2013, : 35 - 39
  • [26] Dache: A Data Aware Caching for Big-Data Applications Using the MapReduce Framework
    Yaxiong Zhao
    Jie Wu
    Cong Liu
    [J]. Tsinghua Science and Technology, 2014, 19 (01) : 39 - 50
  • [27] Dache: A Data Aware Caching for Big-Data Applications Using the MapReduce Framework
    Zhao, Yaxiong
    Wu, Jie
    Liu, Cong
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2014, 19 (01) : 39 - 50
  • [28] High Quality Participant Recruitment of Mobile Crowdsourcing over Big Data
    Li, Shu
    Zhang, Jie
    Xie, Dongqing
    Yu, Shui
    Dou, Wanchun
    [J]. 2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [29] A Distributed Load Balance Algorithm of MapReduce for Data Quality Detection
    Gao, Yitong
    Zhang, Yan
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2016, 2016, 9645 : 294 - 306
  • [30] In-Mapper combiner based MapReduce algorithm for processing of big climate data
    Manogaran, Gunasekaran
    Lopez, Daphne
    Chilamkurti, Naveen
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 86 : 433 - 445