A Crowdsourcing Worker Quality Evaluation Algorithm on MapReduce for Big Data Applications

被引:16
|
作者
Dang, Depeng [1 ]
Liu, Ying [1 ]
Zhang, Xiaoran [1 ]
Huang, Shihang [1 ]
机构
[1] Beijing Normal Univ, Coll Informat Sci & Technol, Beijing 100875, Peoples R China
基金
国家教育部科学基金资助; 中国国家自然科学基金;
关键词
Crowdsourcing systems; quality control; big data; mapreduce; hadoop; SYSTEMS;
D O I
10.1109/TPDS.2015.2457924
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Crowdsourcing is a new emerging distributed computing and business model on the backdrop of Internet blossoming. With the development of crowdsourcing systems, the data size of crowdsourcers, contractors and tasks grows rapidly. The worker quality evaluation based on big data analysis technology has become a critical challenge. This paper first proposes a general worker quality evaluation algorithm that is applied to any critical tasks such as tagging, matching, filtering, categorization and many other emerging applications, without wasting resources. Second, we realize the evaluation algorithm in the Hadoop platform using the MapReduce parallel programming model. Finally, to effectively verify the accuracy and the effectiveness of the algorithm in a wide variety of big data scenarios, we conduct a series of experiments. The experimental results demonstrate that the proposed algorithm is accurate and effective. It has high computing performance and horizontal scalability. And it is suitable for large-scale worker quality evaluations in a big data environment.
引用
收藏
页码:1879 / 1888
页数:10
相关论文
共 50 条
  • [1] Toward Conceptual MapReduce Algorithm for Big Data Platfrom
    Sohn, Seungdae
    Kim, Jinhong
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 985 - 988
  • [2] Improving Network Traffic in MapReduce for Big Data Applications
    Gawande, Priya
    Shaikh, Nuzhaft
    [J]. 2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 2979 - 2983
  • [3] Investigation and Characterization of MapReduce Applications for Big Data Analytics
    Li, Y.
    Lam, T. B. V.
    Do, T. V. Van
    Chakka, R.
    Rotter, C.
    [J]. JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2018, 77 (09): : 493 - 498
  • [4] The Scalability of Volunteer Computing for MapReduce Big Data Applications
    Li, Wei
    Guo, William
    [J]. DATA SCIENCE, PT 1, 2017, 727 : 153 - 165
  • [5] MapReduce-based parallel GEP algorithm for efficient function mining in big data applications
    Liu, Yang
    Ma, Chenxiao
    Xu, Lixiong
    Shen, Xiaodong
    Li, Maozhen
    Li, Pengcheng
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (23):
  • [6] A Paralleled Big Data Algorithm with MapReduce Framework for Mining Twitter Data
    Li Bing
    Chan, Keith C. C.
    [J]. 2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 121 - 128
  • [7] Performance Evaluation of Big Data Frameworks: MapReduce and Spark
    Singh, Jaspreet
    Panda, S. N.
    Kaushal, Rajesh
    [J]. INTELLIGENT COMMUNICATION, CONTROL AND DEVICES, ICICCD 2017, 2018, 624 : 1611 - 1619
  • [8] PARALLEL KNOWLEDGE ACQUISITION ALGORITHM FOR BIG DATA USING MAPREDUCE
    Qian, Jin
    Xia, Min
    Lv, Ping
    [J]. PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL. 1, 2015, : 316 - 321
  • [9] Incremental attribute reduction algorithm for big data using MapReduce
    Lv, Ping
    Qian, Jin
    Yue, Xiaodong
    [J]. JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2016, 16 (03) : 641 - 652
  • [10] MapReduce based Classification for Fault Detection in Big Data Applications
    Shafiq, M. Omair
    Fekri, Maryam
    Ibrahim, Rami
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 637 - 642