CrowdLink: Crowdsourcing for Large-Scale Linked Data Management

被引:2
|
作者
Basharat, Amna [1 ]
Arpinar, I. Budak [1 ]
Dastgheib, Shima [1 ]
Kursuncu, Ugur [1 ]
Kochut, Krys [1 ]
Dogdu, Erdogan [2 ]
机构
[1] Univ Georgia, Dept Comp Sci, Large Scale Distributed Informat Syst LSDIS Lab, Athens, GA 30602 USA
[2] TOBB Univ Econ & Technol, Dept Comp Engn, TR-06560 Ankara, Turkey
关键词
Crowdsourcing; Semantic Web; Workflow Management; Linking Open Data (LOD); Human Intelligent Task; Ontology Verification and Entity Disambiguation;
D O I
10.1109/ICSC.2014.14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowdsourcing is an emerging paradigm to exploit the notion of human-computation for solving various computational problems, which cannot be accurately solved solely by the machine-based solutions. We use crowdsourcing for large-scale link management in the Semantic Web. More specifically, we develop CrowdLink, which utilizes crowdworkers for verification and creation of triples in Linking Open Data (LOD). LOD incorporates the core data sets in the Semantic Web, yet is not in full conformance with the guidelines for publishing high quality linked data on the Web. Our approach can help in enriching and improving quality of mission-critical links in LOD. Scalable LOD link management requires a hybrid approach, where human intelligent and machine intelligent tasks interleave in a workflow execution. Likewise, many other crowdsourcing applications require a sophisticated workflow specification not only on human intelligent tasks, but also machine intelligent tasks to handle data and control-flow, which is strictly deficient in the existing crowdsourcing platforms. Hence, we are strongly motivated to investigate the interplay of crowdsourcing, and semantically enriched workflows as well as human-machine cooperation in task completion. We demonstrate usefulness of our approach through various link creation and verification tasks, and workflows using Amazon Mechanical Turk. Experimental evaluation demonstrates promising results in terms of accuracy of the links created, and verified by the crowdworkers
引用
收藏
页码:227 / 234
页数:8
相关论文
共 50 条
  • [1] Large-scale linked data integration using probabilistic reasoning and crowdsourcing
    Demartini, Gianluca
    Difallah, Djellel Eddine
    Cudre-Mauroux, Philippe
    [J]. VLDB JOURNAL, 2013, 22 (05): : 665 - 687
  • [2] Large-scale linked data integration using probabilistic reasoning and crowdsourcing
    Gianluca Demartini
    Djellel Eddine Difallah
    Philippe Cudré-Mauroux
    [J]. The VLDB Journal, 2013, 22 : 665 - 687
  • [3] Large-Scale Crowdsourcing by Vehicular Data Packets in a Sparse Roadside Infrastructure
    Zhanikeev, Marat
    [J]. 2017 IEEE 10TH CONFERENCE ON SERVICE-ORIENTED COMPUTING AND APPLICATIONS (SOCA), 2017, : 66 - 72
  • [4] Anytime Large-Scale Analytics of Linked Open Data
    Soulet, Arnaud
    Suchanek, Fabian M.
    [J]. SEMANTIC WEB - ISWC 2019, PT I, 2019, 11778 : 576 - 592
  • [5] Visualizing Large-scale Linked Data with Memo Graph
    Ghorbel, Fatma
    Hamdi, Faycal
    Ellouze, Nebrasse
    Metais, Elisabeth
    Gargouri, Faiez
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS, 2017, 112 : 854 - 863
  • [6] Discovering Missing Links in Large-Scale Linked Data
    Nam Hau
    Ichise, Ryutaro
    Le, Bac
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 468 - 477
  • [7] Large-scale Semantic Integration of Linked Data: A Survey
    Mountantonakis, Michalis
    Tzitzikas, Yannis
    [J]. ACM COMPUTING SURVEYS, 2019, 52 (05)
  • [8] Hierarchical Management of Large-Scale Malware Data
    Kellogg, Lee
    Ruttenberg, Brian
    O'Connor, Alison
    Howard, Michael
    Pfeffer, Avi
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014, : 666 - 674
  • [9] Large-scale Data Collection and Analysis via a Gamified Intelligent Crowdsourcing Platform
    Simone Hantke
    Tobias Olenyi
    Christoph Hausner
    Tobias Appel
    Bj?rn Schuller
    [J]. International Journal of Automation and Computing, 2019, (04) : 427 - 436
  • [10] Large-scale Data Collection and Analysis via a Gamified Intelligent Crowdsourcing Platform
    Simone Hantke
    Tobias Olenyi
    Christoph Hausner
    Tobias Appel
    Björn Schuller
    [J]. International Journal of Automation and Computing, 2019, 16 : 427 - 436