CrowdLink: Crowdsourcing for Large-Scale Linked Data Management

被引:2
|
作者
Basharat, Amna [1 ]
Arpinar, I. Budak [1 ]
Dastgheib, Shima [1 ]
Kursuncu, Ugur [1 ]
Kochut, Krys [1 ]
Dogdu, Erdogan [2 ]
机构
[1] Univ Georgia, Dept Comp Sci, Large Scale Distributed Informat Syst LSDIS Lab, Athens, GA 30602 USA
[2] TOBB Univ Econ & Technol, Dept Comp Engn, TR-06560 Ankara, Turkey
关键词
Crowdsourcing; Semantic Web; Workflow Management; Linking Open Data (LOD); Human Intelligent Task; Ontology Verification and Entity Disambiguation;
D O I
10.1109/ICSC.2014.14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowdsourcing is an emerging paradigm to exploit the notion of human-computation for solving various computational problems, which cannot be accurately solved solely by the machine-based solutions. We use crowdsourcing for large-scale link management in the Semantic Web. More specifically, we develop CrowdLink, which utilizes crowdworkers for verification and creation of triples in Linking Open Data (LOD). LOD incorporates the core data sets in the Semantic Web, yet is not in full conformance with the guidelines for publishing high quality linked data on the Web. Our approach can help in enriching and improving quality of mission-critical links in LOD. Scalable LOD link management requires a hybrid approach, where human intelligent and machine intelligent tasks interleave in a workflow execution. Likewise, many other crowdsourcing applications require a sophisticated workflow specification not only on human intelligent tasks, but also machine intelligent tasks to handle data and control-flow, which is strictly deficient in the existing crowdsourcing platforms. Hence, we are strongly motivated to investigate the interplay of crowdsourcing, and semantically enriched workflows as well as human-machine cooperation in task completion. We demonstrate usefulness of our approach through various link creation and verification tasks, and workflows using Amazon Mechanical Turk. Experimental evaluation demonstrates promising results in terms of accuracy of the links created, and verified by the crowdworkers
引用
收藏
页码:227 / 234
页数:8
相关论文
共 50 条
  • [31] MidSemI: A Middleware for Semantic Integration of Business Data with Large-scale Social and Linked Data
    Sellami, Samir
    Dkaki, Taoufiq
    Zarour, Nacereddine
    Charrel, Pierre-Jean
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2019, 10 (02) : 1 - 25
  • [32] Automatic large-scale data acquisition via crowdsourcing for crosswalk classification: A deep learning approach
    Berriel, Rodrigo F.
    Rossi, Franco Schmidt
    de Souza, Alberto F.
    Oliveira-Santos, Thiago
    [J]. COMPUTERS & GRAPHICS-UK, 2017, 68 : 32 - 42
  • [33] Detecting Anomaly in Large-scale Network using Mobile Crowdsourcing
    Li, Yang
    Sun, Jiachen
    Huang, Wenguang
    Tian, Xiaohua
    [J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2019), 2019, : 2179 - 2187
  • [34] Parallel and Streaming Truth Discovery in Large-Scale Quantitative Crowdsourcing
    Ouyang, Robin Wentao
    Kaplan, Lance M.
    Toniolo, Alice
    Srivastava, Mani
    Norman, Timothy J.
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (10) : 2984 - 2997
  • [35] A KL-LUCB Bandit Algorithm for Large-Scale Crowdsourcing
    Tanczos, Ervin
    Nowak, Robert
    Mankoff, Bob
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [36] Visualization and management of large-scale data on SX-6
    Kameyama, T
    Nakano, E
    Takei, T
    Yoshida, A
    Takahara, H
    [J]. NEC RESEARCH & DEVELOPMENT, 2003, 44 (01): : 95 - 98
  • [37] Guest Editorial: Large-scale Data Management for Mobile Applications
    Thierry Delot
    Sandra Geisler
    Sergio Ilarri
    Christoph Quix
    [J]. Distributed and Parallel Databases, 2016, 34 : 1 - 2
  • [38] Data Management for Large-Scale Position-Tracking Systems
    Inoue, Fumiaki
    Zhang, Yongbing
    Ji, Yusheng
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2011, E94B (01) : 45 - 54
  • [39] BASIC: an Alternative to BASE for Large-Scale Data Management System
    Wu, Lengdong
    Yuan, Li-Yan
    You, Jia-Huai
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014, : 5 - 14
  • [40] Efficient data management in a large-scale epidemiology research project
    Meyer, Jens
    Ostrzinski, Stefan
    Fredrich, Daniel
    Havemann, Christoph
    Krafczyk, Janina
    Hoffmann, Wolfgang
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2012, 107 (03) : 425 - 435