WhoKnows? Evaluating linked data heuristics with a quiz that cleans up DBpedia

被引:31
|
作者
Waitelonis, Joerg [1 ,2 ,3 ]
Ludwig, Nadine [1 ,4 ,5 ]
Knuth, Magnus [1 ,2 ]
Sack, Harald [1 ,2 ,6 ]
机构
[1] Hasso Plattner Inst, Potsdam, Germany
[2] Univ Potsdam, Hasso Plattner Inst IT Syst Engn HPI, Potsdam, Germany
[3] Swiss Fed Inst Technol Zurich, Multimedia Proc Syst REPLAY, Zurich, Switzerland
[4] Berlin Inst Technol, MuLF Ctr, Berlin, Germany
[5] Hasso Plattner Inst Software Syst Engn, Res Grp, Semant Technol, Potsdam, Germany
[6] Friedrich Schiller Univ, Jena, Germany
关键词
Online databases; Encyclopaedias; Data management; Semantics; DBpedia; Data cleansing; Serious games; Evaluation;
D O I
10.1108/17415651111189478
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Purpose - Linking Open Data (LOD) provides a vast amount of well structured semantic information, but many inconsistencies may occur, especially if the data are generated with the help of automated methods. Data cleansing approaches enable detection of inconsistencies and overhauling of affected data sets, but they are difficult to apply automatically. The purpose of this paper is to present WhoKnows?, an online quiz that generates different kinds of questionnaires from DBpedia data sets. Design/methodology/approach - Besides its playfulness, WhoKnows? has been developed for the evaluation of property relevance ranking heuristics on DBpedia data, with the convenient side effect of detecting inconsistencies and doubtful facts. Findings - The original purpose for developing WhoKnows? was to evaluate heuristics to rank LOD properties and thus, obtain a semantic relatedness between entities according to the properties by which they are linked. The presented approach is an efficient method to detect popular properties within a limited amount of triples. Ongoing work continues in the development of sound property ranking heuristics for the purpose of detecting the most relevant characteristics of entities. Originality/value - WhoKnows? uses the approach of "Games with a Purpose" to detect inconsistencies in Linked Data and score properties to rank them for sophisticated semantic search scenarios.
引用
收藏
页码:236 / +
页数:14
相关论文
共 50 条
  • [1] Evaluating Topological Queries in Linked Data Using DBpedia and GeoNames in Switzerland and Scotland
    Grutter, Rolf
    Purves, Ross S.
    Wotruba, Lukas
    [J]. TRANSACTIONS IN GIS, 2017, 21 (01) : 114 - 133
  • [2] Ranking the Linked Data: The Case of DBpedia
    Mirizzi, Roberto
    Ragone, Azzurra
    Di Noia, Tommaso
    Di Sciascio, Eugenio
    [J]. WEB ENGINEERING, 2010, 6189 : 337 - 354
  • [3] Sustainable Linked Data Generation: The Case of DBpedia
    Maroy, Wouter
    Dimou, Anastasia
    Kontokostas, Dimitris
    De Meester, Ben
    Verborgh, Ruben
    Lehmann, Jens
    Mannens, Erik
    Hellmann, Sebastian
    [J]. SEMANTIC WEB - ISWC 2017, PT II, 2017, 10588 : 297 - 313
  • [4] Declarative Data Transformations for Linked Data Generation: The Case of DBpedia
    De Meester, Ben
    Maroy, Wouter
    Dimou, Anastasia
    Verborgh, Ruben
    Mannens, Erik
    [J]. SEMANTIC WEB, ESWC 2017, PT II, 2017, 10250 : 33 - 48
  • [5] Internationalization of Linked Data: The case of the Greek DBpedia edition
    Kontokostas, Dimitris
    Bratsas, Charalampos
    Auer, Soeren
    Hellmann, Sebastian
    Antoniou, Ioannis
    Metakides, George
    [J]. JOURNAL OF WEB SEMANTICS, 2012, 15 : 51 - 61
  • [6] Audiovisual and Linked Data: a study of the DBpedia and LMDB databases
    Simionato, Ana Carolina
    Coneglian, Caio Saraiva
    Ventura Amorim Goncalez, Paula Regina
    Santarem Segundo, Jose Eduardo
    [J]. EM QUESTAO, 2018, 24 (03): : 297 - 315
  • [7] Linked Data Quality of DBpedia, Freebase, OpenCyc, Wikidata, and YAGO
    Faerber, Michael
    Bartscherer, Frederic
    Menne, Carsten
    Rettinger, Achim
    [J]. SEMANTIC WEB, 2018, 9 (01)
  • [8] ACRyLIQ: Leveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality Assessment
    ul Hassan, Umair
    Zaveri, Amrapali
    Marx, Edgard
    Curry, Edward
    Lehmann, Jens
    [J]. KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, EKAW 2016, 2016, 10024 : 681 - 696
  • [9] Digital transmitter cleans up process data flow
    不详
    [J]. PROFESSIONAL ENGINEERING, 2000, 13 (23) : 48 - 48
  • [10] Detecting Linked Data quality issues via crowdsourcing: A DBpedia study
    Acosta, Maribel
    Zaveri, Amrapali
    Simperl, Elena
    Kontokostas, Dimitris
    Floeck, Fabian
    Lehmann, Jens
    [J]. SEMANTIC WEB, 2018, 9 (03) : 303 - 335