Building and querying semantic layers for web archives (extended version)

被引:1
|
作者
Pavlos Fafalios
Helge Holzmann
Vaibhav Kasturia
Wolfgang Nejdl
机构
[1] Leibniz University of Hannover,L3S Research Center
关键词
Web archives; Semantic layer; Profiling; Linked data; Exploratory search;
D O I
暂无
中图分类号
学科分类号
摘要
Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles (“layers”) that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts, and events), and publishing all these data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities, and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.
引用
收藏
页码:149 / 167
页数:18
相关论文
共 50 条
  • [21] Semantic cache mechanism for heterogeneous Web querying
    Chidlovskii, B
    Roncancio, C
    Schneider, ML
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL WORLD WIDE WEB CONFERENCE, 1999, : 269 - 282
  • [22] Building Scalable Web Archives
    Medjkoune, Leila
    Barton, Stanislav
    Carpentier, Florent
    Masanes, Julien
    Pop, Radu
    ARCHIVING 2014, FINAL PROGRAM AND PROCEEDINGS, 2014, : 138 - 143
  • [23] Semantic web tool for querying gene and gene products
    Xu, Qingwei
    Lu, Qiang
    Cao, Shunliang
    Luo, Qingming
    Li, Yixue
    Gaojishu Tongxin/Chinese High Technology Letters, 2007, 17 (11): : 1169 - 1173
  • [24] Querying real world services through the Semantic Web
    Hiramatsu, K
    Akahani, J
    Satoh, T
    SEMANTIC WEB - ISWC 2004, PROCEEDINGS, 2004, 3298 : 741 - 751
  • [25] PowerAqua: Supporting users in querying and exploring the Semantic Web
    Lopez, Vanessa
    Fernandez, Miriam
    Motta, Enrico
    Stieler, Nico
    SEMANTIC WEB, 2012, 3 (03) : 249 - 265
  • [26] Querying Semantic Web resources using TRIPLE views
    Miklós, Z
    Neumann, G
    Zdun, U
    Sintek, M
    SEMANTIC WEB - ISWC 2003, 2003, 2870 : 517 - 532
  • [27] Ontology based text indexing and querying for the semantic web
    Koehler, Jacob
    Philippi, Stephan
    Specht, Michael
    Rueegg, Alexander
    KNOWLEDGE-BASED SYSTEMS, 2006, 19 (08) : 744 - 754
  • [28] Qsense Learning Semantic Web Concepts by Querying DBpedia
    Panu, Andrei
    Buraga, Sabin C.
    Alboaie, Lenuta
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON E-BUSINESS (ICE-B 2013), 2013, : 351 - 356
  • [29] Temporal Shingling for Version Identification in Web Archives
    Schenkel, Ralf
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2010, 5993 : 508 - 519
  • [30] The Semantic Web and television archives: state of affairsy
    Sanchez Jimenez, Rodrigo
    Caldera Serrano, Jorge
    Botezan, Iuliana
    CUADERNOS DE DOCUMENTACION MULTIMEDIA, 2016, 27 (01): : 53 - 74