Clustering Remote RDF Data Using SPARQL Update Queries

Authors
Qi, Letao [1]
Lin, Harris T. [1]
Honavar, Vasant [1]
Affiliations
[1] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
DOI
Not available
Chinese Library Classification number
TP301 [Theory and Methods]
Discipline classification code
081202
Abstract
The emergence of large and distributed RDF data in the Linked Open Data cloud calls for approaches to extract useful knowledge using machine learning techniques such as clustering. However, the massive size and remote nature of RDF data hinder traditional approaches that gather the datasets at a centralized location for analysis. In this work, we show how to implement two representative clustering algorithms using update queries against the SPARQL endpoint of the RDF store. We compare the time complexity and the communication complexity of our algorithms with those of algorithms that require direct centralized access to the data and hence have to retrieve the entire RDF dataset from the remote location. We conduct experiments on a real social network dataset and report our preliminary findings.
Pages: 236-242
Number of pages: 7