Anytime Large-Scale Analytics of Linked Open Data

被引:9
|
作者
Soulet, Arnaud [1 ,2 ]
Suchanek, Fabian M. [2 ]
机构
[1] Univ Tours, LIFAT, Blois, France
[2] Telecom Paris, Inst Polytech Paris, Paris, France
来源
关键词
SPARQL; WEB;
D O I
10.1007/978-3-030-30793-6_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Analytical queries are queries with numerical aggregators: computing the average number of objects per property, identifying the most frequent subjects, etc. Such queries are essential to monitor the quality and the content of the Linked Open Data (LOD) cloud. Many analytical queries cannot be executed directly on the SPARQL endpoints, because the fair use policy cuts off expensive queries. In this paper, we show how to rewrite such queries into a set of queries that each satisfy the fair use policy. We then show how to execute these queries in such a way that the result provably converges to the exact query answer. Our algorithm is an anytime algorithm, meaning that it can give intermediate approximate results at any time point. Our experiments show that the approach converges rapidly towards the exact solution, and that it can compute even complex indicators at the scale of the LOD cloud.
引用
收藏
页码:576 / 592
页数:17
相关论文
共 50 条
  • [31] Big Data Analytics for Large-scale Wireless Networks: Challenges and Opportunities
    Dai, Hong-Ning
    Wong, Raymond Chi-Wing
    Wang, Hao
    Zheng, Zibin
    Vasilakos, Athanasios V.
    ACM COMPUTING SURVEYS, 2019, 52 (05)
  • [32] TerraBrasilis: A Spatial Data Analytics Infrastructure for Large-Scale Thematic Mapping
    Assis, Luiz Fernando F. G.
    Ferreira, Karine Reis
    Vinhas, Lubia
    Maurano, Luis
    Almeida, Claudio
    Carvalho, Andre
    Rodrigues, Jether
    Maciel, Adeline
    Camargo, Claudinei
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (11)
  • [33] BANKSAFE: Visual analytics for big data in large-scale computer networks
    Fischer, Fabian
    Fuchs, Johannes
    Mansmann, Florian
    Keim, Daniel A.
    INFORMATION VISUALIZATION, 2015, 14 (01) : 51 - 61
  • [34] Visual Analytics to make sense of large-scale administrative and normative data
    Guarino, Alfonso
    Lettieri, Nicola
    Malandrino, Delfina
    Russo, Pietro
    Zaccagnino, Rocco
    2019 23RD INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV): BIOMEDICAL VISUALIZATION AND GEOMETRIC MODELLING & IMAGING, 2019, : 133 - 138
  • [35] Effective ensemble learning approach for large-scale medical data analytics
    Lakshmana Rao Namamula
    Daniel Chaytor
    International Journal of System Assurance Engineering and Management, 2024, 15 : 13 - 20
  • [36] Going Digital: A Survey on Digitalization and Large-Scale Data Analytics in Healthcare
    Tresp, Volker
    Overhage, J. Marc
    Bundschus, Markus
    Rabizadeh, Shahrooz
    Fasching, Peter A.
    Yu Shipeng
    PROCEEDINGS OF THE IEEE, 2016, 104 (11) : 2180 - 2206
  • [37] A Novel Visual analytics Approach for Clustering Large-Scale Social Data
    Wang, Zhangye
    Zhou, Juanxia
    Chen, Wei
    Chen, Chang
    Liao, Jiyuan
    Maciejewski, Ross
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [38] Efficient Graph Analytics in Python']Python for Large-Scale Data Science
    Zhou, Xiantian
    Ordonez, Carlos
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY (DAWAK 2021), 2021, 12925 : 158 - 164
  • [39] Integrating Online Compression to Accelerate Large-Scale Data Analytics Applications
    Bicer, Tekin
    Yin, Jian
    Chiu, David
    Agrawal, Gagan
    Schuchardt, Karen
    IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013), 2013, : 1205 - 1216
  • [40] Efficient Large-scale Medical Data (eHealth Big Data) Analytics in Internet of Things
    Plageras, Andreas P.
    Stergiou, Christos
    Kokkonis, George
    Psannis, Kostas E.
    Ishibashi, Yutaka
    Kim, Byung-Gyu
    Gupta, B. Brij
    2017 IEEE 19TH CONFERENCE ON BUSINESS INFORMATICS (CBI), VOL 2, 2017, 2 : 21 - 27