Anytime Large-Scale Analytics of Linked Open Data

被引:9
|
作者
Soulet, Arnaud [1 ,2 ]
Suchanek, Fabian M. [2 ]
机构
[1] Univ Tours, LIFAT, Blois, France
[2] Telecom Paris, Inst Polytech Paris, Paris, France
来源
关键词
SPARQL; WEB;
D O I
10.1007/978-3-030-30793-6_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Analytical queries are queries with numerical aggregators: computing the average number of objects per property, identifying the most frequent subjects, etc. Such queries are essential to monitor the quality and the content of the Linked Open Data (LOD) cloud. Many analytical queries cannot be executed directly on the SPARQL endpoints, because the fair use policy cuts off expensive queries. In this paper, we show how to rewrite such queries into a set of queries that each satisfy the fair use policy. We then show how to execute these queries in such a way that the result provably converges to the exact query answer. Our algorithm is an anytime algorithm, meaning that it can give intermediate approximate results at any time point. Our experiments show that the approach converges rapidly towards the exact solution, and that it can compute even complex indicators at the scale of the LOD cloud.
引用
收藏
页码:576 / 592
页数:17
相关论文
共 50 条
  • [1] A Hybrid Data Model for Large-Scale Analytics
    Feo, John
    2018 ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS, 2018, : 269 - 269
  • [2] Large-scale regulatory and signaling network assembly through linked open data
    Lefebvre, M.
    Gaignard, A.
    Folschette, M.
    Bourdon, J.
    Guziolowski, C.
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2021,
  • [3] Developing Anytime SVM Training Algorithms for Large-Scale Data Classification
    Han, Rui
    Ghanem, Moustafa
    Williams, Andreas
    Guo, Yike
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFTWARE ENGINEERING (AISE 2014), 2014, : 360 - 366
  • [4] Visual Analytics of Large-Scale Climate Model Data
    Wong, Pak Chung
    Shen, Han-Wei
    Leung, Ruby
    Hagos, Samson
    Lee, Teng-Yok
    Tong, Xin
    Lu, Kewei
    2014 IEEE 4TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION (LDAV), 2014, : 85 - 92
  • [5] Disco: A Computing Platform for Large-Scale Data Analytics
    Mundkur, Prashanth
    Tuulos, Ville
    Flatow, Jared
    ERLANG 11: PROCEEDINGS OF THE 2011 ACM SIGPLAN ERLANG WORKSHOP, 2011, : 84 - 89
  • [6] Scalable computing for large-scale multimedia data analytics
    Karuppiah, Marimuthu
    Chaudhry, Shehzad Ashraf
    Alsharif, Mohammed H.
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, 62 (03) : 601 - 603
  • [7] Visual Cascade Analytics of Large-Scale Spatiotemporal Data
    Deng, Zikun
    Weng, Di
    Liang, Yuxuan
    Bao, Jie
    Zheng, Yu
    Schreck, Tobias
    Xu, Mingliang
    Wu, Yingcai
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (06) : 2486 - 2499
  • [8] Extracting Domain-specific Concepts from Large-scale Linked Open Data
    Kume, Satoshi
    Kozaki, Kouji
    PROCEEDINGS OF THE 10TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE GRAPHS (IJCKG 2021), 2021, : 28 - 37
  • [9] Linked open data: Building up Japanese vocabulary for large scale linked open data
    Tamagawa, Susumu
    Kagawa, Kosuke
    Morita, Takeshi
    Yamaguchi, Takahira
    Transactions of the Japanese Society for Artificial Intelligence, 2014, 29 (04) : 386 - 395
  • [10] Performance Evaluation of Big Data Frameworks for Large-Scale Data Analytics
    Veiga, Jorge
    Exposito, Roberto R.
    Pardo, Xoan C.
    Taboada, Guillermo L.
    Tourino, Juan
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 424 - 431