Bash Datalog: Answering Datalog Queries with Unix Shell Commands

Cited by: 1
Authors
Rebele, Thomas [1]
Tanon, Thomas Pellissier [1]
Suchanek, Fabian [1]
Affiliations
[1] Telecom ParisTech, Paris, France
Source
Keywords
SYSTEM;
DOI
10.1007/978-3-030-00671-6_33
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Dealing with large tabular datasets often requires extensive preprocessing. Because this preprocessing happens only once, loading and indexing the data in a database or triple store may be overkill. In this paper, we present an approach that allows preprocessing large tabular data in Datalog without indexing the data. The Datalog query is translated to Unix Bash and can be executed in a shell. Our experiments show that, for the use case of data preprocessing, our approach is competitive with state-of-the-art systems in terms of scalability and speed, while requiring only a Bash shell on a Unix system.
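To make the idea of answering a Datalog rule with shell commands concrete, here is a minimal sketch. It is not the translation produced by the system described in the abstract, only an illustration under stated assumptions: facts for a hypothetical predicate parent(X, Y) are stored one per line in a tab-separated file, and the hypothetical rule grandparent(X, Z) :- parent(X, Y), parent(Y, Z) is evaluated with the standard sort and join utilities. The function name, file layout, and sample facts are all invented for this example.

```bash
#!/usr/bin/env bash
set -eu
export LC_ALL=C            # byte-wise collation so sort and join agree on order
tab=$(printf '\t')

# grandparent FILE
# Assumed rule: grandparent(X, Z) :- parent(X, Y), parent(Y, Z).
# FILE holds hypothetical parent(X, Y) facts, one tab-separated pair per line.
grandparent() {
  facts=$1
  tmp=$(mktemp -d)
  # join needs both inputs sorted on their join field (the shared variable Y)
  sort -t "$tab" -k2,2 "$facts" > "$tmp/by_child"    # Y is column 2 here
  sort -t "$tab" -k1,1 "$facts" > "$tmp/by_parent"   # Y is column 1 here
  # Keep X from the first input and Z from the second, then drop duplicates
  join -t "$tab" -1 2 -2 1 -o 1.1,2.2 "$tmp/by_child" "$tmp/by_parent" | sort -u
  rm -rf "$tmp"
}

# Sample facts (invented for illustration)
facts_file=$(mktemp)
printf 'alice\tbob\nbob\tcarol\nbob\tdave\n' > "$facts_file"
grandparent "$facts_file"
```

On the sample facts this prints the pairs alice/carol and alice/dave. No index is built at any point: each run sorts the input files and streams them through join, which matches the one-off preprocessing scenario the abstract describes.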
Pages: 566-582
Number of pages: 17