HIFUN - a high level functional query language for big data analytics

被引:6
|
作者
Spyratos, Nicolas [1 ]
Sugibuchi, Tsuyoshi [2 ]
机构
[1] Univ Paris Sud 11, Lab Rech Informat, Rue Georges Clemenceau, F-91400 Orsay, France
[2] OppScience, 14 Ave Trudaine, F-75009 Paris, France
关键词
Query language; Big data analytics; Data modeling; MapReduce;
D O I
10.1007/s10844-018-0495-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a high level query language, called HIFUN, for defining analytic queries over big datasets, independently of how these queries are evaluated. An analytic query in HIFUN is defined to be a well-formed expression of a functional algebra that we define in the paper. The operations of this algebra combine functions to create HIFUN queries in much the same way as the operations of the relational algebra combine relations to create algebraic queries. The contributions of this paper are: (a) the definition of a formal framework in which to study analytic queries in the abstract; (b) the encoding of a HIFUN query either as a MapReduce job or as an SQL group-by query; and (c) the definition of a formal method for rewriting HIFUN queries and, as a case study, its application to the rewriting of MapReduce jobs and of SQL group-by queries. We emphasize that, although theoretical in nature, our work uses only basic and well known mathematical concepts, namely functions and their basic operations.
引用
收藏
页码:529 / 555
页数:27
相关论文
共 50 条
  • [1] HIFUN - a high level functional query language for big data analytics
    Nicolas Spyratos
    Tsuyoshi Sugibuchi
    Journal of Intelligent Information Systems, 2018, 51 : 529 - 555
  • [2] Data Exploration in the HIFUN Language
    Spyratos, Nicolas
    Sugibuchi, Tsuyoshi
    FLEXIBLE QUERY ANSWERING SYSTEMS, 2019, 11529 : 176 - 187
  • [3] A Solution to Query Processing Challenges Through Smart Query Processor for Big Data Analytics
    Vaidya G.M.
    Kshirsagar M.M.
    SN Computer Science, 4 (2)
  • [4] Evaluation of high-level query languages based on MapReduce in Big Data
    Birjali, Marouane
    Beni-Hssane, Abderrahim
    Erritali, Mohammed
    JOURNAL OF BIG DATA, 2018, 5 (01)
  • [5] QDrill: Query-Based Distributed Consumable Analytics for Big Data
    Khalifa, Shadi
    Martin, Patrick
    Rope, Dan
    McRoberts, Mike
    Statchuk, Craig
    2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 117 - 124
  • [6] QoS-Aware Data Replications and Placements for Query Evaluation of Big Data Analytics
    Xia, Qiufen
    Liang, Weifa
    Xu, Zichuan
    2017 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2017,
  • [7] Predictable performance and high query concurrency for data analytics
    Candea, George
    Polyzotis, Neoklis
    Vingralek, Radek
    VLDB JOURNAL, 2011, 20 (02): : 227 - 248
  • [8] Predictable performance and high query concurrency for data analytics
    George Candea
    Neoklis Polyzotis
    Radek Vingralek
    The VLDB Journal, 2011, 20 : 227 - 248
  • [9] High-Level Web Data Abstraction Using Language Integrated Query
    Misek, Jakub
    Zavoral, Filip
    INTELLIGENT DISTRIBUTED COMPUTING IV, 2010, 315 : 13 - 22
  • [10] A high-level query language for events
    Bry, Francois
    Eckert, Michael
    SCW 2006: IEEE SERVICES COMPUTING WORKSHOPS, PROCEEDINGS, 2006, : 31 - +