HIFUN - a high level functional query language for big data analytics

被引:6
|
作者
Spyratos, Nicolas [1 ]
Sugibuchi, Tsuyoshi [2 ]
机构
[1] Univ Paris Sud 11, Lab Rech Informat, Rue Georges Clemenceau, F-91400 Orsay, France
[2] OppScience, 14 Ave Trudaine, F-75009 Paris, France
关键词
Query language; Big data analytics; Data modeling; MapReduce;
D O I
10.1007/s10844-018-0495-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a high level query language, called HIFUN, for defining analytic queries over big datasets, independently of how these queries are evaluated. An analytic query in HIFUN is defined to be a well-formed expression of a functional algebra that we define in the paper. The operations of this algebra combine functions to create HIFUN queries in much the same way as the operations of the relational algebra combine relations to create algebraic queries. The contributions of this paper are: (a) the definition of a formal framework in which to study analytic queries in the abstract; (b) the encoding of a HIFUN query either as a MapReduce job or as an SQL group-by query; and (c) the definition of a formal method for rewriting HIFUN queries and, as a case study, its application to the rewriting of MapReduce jobs and of SQL group-by queries. We emphasize that, although theoretical in nature, our work uses only basic and well known mathematical concepts, namely functions and their basic operations.
引用
收藏
页码:529 / 555
页数:27
相关论文
共 50 条
  • [21] A Query-oriented Adaptive Indexing Technique for Smart Grid Big Data Analytics
    Chang Wang
    Yongxin Zhu
    Yan Ma
    Meikang Qiu
    Bin Liu
    Junjie Hou
    Yulan Shen
    Weiwei Shi
    Journal of Signal Processing Systems, 2018, 90 : 1091 - 1103
  • [22] Intelligent Query Placement strategy For Progressive-Real Time Analytics In Big Data
    Nath, Renya N.
    Narayanaswami, Priya
    Mohan, G. Geetha
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16), 2016,
  • [23] MEMOPASCAL - EMBEDDING A QUERY LANGUAGE INTO A HIGH-LEVEL PROGRAMMING LANGUAGE
    NOACK, J
    KUCHEN, H
    ANGEWANDTE INFORMATIK, 1987, (04): : 149 - 156
  • [24] Introducing Big Data Analytics in High School and College
    Sooriamurthi, Raja
    ITICSE'18: PROCEEDINGS OF THE 23RD ANNUAL ACM CONFERENCE ON INNOVATION AND TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, 2018, : 373 - 374
  • [25] Self-adaptive Based Model for Ambiguity Resolution of The Linked Data Query for Big Data Analytics
    Sharef, Nurfadhlina Mohd
    Shafazand, Yasser M.
    Nazri, Mohd Zakree Ahmad
    Husin, Nor Azura
    INTERNATIONAL JOURNAL OF INTEGRATED ENGINEERING, 2018, 10 (06): : 176 - 182
  • [26] Efficient Data Placement and Replication for QoS-Aware Approximate Query Evaluation of Big Data Analytics
    Xia, Qiufen
    Xu, Zichuan
    Liang, Weifa
    Yu, Shui
    Guo, Song
    Zomaya, Albert Y.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (12) : 2677 - 2691
  • [27] Big Data Analytics Integrating a Parallel Columnar DBMS and the R Language
    Zhang, Yiqun
    Ordonez, Carlos
    Cabrera, Wellington
    2016 16TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2016, : 627 - 630
  • [28] Model-Based Big Data Analytics-as-a-Service: Take Big Data to the Next Level
    Ardagna, Claudio Agostino
    Bellandi, Valerio
    Bezzi, Michele
    Ceravolo, Paolo
    Damiani, Ernesto
    Hebert, Cedric
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2021, 14 (02) : 516 - 529
  • [29] Big Data: The Structure & Value of Big Data Analytics
    Kim, Hak J.
    AMCIS 2015 PROCEEDINGS, 2015,
  • [30] Big data analytics and big data science: a survey
    Chen, Yong
    Chen, Hong
    Gorkhali, Anjee
    Lu, Yang
    Ma, Yiqian
    Li, Ling
    JOURNAL OF MANAGEMENT ANALYTICS, 2016, 3 (01) : 1 - 42