ITISS: an efficient framework for querying big temporal data

被引:2
|
作者
Chen, Zhongpu [1 ]
Yao, Bin [1 ]
Wang, Zhi-Jie [2 ,3 ,4 ]
Zhang, Wei [1 ]
Zheng, Kai [5 ]
Kalnis, Panos [6 ]
Tang, Feilong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Shanghai, Peoples R China
[2] Sen Univ, Guangzhou, Peoples R China
[3] Guangdong Key Lab Big Data Anal, Proc, Guangzhou, Peoples R China
[4] Natl Engn Lab Big Data Anal, Applict, Beijing, Beijing, Peoples R China
[5] Univ Elect Sci, Technol China, Chengdu, Peoples R China
[6] King Abdullah Univ Sci, Technol, Thuwal, Saudi Arabia
基金
中国国家自然科学基金;
关键词
Temporal join query; Time travel query; Temporal aggregation query; Distributed in-memory systems; Big data; JOIN;
D O I
10.1007/s10707-019-00362-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the real word, temporal data can be found in many applications, and it is rapidly increasing nowadays. It is urgently important and challenging to manage and operate big temporal data efficiently and effectively, due to the large volume of big temporal data and the real-time response requirement. Processing big temporal data using a distributed system is a desired choice, since a single-machine based system usually has the limited computing ability. Nevertheless, existing distributed systems or methods either are disk-based solutions, or cannot support native queries, which may not well meet the demands of low latency and high throughput. To attack these issues, this article suggests a new approach to handle big temporal data. Our approach is an In-memory based Two-level Index Solution in Spark, dubbed as ITISS. The proposed framework of our solution is easily understood and implemented, but without loss of effectiveness and efficiency. Based on the proposed framework, this article develops targeted algorithms for handling time travel, temporal aggregation, and temporal join queries, respectively. We have implemented our framework in Apache Spark, extended the Apache Spark SQL to support declarative SQL interface that enables users to perform temporal queries with a few lines of SQL statements, and conducted extensive experiments to verify the performance of our solution. The experimental results, based on both real and synthetic datasets, consistently demonstrate that our proposed solution is efficient and competitive for processing big temporal data.
引用
收藏
页码:27 / 59
页数:33
相关论文
共 50 条
  • [1] ITISS: an efficient framework for querying big temporal data
    Zhongpu Chen
    Bin Yao
    Zhi-Jie Wang
    Wei Zhang
    Kai Zheng
    Panos Kalnis
    Feilong Tang
    [J]. GeoInformatica, 2020, 24 : 27 - 59
  • [2] Applied Temporal RDF: Efficient Temporal Querying of RDF Data with SPARQL
    Tappolet, Jonas
    Bernstein, Abraham
    [J]. SEMANTIC WEB: RESEARCH AND APPLICATIONS, 2009, 5554 : 308 - 322
  • [3] BINARY: A Framework for Big Data Integration for Ad-hoc Querying
    Eftekhari, Azadeh
    Zulkernine, Farhana
    Martin, Patrick
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2746 - 2753
  • [4] High-Efficient Fuzzy Querying With HiveQL for Big Data Warehousing
    Malysiak-Mrozek, Bozena
    Wieszok, Jadwiga
    Pedrycz, Witold
    Ding, Weiping
    Mrozek, Dariusz
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (06) : 1823 - 1837
  • [5] Efficient Querying Distributed Big-XML Data using MapReduce
    Song Kunfang
    Hongwei Lu
    [J]. INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2016, 8 (03) : 70 - 79
  • [6] Querying Big Data by Accessing Small Data
    Fan, Wenfei
    Geerts, Floris
    Cao, Yang
    Deng, Ting
    Lu, Ping
    [J]. PODS'15: PROCEEDINGS OF THE 33RD ACM SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2015, : 173 - 184
  • [7] Querying semistructured temporal data
    Combi, Carlo
    Lavarini, Nico
    Oliboni, Barbara
    [J]. CURRENT TRENDS IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 4254 : 625 - 636
  • [8] sksOpen: Efficient Indexing, Querying, and Visualization of Geo-spatial Big Data
    Lu, Yun
    Zhang, Mingjin
    Witherspoon, Shonda
    Yesha, Yelena
    Yesha, Yaacov
    Rishe, Naphtali
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 2, 2013, : 495 - 500
  • [9] On Scale Independence for Querying Big Data
    Fan, Wenfei
    Geerts, Floris
    Libkin, Leonid
    [J]. PODS'14: PROCEEDINGS OF THE 33RD ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2014, : 51 - 62
  • [10] Composable and Efficient Functional Big Data Processing Framework
    Wu, Dongyao
    Sakr, Sherif
    Zhu, Liming
    Lu, Qinghua
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 279 - 286