SJSON: A succinct representation for JSON documents

Cited by: 3
Authors
Lee, Junhee [1 ]
Anjos, Edman [2 ]
Satti, Srinivasa Rao [1 ]
Affiliations
[1] Seoul Natl Univ, Seoul, South Korea
[2] Google, Hamburg, Germany
Keywords
JSON; Succinct data structure; Semi-structured document representation; Heterogeneous array indexing; SUFFIX ARRAYS; TREES;
DOI
10.1016/j.is.2020.101686
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The massive amounts of data processed in modern computational systems are becoming a problem of increasing importance. This data is commonly stored directly or indirectly through the use of data exchange languages, such as JSON (JavaScript Object Notation) and XML (eXtensible Markup Language), for human-readable, platform-agnostic access. This paper explores a set of succinct representations for JSON documents, which we call SJSON, achieving both reduced RAM and disk usage while supporting efficient queries on the documents. The representations we propose are mainly based on the idea that JSON documents can be decomposed into a structural part and a raw data part. In our method, we model the structure of the JSON document as a rooted ordered tree and represent it using succinct data structures, as opposed to the usual pointer-based implementation. Furthermore, the remaining raw data is reorganized into arrays of attributes and values. This separation of structure and data allows for a straightforward connection between a node in the succinct tree and its corresponding name-value pair, dispensing with pointers altogether. The proposed scheme is implemented as the SJSON library in C++ and evaluated with respect to a number of metrics, comparing its performance with popular alternative JSON parsers. Empirical results show that the library is able to represent JSON files succinctly while efficiently supporting traversal queries. © 2020 Elsevier Ltd. All rights reserved.
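The decomposition described in the abstract can be illustrated with a small sketch. It is not the SJSON library's actual encoding or API: it assumes a balanced-parentheses string as the succinct tree (two symbols per node), and the names `decompose`, `find_close`, `rank_open`, and `children` are hypothetical. The point it shows is that a node's preorder rank, obtained by counting opening parentheses, indexes directly into the name and value arrays, so no pointers are stored.

```python
import json


def decompose(doc):
    """Split a parsed JSON value into a balanced-parentheses string plus
    parallel name/value arrays indexed by preorder node rank.
    Illustrative only; the real SJSON layout may differ."""
    bp = []      # one "(" and one ")" per tree node
    names = []   # names[i]: key of node i (None for the root and array items)
    values = []  # values[i]: scalar stored at node i (None for containers)

    def visit(name, value):
        bp.append("(")
        names.append(name)
        if isinstance(value, dict):
            values.append(None)
            for key, child in value.items():
                visit(key, child)
        elif isinstance(value, list):
            values.append(None)
            for child in value:
                visit(None, child)
        else:
            values.append(value)
        bp.append(")")

    visit(None, doc)
    return "".join(bp), names, values


def find_close(bp, pos):
    """Position of the ")" matching the "(" at pos. This is a linear scan;
    succinct structures answer it in constant time with a small index."""
    depth = 0
    for i in range(pos, len(bp)):
        depth += 1 if bp[i] == "(" else -1
        if depth == 0:
            return i
    raise ValueError("unbalanced parentheses")


def rank_open(bp, pos):
    """Number of "(" strictly before pos, i.e. the node's preorder rank."""
    return bp.count("(", 0, pos)


def children(bp, pos):
    """Yield the positions of the children of the node opened at pos."""
    child = pos + 1
    while child < len(bp) and bp[child] == "(":
        yield child
        child = find_close(bp, child) + 1


if __name__ == "__main__":
    text = '{"id": 7, "tags": ["a", "b"], "owner": {"name": "Lee"}}'
    bp, names, values = decompose(json.loads(text))
    print(bp)  # (()(()())(()))
    for child in children(bp, 0):
        rank = rank_open(bp, child)
        print(names[rank], values[rank])
```

In this sketch the tree costs two symbols (two bits, if packed) per node, and the linear scans stand in for the constant-time rank and matching-parenthesis operations that a succinct tree implementation would provide.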
Pages: 15
Related papers
50 in total
  • [21] SPEAR-JSON: Selective parsing of JSON to enable accelerated stream processing on FPGAs
    Hahn, Tobias
    Wildermann, Stefan
    Teich, Juergen
    2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2023, : 189 - 196
  • [22] Non-native Techniques for Storing JSON Documents into Relational Tables
    Petkovic, Dusan
    22ND INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES (IIWAS2020), 2020, : 16 - 20
  • [23] JSON Schema Inference Approaches
    Contos, Pavel
    Svoboda, Martin
    ADVANCES IN CONCEPTUAL MODELING, ER 2020, 2020, 12584 : 173 - 183
  • [24] JUpdate: A JSON Update Language
    Brahmia, Zouhaier
    Brahmia, Safa
    Grandi, Fabio
    Bouaziz, Rafik
    ELECTRONICS, 2022, 11 (04)
  • [25] Witness Generation for JSON Schema
    Attouche, Lyes
    Baazizi, Mohamed-Amine
    Colazzo, Dario
    Ghelli, Giorgio
    Sartiani, Carlo
    Scherzinger, Stefanie
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (13): : 4002 - 4014
  • [26] Survey on JSON Data Modelling
    Lv, Teng
    Yan, Ping
    He, Weimin
    3RD ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2018), 2018, 1069
  • [27] Rdf serialization from JSON Data The case of JSON data in Diavgeia.gov.gr
    Theocharis, Stamatios
    Tsihrintzis, George
    2016 7TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS & APPLICATIONS (IISA), 2016,
  • [28] Hybrid.JSON: High-velocity Parallel In-Memory Polystore JSON Ingest
    Ortiz, Steven
    Enbatan, Caner
    Podkorytov, Maksim
    Soderman, Dylan
    Gubanov, Michael
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4807 - 4809
  • [29] AutoCouch: A JSON CRDT Framework
    Grosch, Pascal
    Krafft, Roman
    Woelki, Marcel
    Bieniusa, Annette
    7TH WORKSHOP ON PRINCIPLES AND PRACTICE OF CONSISTENCY FOR DISTRIBUTED DATA (PAPOC '20), 2020,
  • [30] XML and JSON Are Like Cardboard
    Helland, Pat
    COMMUNICATIONS OF THE ACM, 2017, 60 (12) : 46 - 47