SJSON']JSON: A succinct representation for JSON']JSON documents

被引:3
|
作者
Lee, Junhee [1 ]
Anjos, Edman [2 ]
Satti, Srinivasa Rao [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Google, Hamburg, Germany
关键词
!text type='JSON']JSON[!/text; Succinct data structure; Semi-structured document representation; Heterogeneous array indexing; SUFFIX ARRAYS; TREES;
D O I
10.1016/j.is.2020.101686
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The massive amounts of data processed in modern computational systems are becoming a problem of increasing importance. This data is commonly stored directly or indirectly through the use of data exchange languages, such as JSON (JavaScript Object Notation) and XML (eXtensible Markup Language), for human-readable platform-agnostic access. This paper focuses on exploring a set of succinct representations for JSON documents, which we call SJSON, achieving both reduced RAM and disk usage while supporting efficient queries on the documents. The representations we propose are mainly based on the idea that JSON documents can be decomposed into structural part and raw data part. In our method, we emulate the structure of the JSON document as a rooted ordered tree and represent it using succinct data structures, as opposed to the usual pointer-based implementation. Furthermore, the remaining raw data is reorganized into arrays of attributes and values. This deconstruction between structure and data allows for a straightforward connection between a node in the succinct tree and its corresponding name-value pair, dispensing pointers altogether. The proposed scheme is implemented as the SJSON library in C++, and evaluated with respect to a number of metrics, comparing its performance with popular alternative JSON parsers. Empirical results show that the library is able to represent JSON files succinctly while efficiently supporting traversal queries. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] ANOMALY DETECTION IN JSON']JSON STRUCTURED DATA
    Shliakhtina, E. A.
    Gamayunov, D. Y.
    PRIKLADNAYA DISKRETNAYA MATEMATIKA, 2022, (56): : 83 - 103
  • [42] Negation-closure for JSON']JSON Schema
    Baazizi, Mohamed -Amine
    Colazzo, Dario
    Ghelli, Giorgio
    Sartiani, Carlo
    Scherzinger, Stefanie
    THEORETICAL COMPUTER SCIENCE, 2023, 955
  • [43] Representing Synchrophasor Data Using JSON']JSON
    Brogan, Paul
    Moreno, Andres Jarmillo
    Liu, Xueqin Amy
    Hastings, John
    Laverty, David
    Morrow, D. John
    Duggan, Connor
    Best, Robert
    2021 32ND IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC 2021), 2021,
  • [44] JSON']JSON Schema Matching: Empirical Observations
    Waghray, Kunal
    SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, : 2887 - 2889
  • [45] A JSON']JSON document algebra for query optimization
    Llano-Rios, Tomas
    Khalefa, Mohamed
    Badia, Antonio
    INFORMATION SYSTEMS, 2025, 132
  • [46] RDF/JSON']JSON Serialization of Knowledge Patterns
    Miarka, Rostislav
    INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM-2018), 2019, 2116
  • [47] Validation and Data Processing in JSON']JSON Format
    Spivak, Iryna
    Krepych, Svitlana
    Litvynchuk, Mykola
    Spivak, Serhii
    IEEE EUROCON 2021 - 19TH INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES, 2021, : 326 - 330
  • [48] Reducing Ambiguity in Json']Json Schema Discovery
    Spoth, William
    Kennedy, Oliver
    Lu, Ying
    Hammerschmidt, Beda
    Liu, Zhen Hua
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 1732 - 1744
  • [49] JSON']JSON: Data model and query languages
    Bourhis, Pierre
    Reutter, Juan L.
    Vrgoc, Domagoj
    INFORMATION SYSTEMS, 2020, 89
  • [50] Compression of XML and JSON']JSON API Responses
    Tiwary, Gyan P.
    Stroulia, Eleni
    Srivastava, Abhishek
    IEEE ACCESS, 2021, 9 : 57426 - 57439