SJSON']JSON: A succinct representation for JSON']JSON documents

被引:3
|
作者
Lee, Junhee [1 ]
Anjos, Edman [2 ]
Satti, Srinivasa Rao [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Google, Hamburg, Germany
关键词
!text type='JSON']JSON[!/text; Succinct data structure; Semi-structured document representation; Heterogeneous array indexing; SUFFIX ARRAYS; TREES;
D O I
10.1016/j.is.2020.101686
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The massive amounts of data processed in modern computational systems are becoming a problem of increasing importance. This data is commonly stored directly or indirectly through the use of data exchange languages, such as JSON (JavaScript Object Notation) and XML (eXtensible Markup Language), for human-readable platform-agnostic access. This paper focuses on exploring a set of succinct representations for JSON documents, which we call SJSON, achieving both reduced RAM and disk usage while supporting efficient queries on the documents. The representations we propose are mainly based on the idea that JSON documents can be decomposed into structural part and raw data part. In our method, we emulate the structure of the JSON document as a rooted ordered tree and represent it using succinct data structures, as opposed to the usual pointer-based implementation. Furthermore, the remaining raw data is reorganized into arrays of attributes and values. This deconstruction between structure and data allows for a straightforward connection between a node in the succinct tree and its corresponding name-value pair, dispensing pointers altogether. The proposed scheme is implemented as the SJSON library in C++, and evaluated with respect to a number of metrics, comparing its performance with popular alternative JSON parsers. Empirical results show that the library is able to represent JSON files succinctly while efficiently supporting traversal queries. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] SJSON']JSON: A Succinct Representation for Java']JavaScript Object Notation Documents
    Anjos, Edman
    Lee, Junhee
    Satti, Srinivasa Rao
    2016 ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM 2016), 2016, : 173 - 178
  • [2] EvsJSON']JSON : An Efficient Validator for Split JSON']JSON Documents
    He, Bangjun
    Zuo, Jie
    Feng, Qiaoyan
    Xie, Guicai
    Qin, Ruiqi
    Chen, Zihao
    Duan, Lei
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT III, 2020, 12114 : 775 - 779
  • [3] JSON']JSONDISCOVERER: Visualizing the schema lurking behind JSON']JSON documents
    Canovas Izquierdo, Javier Luis
    Cabot, Jordi
    KNOWLEDGE-BASED SYSTEMS, 2016, 103 : 52 - 55
  • [4] PROV-JSON']JSONLD: A JSON']JSON and Linked Data Representation for Provenance
    Trung Dong Huynh
    Michaelides, Danius T.
    Moreau, Luc
    Provenance and Annotation of Data and Processes, IPAW 2016, 2016, 9672 : 173 - 177
  • [5] Json']JsonToOnto: Building Owl2 Ontologies from Json']Json Documents
    Sbai, Sara
    Chbihi Louhdi, Mohammed Reda
    Behja, Hicham
    Chakhmoune, Rabab
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (10) : 213 - 218
  • [6] Validating Streaming JSON']JSON Documents with Learned VPAs
    Bruyere, Veronique
    Perez, Guillermo A.
    Staquet, Gaetan
    TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PT I, TACAS 2023, 2023, 13993 : 271 - 289
  • [7] JSON']JSON Model: a Lightweight Featureful DSL for JSON']JSON
    Coelho, Fabien
    Yannou-Medrala, Claire
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2024, 2024, 14918 : 3 - 17
  • [8] On-demand JSON']JSON: A better way to parse documents?
    Keiser, John
    Lemire, Daniel
    SOFTWARE-PRACTICE & EXPERIENCE, 2024, 54 (06): : 1074 - 1086
  • [9] JSON']JSON Encryption
    Abd El-Aziz, A. A.
    Kannan, A.
    2014 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2014,
  • [10] An Attribute-Based Protection Model for JSON']JSON Documents
    Biswas, Prosunjit
    Sandhu, Ravi
    Krishnan, Ram
    NETWORK AND SYSTEM SECURITY, (NSS 2016), 2016, 9955 : 303 - 317