Validating Streaming JSON']JSON Documents with Learned VPAs

被引:0
|
作者
Bruyere, Veronique [1 ]
Perez, Guillermo A. [2 ]
Staquet, Gaetan [1 ,2 ]
机构
[1] Univ Mons UMONS, Mons, Belgium
[2] Univ Antwerp UAntwerp Flanders Make, Antwerp, Belgium
来源
TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PT I, TACAS 2023 | 2023年 / 13993卷
关键词
Visibly pushdown automata; !text type='JSON']JSON[!/text; streaming validation; XML;
D O I
10.1007/978-3-031-30823-9_14
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present a new streaming algorithm to validate JSON documents against a set of constraints given as a JSON schema. Among the possible values a JSON document can hold, objects are unordered collections of key-value pairs while arrays are ordered collections of values. We prove that there always exists a visibly pushdown automaton (VPA) that accepts the same set of JSON documents as a JSON schema. Leveraging this result, our approach relies on learning a VPA for the provided schema. As the learned VPA assumes a fixed order on the key-value pairs of the objects, we abstract its transitions in a special kind of graph, and propose an efficient streaming algorithm using the VPA and its graph to decide whether a JSON document is valid for the schema. We evaluate the implementation of our algorithm on a number of random JSON documents, and compare it to the classical validation algorithm.
引用
收藏
页码:271 / 289
页数:19
相关论文
共 50 条
  • [1] SJSON']JSON: A succinct representation for JSON']JSON documents
    Lee, Junhee
    Anjos, Edman
    Satti, Srinivasa Rao
    INFORMATION SYSTEMS, 2021, 97
  • [2] EvsJSON']JSON : An Efficient Validator for Split JSON']JSON Documents
    He, Bangjun
    Zuo, Jie
    Feng, Qiaoyan
    Xie, Guicai
    Qin, Ruiqi
    Chen, Zihao
    Duan, Lei
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT III, 2020, 12114 : 775 - 779
  • [3] JSON']JSONDISCOVERER: Visualizing the schema lurking behind JSON']JSON documents
    Canovas Izquierdo, Javier Luis
    Cabot, Jordi
    KNOWLEDGE-BASED SYSTEMS, 2016, 103 : 52 - 55
  • [4] Json']JsonToOnto: Building Owl2 Ontologies from Json']Json Documents
    Sbai, Sara
    Chbihi Louhdi, Mohammed Reda
    Behja, Hicham
    Chakhmoune, Rabab
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (10) : 213 - 218
  • [5] On-demand JSON']JSON: A better way to parse documents?
    Keiser, John
    Lemire, Daniel
    SOFTWARE-PRACTICE & EXPERIENCE, 2024, 54 (06): : 1074 - 1086
  • [6] An Attribute-Based Protection Model for JSON']JSON Documents
    Biswas, Prosunjit
    Sandhu, Ravi
    Krishnan, Ram
    NETWORK AND SYSTEM SECURITY, (NSS 2016), 2016, 9955 : 303 - 317
  • [7] Streaming CityJSON']JSON datasets
    Ledoux, Hugo
    Stavropoulou, Gina
    Dukai, Balazs
    19TH 3D GEOINFO CONFERENCE 2024, VOL. 48-4, 2024, : 57 - 63
  • [8] JEDI: These aren't the JSON']JSON documents you're looking for ...
    Huetter, Thomas
    Augsten, Nikolaus
    Kirsch, Christoph M.
    Carey, Michael J.
    Li, Chen
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 1584 - 1597
  • [9] JSON']JSON Documents Processing Using Situation-Oriented Databases
    Mironov, Valeriy
    Gusarenko, Artem
    Yusupova, Nafisa
    Smetanin, Yuriy
    ACTA POLYTECHNICA HUNGARICA, 2020, 17 (08) : 29 - 40
  • [10] JSON']JSON Model: a Lightweight Featureful DSL for JSON']JSON
    Coelho, Fabien
    Yannou-Medrala, Claire
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2024, 2024, 14918 : 3 - 17