Validating Streaming JSON Documents with Learned VPAs

Authors
Bruyere, Veronique [1]
Perez, Guillermo A. [2]
Staquet, Gaetan [1, 2]
Affiliations
[1] Univ Mons UMONS, Mons, Belgium
[2] Univ Antwerp UAntwerp Flanders Make, Antwerp, Belgium
Source
TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PT I, TACAS 2023 | 2023 / Vol. 13993
Keywords
Visibly pushdown automata; JSON; streaming validation; XML
DOI
10.1007/978-3-031-30823-9_14
Chinese Library Classification
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract
We present a new streaming algorithm to validate JSON documents against a set of constraints given as a JSON schema. Among the possible values a JSON document can hold, objects are unordered collections of key-value pairs while arrays are ordered collections of values. We prove that there always exists a visibly pushdown automaton (VPA) that accepts the same set of JSON documents as a JSON schema. Leveraging this result, our approach relies on learning a VPA for the provided schema. As the learned VPA assumes a fixed order on the key-value pairs of the objects, we abstract its transitions in a special kind of graph, and propose an efficient streaming algorithm using the VPA and its graph to decide whether a JSON document is valid for the schema. We evaluate the implementation of our algorithm on a number of random JSON documents, and compare it to the classical validation algorithm.
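
As a rough illustration of the visibly pushdown idea the abstract relies on, the sketch below processes a stream of JSON tokens with a stack: opening brackets act as call symbols (push), closing brackets act as return symbols (pop), and every other token is an internal symbol that leaves the stack untouched. This is not the authors' learned VPA or their validation algorithm. The function name `vpa_accepts` and the pre-tokenized input are assumptions made for the example, and the automaton only checks bracket nesting, not schema constraints.

```python
from typing import Iterable, List

# Hypothetical toy alphabet: call symbols map to the return symbol they expect.
CALLS = {"{": "}", "[": "]"}
RETURNS = {"}", "]"}


def vpa_accepts(tokens: Iterable[str]) -> bool:
    """Single-state visibly pushdown check of bracket nesting over a token stream."""
    stack: List[str] = []
    for tok in tokens:
        if tok in CALLS:
            # Call symbol: push the closer that must eventually match.
            stack.append(CALLS[tok])
        elif tok in RETURNS:
            # Return symbol: pop and compare with the expected closer.
            if not stack or stack.pop() != tok:
                return False
        # Any other token is an internal symbol; the stack is unchanged.
    # Accept only if every call symbol was matched by a return symbol.
    return not stack


if __name__ == "__main__":
    well_nested = ["{", '"k"', ":", "[", "1", ",", "2", "]", "}"]
    crossed = ["{", "[", "}", "]"]
    print(vpa_accepts(well_nested))
    print(vpa_accepts(crossed))
```

Because the stack action is determined by the token alone, the input can be consumed one token at a time without buffering the document, which is the property that makes this class of automata suitable for streaming.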
Pages: 271-289
Page count: 19