Typing Massive JSON Datasets

Dario Colazzo 1, 2 Giorgio Ghelli 3 Carlo Sartiani 4
2 OAK - Database optimizations and architectures for complex large data
CNRS - Centre National de la Recherche Scientifique : UMR8623, Inria Saclay - Ile de France, UP11 - Université Paris-Sud - Paris 11, LRI - Laboratoire de Recherche en Informatique
Abstract : Cloud-specific languages are usually untyped, and no guarantees about the correctness of complex jobs can be statically obtained. Datasets too are usually untyped and no schema information is needed for their manipulation. In this paper we sketch a typing algorithm for JSON datasets. Our approach can be used to infer a succinct type from scratch for a collection of JSON objects, as well as to validate a dataset against a human-designed type and, if necessary, to adapt and improve this type.
Document type :
Conference papers
Complete list of metadatas

https://hal.inria.fr/hal-00758716
Contributor : Dario Colazzo <>
Submitted on : Thursday, November 29, 2012 - 11:04:54 AM
Last modification on : Tuesday, February 26, 2019 - 10:55:13 AM

Identifiers

  • HAL Id : hal-00758716, version 1

Collections

Citation

Dario Colazzo, Giorgio Ghelli, Carlo Sartiani. Typing Massive JSON Datasets. International Workshop on Cross-model Language Design and Implementation (XLDI), Sep 2012, Copenhagen, Denmark. ⟨hal-00758716⟩

Share

Metrics

Record views

349