The Nautilus Analyzer: Understanding and Debugging Data Transformations

Melanie Herschel 1, 2 Hanno Eichelberger 3
2 OAK - Database optimizations and architectures for complex large data
LRI - Laboratoire de Recherche en Informatique, UP11 - Université Paris-Sud - Paris 11, Inria Saclay - Ile de France, CNRS - Centre National de la Recherche Scientifique : UMR8623
Abstract : When developing data transformations---a task omnipresent in applications like data integration, data migration, data cleaning, or scientific data processing---developers quickly face the need to verify the semantic correctness of the transformation. Declarative specifications of data transformations, e.g., SQL or ETL tools, increase developer productivity but usually provide limited or no means for inspection or debugging. In this situation, developers today have no choice but to manually analyze the transformation and, in case of an error, to (repeatedly) fix and test the transformation. The goal of the Nautilus project is to semi-automatically support this analysis-fix-test cycle. This demonstration focuses on one main component of Nautilus, namely the Nautilus Analyzer that helps developers in understanding and debugging their data transformations. The demonstration will show the capabilities of this component for data transformations specified in SQL on scenarios from different domains that are based on real-world data. We provide an overview the Nautilus Analyzer, discuss components and implementation techniques, and outline our demonstration plan. The Nautilus website (\url{http://nautilus-system.org}) features a video, screenshots, and further details.
Document type :
Conference papers
Complete list of metadatas

Cited literature [9 references]  Display  Hide  Download

https://hal.inria.fr/hal-00757591
Contributor : Melanie Herschel <>
Submitted on : Tuesday, November 27, 2012 - 11:44:29 AM
Last modification on : Monday, May 28, 2018 - 2:38:02 PM
Long-term archiving on : Thursday, February 28, 2013 - 3:43:24 AM

File

CIKM12-herschel.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00757591, version 1

Collections

Citation

Melanie Herschel, Hanno Eichelberger. The Nautilus Analyzer: Understanding and Debugging Data Transformations. ACM International Conference on Information and Knowledge Management, Oct 2012, Maui, HI, United States. ⟨hal-00757591⟩

Share

Metrics

Record views

315

Files downloads

318