Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

HAP: Building Pipelines with Heterogeneous Data and Hive

Damien Graux 1 Pierre Genevès 1 Nabil Layaïda 1
1 TYREX - Types and Reasoning for the Web
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble [2007-2015]
Abstract : The increasing number of available datasets gives opportunities to build large and complex applications which aggregate results coming from several sources. These emerging usecases require new systems where combinations of heterogeneous sources are both allowed and efficient. To tackle these challenges, we provide a simple high-level set of primitives – called HAP – to easily describe processing chains. These descriptions are then compiled into optimized SQL queries executed by Hive.
Document type :
Preprints, Working Papers, ...
Complete list of metadatas

Cited literature [11 references]  Display  Hide  Download

https://hal.inria.fr/hal-01436850
Contributor : Tyrex Equipe <>
Submitted on : Monday, January 16, 2017 - 5:31:24 PM
Last modification on : Thursday, July 9, 2020 - 9:44:51 AM
Document(s) archivé(s) le : Monday, April 17, 2017 - 4:34:28 PM

File

report-hap.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01436850, version 1

Collections

CNRS | INRIA | LIG | UGA

Citation

Damien Graux, Pierre Genevès, Nabil Layaïda. HAP: Building Pipelines with Heterogeneous Data and Hive. 2017. ⟨hal-01436850⟩

Share

Metrics

Record views

455

Files downloads

217