Preprints, Working Papers, ... Year: 2017

HAP: Building Pipelines with Heterogeneous Data and Hive

Damien Graux
Pierre Genevès
Nabil Layaïda

Abstract

The growing number of available datasets creates opportunities to build large, complex applications that aggregate results from several sources. These emerging use cases require new systems in which combining heterogeneous sources is both possible and efficient. To tackle these challenges, we provide a simple, high-level set of primitives, called HAP, for describing processing chains easily. These descriptions are then compiled into optimized SQL queries executed by Hive.
Main file
report-hap.pdf (404.44 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01436850, version 1 (16-01-2017)

Identifiers

  • HAL Id: hal-01436850, version 1

Cite

Damien Graux, Pierre Genevès, Nabil Layaïda. HAP: Building Pipelines with Heterogeneous Data and Hive. 2017. ⟨hal-01436850⟩