Reuse-based Optimization for Pig Latin - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Reuse-based Optimization for Pig Latin

Résumé

Pig Latin is a popular language which is widely used for parallel processing of massive data sets. Currently, subexpres-sions occurring repeatedly in Pig Latin scripts are executed as many times as they appear, and the current Pig Latin optimizer does not identify reuse opportunities. We present a novel optimization approach aiming at identifying and reusing repeated subexpressions in Pig Latin scripts. Our optimization algorithm, named PigReuse, identifies subexpression merging opportunities, selects the best ones to execute based on a cost function, and reuses their results as needed in order to compute exactly the same output as the original scripts. Our experiments demonstrate the effectiveness of our approach.
Fichier principal
Vignette du fichier
paper-forHal.pdf (661.56 Ko) Télécharger le fichier
paper.pdf (688.55 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01425321 , version 1 (03-01-2017)

Identifiants

Citer

Jesús Camacho-Rodríguez, Dario Colazzo, Melanie Herschel, Ioana Manolescu, Soudip Roy Chowdhury. Reuse-based Optimization for Pig Latin. 25th ACM International on Conference on Information and Knowledge Management, Oct 2016, Indianapolis, United States. pp.2215 - 2220, ⟨10.1145/2983323.2983669⟩. ⟨hal-01425321⟩
408 Consultations
427 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More