Reuse-based Optimization for Pig Latin

Abstract: Pig Latin is a popular language widely used for parallel processing of massive data sets. Currently, subexpressions that occur repeatedly in Pig Latin scripts are executed as many times as they appear, and the current Pig Latin optimizer does not identify reuse opportunities. We present a novel optimization approach that identifies and reuses repeated subexpressions in Pig Latin scripts. Our optimization algorithm, named PigReuse, identifies subexpression merging opportunities, selects the best ones to execute based on a cost function, and reuses their results as needed in order to compute exactly the same output as the original scripts. Our experiments demonstrate the effectiveness of our approach.
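The general idea of detecting and reusing repeated subexpressions can be illustrated with a minimal Python sketch. This is not the PigReuse algorithm from the paper: the tuple encoding of operators, the function names, and the sample plans are all hypothetical, matching is purely syntactic, and the cost-based selection step mentioned in the abstract is left out entirely.

```python
from collections import defaultdict

# Hypothetical encoding (an assumption, not the paper's representation):
# each operator is a tuple (name, argument, child, child, ...).
# A LOAD operator is a leaf and has no children.


def find_repeated_subexpressions(plans):
    """Return every subexpression that occurs more than once, with its count."""
    counts = defaultdict(int)

    def visit(node):
        counts[node] += 1
        for child in node[2:]:
            visit(child)

    for plan in plans:
        visit(plan)
    return {expr: n for expr, n in counts.items() if n > 1}


def evaluate_with_reuse(plans, run_operator):
    """Evaluate all plans, running each distinct subexpression only once."""
    cache = {}

    def evaluate(node):
        if node not in cache:
            inputs = [evaluate(child) for child in node[2:]]
            cache[node] = run_operator(node[0], node[1], inputs)
        return cache[node]

    return [evaluate(plan) for plan in plans]


# Two illustrative plans that share the same LOAD and FILTER steps.
load = ("LOAD", "visits.csv")
filtered = ("FILTER", "year == 2016", load)
plan_a = ("GROUP", "user", filtered)
plan_b = ("DISTINCT", None, filtered)

repeated = find_repeated_subexpressions([plan_a, plan_b])

calls = []  # records which operators were actually executed


def run_operator(name, arg, inputs):
    # Stand-in executor: logs the call and returns a symbolic result.
    calls.append(name)
    return (name, arg, tuple(inputs))


results = evaluate_with_reuse([plan_a, plan_b], run_operator)
```

In this toy setting the shared LOAD and FILTER steps run once instead of twice, so four operator calls replace six, while both plans still produce the same results they would have produced independently.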
Document type:
Conference paper
25th ACM International on Conference on Information and Knowledge Management, Oct 2016, Indianapolis, United States. ACM International Conference on Information and Knowledge Management (CIKM), pp.2215 - 2220, 2016, Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. 〈http://www.cikm2016.org/〉. 〈10.1145/2983323.2983669〉

https://hal.inria.fr/hal-01425321
Contributor: Ioana Manolescu
Submitted on: Tuesday, January 3, 2017 - 15:03:30
Last modified on: Thursday, April 12, 2018 - 01:50:21
Document(s) archived on: Tuesday, April 4, 2017 - 13:52:47

Files

paper-forHal.pdf
Files produced by the author(s)

Citation

Jesús Camacho-Rodríguez, Dario Colazzo, Melanie Herschel, Ioana Manolescu, Soudip Roy Chowdhury. Reuse-based Optimization for Pig Latin. 25th ACM International on Conference on Information and Knowledge Management, Oct 2016, Indianapolis, United States. ACM International Conference on Information and Knowledge Management (CIKM), pp.2215 - 2220, 2016, Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. 〈http://www.cikm2016.org/〉. 〈10.1145/2983323.2983669〉. 〈hal-01425321〉
