Skip to Main content Skip to Navigation
Journal articles

UDAO: A Next-Generation Unified Data Analytics Optimizer

Khaled Zaouk 1 Fei Song 1 Chenghao Lyu 2 Arnab Sinha 1 Yanlei Diao 1, 2 Prashant Shenoy 2
1 CEDAR - Rich Data Analytics at Cloud Scale
LIX - Laboratoire d'informatique de l'École polytechnique [Palaiseau], Inria Saclay - Ile de France
Abstract : Big data analytics systems today still lack the ability to take user performance goals and budgetary constraints, collectively referred to as "objectives", and automatically configure an analytic job to achieve the objectives. This paper presents UDAO, a unified data analytics optimizer that can automatically determine the parameters of the runtime system, collectively called a job configuration, for general dataflow programs based on user objectives. UDAO embodies key techniques including in-situ modeling, which learns a model for each user objective in the same computing environment as the job is run, and multi-objective optimization, which computes a Pareto optimal set of job configurations to reveal tradeoffs between different objectives. Using benchmarks developed based on industry needs, our demonstration will allow the user to explore (1) learned models to gain insights into how various parameters affect user objectives; (2) Pareto frontiers to understand interesting tradeoffs between different objectives and how a configuration recommended by the optimizer explores these tradeoffs; (3) end-to-end benefits that UDAO can provide over default configurations or those manually tuned by engineers.
Document type :
Journal articles
Complete list of metadata

Cited literature [8 references]  Display  Hide  Download
Contributor : Khaled Zaouk Connect in order to contact the contributor
Submitted on : Tuesday, August 27, 2019 - 6:53:15 PM
Last modification on : Friday, April 30, 2021 - 9:55:21 AM
Long-term archiving on: : Thursday, January 9, 2020 - 10:06:20 PM


Files produced by the author(s)



Khaled Zaouk, Fei Song, Chenghao Lyu, Arnab Sinha, Yanlei Diao, et al.. UDAO: A Next-Generation Unified Data Analytics Optimizer. Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2019, 12 (12), ⟨10.14778/3352063.3352103⟩. ⟨hal-02267180⟩



Les métriques sont temporairement indisponibles