, and orange headers. DLMT found similar speedups using smaller budgets for kernels marked with, Cost of best points found on each run, and the iteration where they were found, vol.4
A comparison of search heuristics for empirical code optimization, CLUSTER, pp.421-429, 2008. ,
Combined selection of tile sizes and unroll factors using iterative compilation, The Journal of Supercomputing, vol.24, issue.1, pp.43-67, 2003. ,
Can search algorithms save large-scale automatic performance tuning?" in ICCS, pp.2136-2145, 2011. ,
, An experimental study of global and local search algorithms in empirical performance tuning, International Conference on High Performance Computing for Computational Science, pp.261-269, 2012.
Apollo: Reusable models for fast, dynamic tuning of input-dependent code, The 31th IEEE International Parallel and Distributed Processing Symposium, 2017. ,
Machine learning-based auto-tuning for enhanced performance portability of opencl applications, Concurrency and Computation: Practice and Experience, vol.29, issue.8, 2017. ,
AutoMOMML: Automatic Multi-objective Modeling with Machine Learning, High Performance Computing: 31st International Conference, ISC High Performance, pp.219-239, 2016. ,
SPAPT: Search problems in automatic performance tuning, Procedia Computer Science, vol.9, pp.1959-1968, 2012. ,
Annotation-based empirical performance tuning using Orio, Parallel & Distributed Processing, pp.1-11, 2009. ,
BOAST: A metaprogramming framework to produce portable and efficient computing kernels for hpc applications, The International Journal of High Performance Computing Applications, p.1094342017718068, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01620778
A scalable auto-tuning framework for compiler optimization, Parallel & Distributed Processing, pp.1-12, 2009. ,
POET: Parameterized optimizations for empirical tuning, Parallel and Distributed Processing Symposium, pp.1-8, 2007. ,
PetaBricks: a language and compiler for algorithmic choice, vol.44, 2009. ,
The algorithm selection problem, Advances in Computers 15, pp.65-118, 1976. ,
Optimizing matrix multiply using PHiPAC: a portable, high-performance, ansi c coding methodology, Proceedings of International Conference on Supercomputing, 1997. ,
Automatically tuned linear algebra software (ATLAS), Proceedings of SC, vol.98, 1998. ,
OSKI: A library of automatically tuned sparse matrix kernels, Journal of Physics: Conference Series, vol.16, p.521, 2005. ,
FFTW: An adaptive software architecture for the fft, Proceedings of the 1998 IEEE International Conference on, vol.3, pp.1381-1384, 1998. ,
Automatic performance analysis with periscope, Concurrency and Computation: Practice and Experience, vol.22, issue.6, pp.736-748, 2010. ,
A multi-objective auto-tuning framework for parallel codes, High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for, pp.1-12, 2012. ,
ParamILS: an automatic algorithm configuration framework, Journal of Artificial Intelligence Research, vol.36, issue.1, pp.267-306, 2009. ,
Opentuner: An extensible framework for program autotuning, Proceedings of the 23rd international conference on Parallel architectures and compilation, pp.303-316, 2014. ,
, lhs: Latin Hypercube Samples, 2018.
The design of optimum multifactorial experiments, Biometrika, vol.33, issue.4, pp.305-325, 1946. ,
R package FrF2 for creating and analyzing fractional factorial 2-level designs, Journal of Statistical Software, vol.56, issue.1, pp.1-56, 2014. ,
, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, 2018.
Theory of optimal experiments, 1972. ,
AlgDesign: Algorithmic Experimental Design, 2014. ,
Some main-effect plans and orthogonal arrays of strength two, The Annals of Mathematical Statistics, pp.1167-1176, 1961. ,
An algorithm for generating good mixed level factorial designs, Beuth University of Applied Sciences, 2018. ,
An R Companion to Applied Regression, Sage, 2011. ,
Adding virtualization capabilities to the Grid'5000 testbed, Cloud Computing and Services Science, ser. Communications in Computer and Information Science, vol.367, pp.3-20, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00946971
Git repository with all scripts and data ,
Autotuning high-level synthesis for fpgas using opentuner and legup, International Conference on Reconfigurable Computing and FPGAs (ReConFig, 2017. ,