J. Curley, HPC and Big Data, Innovation, vol.12, issue.3, 2014.

C. Silvano, G. Agosta, A. Bartolini, A. Beccari, L. Benini et al., AutoTuning and Adaptivity appRoach for Energy Efficient eXascale HPC Systems: The ANTAREX Approach, Proceedings of the 2016 Design, Automation & Test in Europe Conference & Exhibition (DATE), pp.343-346, 2015.
DOI : 10.3850/9783981537079_1012

URL : https://hal.archives-ouvertes.fr/hal-01235741

C. Silvano, G. Agosta, S. Cherubin, D. Gadioli, G. Palermo et al., The antarex approach to autotuning and adaptivity for energy efficient hpc systems, Proceedings of the ACM International Conference on Computing Frontiers, pp.288-293, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01341826

G. Chrysos, Intel® Xeon Phi TM Coprocessor-the Architecture, Intel Whitepaper, 2014.

J. M. Cardoso, T. Carvalho, J. G. Coutinho, W. Luk, R. Nobre et al., LARA, Proceedings of the 11th annual international conference on Aspect-oriented Software Development, AOSD '12, pp.179-190, 2012.
DOI : 10.1145/2162049.2162071

J. M. Cardoso, J. G. Coutinho, T. Carvalho, P. C. Diniz, Z. Petrov et al., Performancedriven instrumentation and mapping strategies using the LARA aspect-oriented programming approach, Software: Practice and Experience, 2014.

J. Irwin, G. Kickzales, J. Lamping, A. Mendhekar, C. Maeda et al., Aspectoriented Programming, ECOOP'97 ? Object- Oriented Programming, ser. Lecture Notes in Computer Science, pp.220-242, 1997.

J. M. Cardoso, T. Carvalho, J. G. Coutinho, R. Nobre, R. Nane et al., Controlling a complete hardware synthesis toolchain with LARA aspects, Microprocessors and Microsystems, vol.37, issue.8, pp.1073-1089, 2013.
DOI : 10.1016/j.micpro.2013.06.001

R. Nobre, L. G. Martins, and J. M. Cardoso, Use of Previously Acquired Positioning of Optimizations for Phase Ordering Exploration, Proceedings of the 18th International Workshop on Software and Compilers for Embedded Systems, SCOPES '15, pp.58-67, 2015.
DOI : 10.1145/1353445.1353451

G. Agosta, A. Barenghi, G. Pelosi, and M. Scandale, Towards Transparently Tackling Functionality and Performance Issues across Different OpenCL Platforms, 2014 Second International Symposium on Computing and Networking, pp.130-136, 2014.
DOI : 10.1109/CANDAR.2014.53

G. Agosta, A. Barenghi, A. D. Federico, and G. Pelosi, OpenCL performance portability for general-purpose computation on graphics processor units: an exploration on cryptographic primitives, Concurrency and Computation: Practice and Experience, 2014.
DOI : 10.1007/978-3-642-24712-5_16

F. Bodin, T. Kisuki, P. Knijnenburg, M. O. Boyle, and E. Rohou, Iterative compilation in a non-linear optimisation space, 1998.
URL : https://hal.archives-ouvertes.fr/inria-00475919

A. Cohen and E. Rohou, Processor virtualization and split compilation for heterogeneous multicore embedded systems, Proceedings of the 47th Design Automation Conference on, DAC '10, pp.102-107, 2010.
DOI : 10.1145/1837274.1837303

URL : https://hal.archives-ouvertes.fr/inria-00472274

A. Suresh, B. N. Swamy, E. Rohou, and A. Seznec, Intercepting Functions for Memoization, ACM Transactions on Architecture and Code Optimization, vol.12, issue.2, pp.1-1823, 2015.
DOI : 10.1145/1346281.1346300

URL : https://hal.archives-ouvertes.fr/hal-01178085

D. Michie, ???Memo??? Functions and Machine Learning, Nature, vol.115, issue.5136, pp.19-22, 1968.
DOI : 10.1038/218019a0

E. Paone, D. Gadioli, G. Palermo, V. Zaccaria, and C. Silvano, Evaluating orthogonality between application auto-tuning and run-time resource management for adaptive OpenCL applications, 2014 IEEE 25th International Conference on Application-Specific Systems, Architectures and Processors, pp.161-168, 2014.
DOI : 10.1109/ASAP.2014.6868651

E. Paone, F. Robino, G. Palermo, V. Zaccaria, I. Sander et al., Customization of OpenCL Applications for Efficient Task Mapping under Heterogeneous Platform Constraints, Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015, pp.736-741, 2015.
DOI : 10.7873/DATE.2015.0125

B. Subramaniam, W. Saunders, T. Scogland, and W. Feng, Trends in energy-efficient computing: A perspective from the Green500, 2013 International Green Computing Conference Proceedings, 2013.
DOI : 10.1109/IGCC.2013.6604520

F. Fraternali, A. Bartolini, C. Cavazzoni, G. Tecchiolli, and L. Benini, Quantifying the impact of variability on the energy efficiency for a next-generation ultra-green supercomputer, Proceedings of the 2014 international symposium on Low power electronics and design, ISLPED '14, pp.295-298, 2014.
DOI : 10.1145/2627369.2627659

A. Auweter, A. Bode, M. Brehm, L. Brochard, N. Hammer et al., A Case Study of Energy Aware Scheduling on SuperMUC, pp.394-409, 2014.
DOI : 10.1007/978-3-319-07518-1_25

A. Borghesi, C. Conficoni, M. Lombardi, and A. Bartolini, MS3: A Mediterranean-stile job scheduler for supercomputers - do less when it's too hot!, 2015 International Conference on High Performance Computing & Simulation (HPCS), p.2015
DOI : 10.1109/HPCSim.2015.7237025

F. Beneventi, A. Bartolini, C. Cavazzoni, and L. Benini, Continuous learning of HPC infrastructure models using big data analytics and in-memory processing tools, Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017, pp.1038-1043, 2017.
DOI : 10.23919/DATE.2017.7927143

G. Rudy, M. M. Khan, M. Hall, C. Chen, and J. Chame, A Programming Language Interface to Describe Transformations and Code Generation, International Workshop on Languages and Compilers for Parallel Computing, 2010.
DOI : 10.1145/1809028.1806606

Q. Yi, POET: a scripting language for applying parameterized source-to-source program transformations, Software: Practice and Experience, vol.5, issue.1, 2012.
DOI : 10.1109/99.660313

K. S. Namjoshi and N. Singhania, Loopy: Programmable and Formally Verified Loop Transformations, International Static Analysis Symposium, pp.383-402, 2016.
DOI : 10.1002/spe.1089

A. Qasem, G. Jin, and J. Mellor-crummey, Improving performance with integrated program transformations, 2003.

S. Donadio, J. Brodman, T. Roeder, K. Yotov, D. Barthou et al., A Language for the Compact Representation of Multiple Program Versions, Proceedings of the 18th International Conference on Languages and Compilers for Parallel Computing, ser. LCPC'05, pp.136-151, 2006.
DOI : 10.1007/978-3-540-69330-7_10

URL : https://hal.archives-ouvertes.fr/hal-00141067

A. Hartono, B. Norris, and P. Sadayappan, Annotationbased empirical performance tuning using Orio, IEEE International Symposium on Parallel Distributed Processing, pp.1-11, 2009.

D. Mustafa and R. Eigenmann, Portable Section-level Tuning of Compiler Parallelized Applications Storage and Analysis, ser. SC '12, Proceedings of the International Conference on High Performance Computing, Networking, 2012.

W. Wang, J. Cavazos, and A. Porterfield, Energy Autotuning using the Polyhedral Approach, Proceedings of the 4th International Workshop on Polyhedral Compilation Techniques, S. Rajopadhye and S. Verdoolaege, 2014.

A. Tiwari, M. A. Laurenzano, L. Carrington, and A. Snavely, Auto-tuning for Energy Usage in Scientific Applications, pp.178-187, 2011.
DOI : 10.1007/978-3-642-29740-3_21

M. Tillmann, T. Karcher, C. Dachsbacher, and W. F. Tichy, Application-independent autotuning for gpus, PARCO, 2013, pp.626-635

X. Sui, A. Lenharth, D. S. Fussell, and K. Pingali, Proactive control of approximate programs, ACM SIGOPS Operating Systems Review, vol.50, issue.2, 2016.

H. Hoffmann, S. Sidiroglou, M. Carbin, S. Misailovic, A. Agarwal et al., Dynamic knobs for responsive power-aware computing, ACM SIGPLAN Notices, 2011.

P. G. Kjeldsberg, A. Gocht, M. Gerndt, L. Riha, J. Schuchart et al., READEX: Linking two ends of the computing continuum to improve energy-efficiency in dynamic applications, Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017, pp.2017-109, 2017.
DOI : 10.23919/DATE.2017.7926967

A. Hendricks, T. Heller, H. Jordan, P. Thoman, T. Fahringer et al., The allscale runtime interface: Theoretical foundation and concept Available: https, Proceedings of the 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers, ser. MTAGS '16, pp.13-19, 2016.

H. Jordan, P. Thoman, P. Zangerl, T. Heller, and T. Fahringer, A Context-Aware Primitive for Nested Recursive Parallelism, IWMSE Workshop, ser. Euro-Par 2016, 2016.
DOI : 10.1007/978-3-662-48096-0_19