B. Aarts, M. Barreteau, F. Bodin, P. Brinkhaus, Z. Chamski et al., OCEANS: Optimizing compilers for embedded applications, Proc. Euro-Par 97, pp.1351-1356, 1997.
DOI : 10.1007/BFb0002894

F. Agakov, E. Bonilla, J. Cavazos, B. Franke, G. Fursin et al., Using Machine Learning to Focus Iterative Optimization, International Symposium on Code Generation and Optimization (CGO'06), 2006.
DOI : 10.1109/CGO.2006.37

K. Asanovic, R. Bodik, B. C. Catanzaro, J. J. Gebis, P. Husbands et al., The landscape of parallel computing research: a view from Berkeley, 2006.

S. Benedict, V. Petkov, and M. Gerndt, PERISCOPE: An Online-Based Distributed Performance Analysis Tool, pp.1-16, 2010.
DOI : 10.1007/978-3-642-11261-4_1

F. Bodin, T. Kisuki, P. Knijnenburg, M. O. Boyle, and E. Rohou, Iterative compilation in a non-linear optimisation space, Proceedings of the Workshop on Profile and Feedback Directed Compilation, 1998.
URL : https://hal.archives-ouvertes.fr/inria-00475919

J. Cavazos, G. Fursin, F. Agakov, E. Bonilla, M. O. Boyle et al., Rapidly Selecting Good Compiler Optimizations using Performance Counters, International Symposium on Code Generation and Optimization (CGO'07), 2007.
DOI : 10.1109/CGO.2007.32

Y. Chen, L. Eeckhout, G. Fursin, L. Peng, O. Temam et al., Evaluating iterative optimization across 1000 data sets, Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI, 2010.

K. Cooper, P. Schielke, and D. Subramanian, Optimizing for reduced code space using genetic algorithms, Proceedings of the Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES), pp.1-9, 1999.

K. Cooper, D. Subramanian, and L. Torczon, Adaptive optimizing compilers for the 21st century, Journal of Supercomputing, vol.23, issue.1, 2002.

J. Dongarra, The International Exascale Software Project roadmap, International Journal of High Performance Computing Applications, vol.25, issue.1, pp.3-60, 2011.
DOI : 10.1177/1094342010391989

B. Franke, M. O. 'boyle, J. Thomson, and G. Fursin, Probabilistic source-level optimisation of embedded programs, Proceedings of the Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES), 2005.

G. Fursin, Iterative Compilation and Performance Prediction for Numerical Applications, 2004.

G. Fursin, HiPEAC thematic session at ACM FCRC'13: Making computer engineering a science. http://www.hipeac.net/thematic-session/making-computer-engineering-science, 2013.

G. Fursin, J. Cavazos, M. O. Boyle, and O. Temam, MiDataSets: Creating the Conditions for a More Realistic Evaluation of Iterative Optimization, Proceedings of the International Conference on High Performance Embedded Architectures & Compilers, 2007.
DOI : 10.1007/978-3-540-69338-3_17

G. Fursin, A. Cohen, M. O. Boyle, and O. Temam, A Practical Method for Quickly Evaluating Program Optimizations, Proceedings of the International Conference on High Performance Embedded Architectures & Compilers, pp.29-46, 2005.
DOI : 10.1007/11587514_4

URL : https://hal.archives-ouvertes.fr/inria-00001054

G. Fursin, Y. Kashnikov, A. W. Memon, Z. Chamski, O. Temam et al., Milepost GCC: Machine Learning Enabled Self-tuning Compiler, International Journal of Parallel Programming, vol.16, issue.2???3, pp.296-327, 2011.
DOI : 10.1007/s10766-010-0161-2

URL : https://hal.archives-ouvertes.fr/hal-00685276

G. Fursin, M. O. Boyle, and P. Knijnenburg, Evaluating Iterative Compilation, Proceedings of the Workshop on Languages and Compilers for Parallel Computers (LCPC), pp.305-315, 2002.
DOI : 10.1007/11596110_24

G. Fursin, M. O. Boyle, O. Temam, and G. Watts, Fast and accurate method for determining a lower bound on execution time. Concurrency: Practice and Experience, pp.271-292, 2004.

G. Fursin, M. O. Boyle, O. Temam, and G. Watts, Fast and accurate method for determining a lower bound on execution time. Concurrency: Practice and Experience, pp.271-292, 2004.

G. Fursin, M. F. O-'boyle, O. Temam, and G. Watts, A fast and accurate method for determining a lower bound on execution time, Concurrency and Computation: Practice and Experience, vol.16, issue.23, pp.271-292, 2004.
DOI : 10.1002/cpe.774

G. Fursin and O. Temam, Collective optimization, ACM Transactions on Architecture and Code Optimization, vol.7, issue.4, pp.1-2029, 2010.
DOI : 10.1145/1880043.1880047

URL : https://hal.archives-ouvertes.fr/inria-00445326

M. Geimer, F. Wolf, B. J. Wylie, E. Abrahám, D. Becker et al., The Scalasca performance toolset architecture, Concurrency and Computation: Practice and Experience, vol.16, issue.2-3, pp.702-719, 2010.
DOI : 10.1002/cpe.1556

T. Hey, S. Tansley, and K. M. Tolle, The Fourth Paradigm ??? Data-Intensive Scientific Discovery, 2009.
DOI : 10.1007/978-3-642-33299-9_1

K. Hoste and L. Eeckhout, Cole, Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization , CGO '08, 2008.
DOI : 10.1145/1356058.1356080

V. Jimenez, I. Gelado, L. Vilanova, M. Gil, G. Fursin et al., Predictive Runtime Code Scheduling for Heterogeneous Architectures, Proceedings of the International Conference on High Performance Embedded Architectures & Compilers, 2009.
DOI : 10.1007/978-3-540-92990-1_4

URL : https://hal.archives-ouvertes.fr/inria-00445304

T. Kisuki, P. Knijnenburg, and M. O. Boyle, Combined selection of tile sizes and unroll factors using iterative compilation, Proceedings 2000 International Conference on Parallel Architectures and Compilation Techniques (Cat. No.PR00622), pp.237-246, 2000.
DOI : 10.1109/PACT.2000.888348

D. Kuck, Computational capacity-based codesign of computer systems. High-Performance Scientific Computing, 2013.

P. Kulkarni, W. Zhao, H. Moon, K. Cho, D. Whalley et al., Finding effective optimization phase sequences, Proceedings of the Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES), pp.12-23, 2003.

L. Luo, Y. Chen, C. Wu, S. Long, and G. Fursin, Finding representative sets of optimizations for adaptive multiversioning applications, 3rd Workshop on Statistical and Machine Learning Approaches Applied to Architectures and Compilation (SMART'09), 2009.
URL : https://hal.archives-ouvertes.fr/inria-00436034

F. Matteo and S. Johnson, FFTW: An adaptive software architecture for the FFT, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.1381-1384, 1998.

A. Monsifrot, F. Bodin, and R. Quiniou, A Machine Learning Approach to Automatic Production of Compiler Heuristics, Proceedings of the International Conference on Artificial Intelligence: Methodology, Systems, Applications, LNCS 2443, pp.41-50, 2002.
DOI : 10.1007/3-540-46148-5_5

Z. Pan and R. Eigenmann, Rating compiler optimizations for automatic performance tuning, Proceedings of the International Conference on Supercomputing, 2004.

Z. Pan and R. Eigenmann, Fast and effective orchestration of compiler optimizations for automatic performance tuning, Proceedings of the International Symposium on Code Generation and Optimization (CGO), pp.319-332, 2006.

S. S. Shende and A. D. Malony, The Tau Parallel Performance System, International Journal of High Performance Computing Applications, vol.20, issue.2, pp.287-311, 2006.
DOI : 10.1177/1094342006064482

B. Singer and M. Veloso, Learning to predict performance from formula modeling and training data, Proceedings of the Conference on Machine Learning, 2000.

M. Stephenson, S. Amarasinghe, M. Martin, and U. Reilly, Meta optimization: Improving compiler heuristics with machine learning, Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI'03), pp.77-90, 2003.

M. Tartara and S. Crespi-reghizzi, Continuous learning of compiler heuristics, ACM Transactions on Architecture and Code Optimization, vol.9, issue.4, p.46, 2013.
DOI : 10.1145/2400682.2400705

S. Triantafyllis, M. Vachharajani, N. Vachharajani, and D. August, Compiler optimizationspace exploration, Proceedings of the International Symposium on Code Generation and Optimization (CGO), pp.204-215, 2003.

R. Whaley and J. Dongarra, Automatically Tuned Linear Algebra Software, Proceedings of the IEEE/ACM SC98 Conference, 1998.
DOI : 10.1109/SC.1998.10004

S. Williams, A. Waterman, and D. Patterson, Roofline, Communications of the ACM, vol.52, issue.4, pp.65-76, 2009.
DOI : 10.1145/1498765.1498785

S. Zuckerman, J. Suetterlein, R. Knauerhase, and G. R. Gao, Using a "codelet" program execution model for exascale machines, Proceedings of the 1st International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era, EXADAPT '11, pp.64-69, 2011.
DOI : 10.1145/2000417.2000424