F. Agakov, E. Bonilla, J. Cavazos, B. Franke, G. Fursin et al., Using Machine Learning to Focus Iterative Optimization, International Symposium on Code Generation and Optimization (CGO'06), 2006.
DOI : 10.1109/CGO.2006.37

C. Bastoul, Code generation in the polyhedral model is easier than you think, Proceedings. 13th International Conference on Parallel Architecture and Compilation Techniques, 2004. PACT 2004., pp.7-16, 2004.
DOI : 10.1109/PACT.2004.1342537

URL : https://hal.archives-ouvertes.fr/hal-00017260

M. Benabderrahmane, L. Pouchet, A. Cohen, and C. Bastoul, The Polyhedral Model Is More Widely Applicable Than You Think, Intl. Conf. on Compiler Construction (ETAPS CC'10), pp.283-303, 2010.
DOI : 10.1007/978-3-642-11970-5_16

URL : https://hal.archives-ouvertes.fr/inria-00551087

U. Bondhugula, M. Baskaran, S. Krishnamoorthy, J. Ramanujam, A. Rountev et al., Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model, International conference on Compiler Construction (ETAPS CC), 2008.
DOI : 10.1007/978-3-540-78791-4_9

U. Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan, A practical automatic polyhedral program optimization system, ACM SIGPLAN Conference on Programming Language Design and Implementation, 2008.

R. R. Bouckaert, E. Frank, M. A. Hall, G. Holmes, B. Pfahringer et al., WEKA?experiences with a java opensource project, Journal of Machine Learning Research, vol.11, pp.2533-2541, 2010.

J. Cavazos, C. Dubach, F. Agakov, E. Bonilla, M. F. O-'boyle et al., Automatic performance model construction for the fast software exploration of new hardware designs, Proceedings of the 2006 international conference on Compilers, architecture and synthesis for embedded systems , CASES '06, 2006.
DOI : 10.1145/1176760.1176765

J. Cavazos, G. Fursin, F. V. Agakov, E. V. Bonilla, M. F. O-'boyle et al., Rapidly Selecting Good Compiler Optimizations using Performance Counters, International Symposium on Code Generation and Optimization (CGO'07), pp.185-197, 2007.
DOI : 10.1109/CGO.2007.32

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.76.4471

C. Chen, J. Chame, and M. Hall, CHiLL: A framework for composing high-level loop transformations, 2008.

K. D. Cooper, P. J. Schielke, and D. Subramanian, Optimizing for reduced code space using genetic algorithms, Workshop on Languages , Compilers, and Tools for Embedded Systems, pp.1-9, 1999.
DOI : 10.1145/314403.314414

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.13.1586

K. D. Cooper, D. Subramanian, and L. Torczon, Adaptive optimizing compilers for the 21st century, The Journal of Supercomputing, vol.23, issue.1, pp.7-22, 2002.
DOI : 10.1023/A:1015729001611

C. Dubach, J. Cavazos, B. Franke, M. O. Boyle, G. Fursin et al., Fast compiler optimisation evaluation using code-feature based performance prediction, Proceedings of the 4th international conference on Computing frontiers , CF '07, 2007.
DOI : 10.1145/1242531.1242553

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.184.8545

C. Dubach, T. M. Jones, E. V. Bonilla, G. Fursin, and M. F. O-'boyle, Portable compiler optimization across embedded programs and microarchitectures using machine learning, Proceedings of the IEEE/ACM International Symposium on Microarchitecture (MICRO), 2009.

P. Feautrier, Some efficient solutions to the affine scheduling problem. Part II. Multidimensional time, International Journal of Parallel Programming, vol.2, issue.4, pp.389-420, 1992.
DOI : 10.1007/BF01379404

B. Franke, M. O. 'boyle, J. Thomson, and G. Fursin, Probabilistic sourcelevel optimisation of embedded programs, Proceedings of the 2005 ACM SIGPLAN/SIGBED Conference on Languages, Compilers, and Tools for Embedded Systems, pp.78-86, 2005.

M. Frigo and S. G. Johnson, The Design and Implementation of FFTW3, special issue on " Program Generation, Optimization, and Platform Adaptation, pp.216-231, 2005.
DOI : 10.1109/JPROC.2004.840301

G. Fursin, C. Miranda, O. Temam, M. Namolaru, E. Yom-tov et al., MILEPOST GCC: machine learning based research compiler, Proceedings of the GCC Developers' Summit, 2008.
URL : https://hal.archives-ouvertes.fr/inria-00294704

S. Girbal, N. Vasilache, C. Bastoul, A. Cohen, D. Parello et al., Semi-Automatic Composition of Loop Transformations for Deep Parallelism and Memory Hierarchies, International Journal of Parallel Programming, vol.20, issue.1, pp.261-317, 2006.
DOI : 10.1007/s10766-006-0012-3

URL : https://hal.archives-ouvertes.fr/hal-01257288

M. Haneda, P. M. Knijnenburg, and H. A. Wijshoff, Automatic selection of compiler options using non-parametric inferential statistics, 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05), pp.123-132, 2005.
DOI : 10.1109/PACT.2005.9

F. Irigoin and R. Triolet, Supernode partitioning, Proceedings of the 15th ACM SIGPLAN-SIGACT symposium on Principles of programming languages , POPL '88, pp.319-329, 1988.
DOI : 10.1145/73560.73588

T. Kisuki, P. M. Knijnenburg, and M. F. O-'boyle, Combined selection of tile sizes and unroll factors using iterative compilation, Proceedings 2000 International Conference on Parallel Architectures and Compilation Techniques (Cat. No.PR00622), p.237, 2000.
DOI : 10.1109/PACT.2000.888348

P. Kulkarni, S. Hines, J. Hiser, D. Whalley, J. Davidson et al., Fast searches for effective optimization phase sequences, Proceedings of the ACM SIGPLAN '04 Conference on Programming Language Design and Implementation, pp.171-182, 2004.

A. W. Lim and M. S. Lam, Maximizing parallelism and minimizing synchronization with affine transforms, Proceedings of the 24th ACM SIGPLAN-SIGACT symposium on Principles of programming languages , POPL '97, pp.201-214, 1997.
DOI : 10.1145/263699.263719

S. Long and G. Fursin, A heuristic search algorithm based on unified transformation framework, Proc. of the 2005 Intl. Conf. on Parallel Processing Workshops (ICPPW'05), pp.137-144, 2005.

A. Monsifrot, F. Bodin, and R. Quiniou, A Machine Learning Approach to Automatic Production of Compiler Heuristics, AIMSA '02: Proc. of the 10th Intl. Conf. on Artificial Intelligence: Methodology, Systems, and Applications, pp.41-50, 2002.
DOI : 10.1007/3-540-46148-5_5

P. Mucci, Papi ? the performance application programming interface, 2000.

M. Namolaru, A. Cohen, G. Fursin, A. Zaks, and A. Freund, Practical aggregation of semantical program properties for machine learning based optimization, Proceedings of the 2010 international conference on Compilers, architectures and synthesis for embedded systems, CASES '10, 2010.
DOI : 10.1145/1878921.1878951

URL : https://hal.archives-ouvertes.fr/inria-00551512

D. Parello, O. Temam, A. Cohen, and J. Verdun, Towards a Systematic, Pragmatic and Architecture-Aware Program Optimization Process for Complex Processors, Proceedings of the ACM/IEEE SC2004 Conference, p.15, 2004.
DOI : 10.1109/SC.2004.61

URL : https://hal.archives-ouvertes.fr/hal-01257302

L. Pouchet, C. Bastoul, A. Cohen, and J. Cavazos, Iterative optimization in the polyhedral model: Part II, multidimensional time, ACM SIGPLAN Conf. on Programming Language Design and Implementation (PLDI'08), pp.90-100, 2008.
URL : https://hal.archives-ouvertes.fr/hal-01257273

L. Pouchet, C. Bastoul, A. Cohen, and N. Vasilache, Iterative Optimization in the Polyhedral Model: Part I, One-Dimensional Time, International Symposium on Code Generation and Optimization (CGO'07)
DOI : 10.1109/CGO.2007.21

URL : https://hal.archives-ouvertes.fr/hal-01257281

L. Pouchet, U. Bondhugula, C. Bastoul, A. Cohen, J. Ramanujam et al., Combined Iterative and Model-driven Optimization in an Automatic Parallelization Framework, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, 2010.
DOI : 10.1109/SC.2010.14

URL : https://hal.archives-ouvertes.fr/inria-00551067

N. Johnson and . Rizzolo, Spiral: Code generation for dsp transforms, special issue on " Program Generation, Optimization, and Platform Adaptation, pp.232-275, 2005.

J. Ramanujam and P. Sadayappan, Tiling multidimensional iteration spaces for multicomputers, Journal of Parallel and Distributed Computing, vol.16, issue.2, pp.108-230, 1992.
DOI : 10.1016/0743-7315(92)90027-K

A. Tiwari, C. Chen, J. Chame, M. Hall, and J. K. Hollingsworth, A scalable auto-tuning framework for compiler optimization, 2009 IEEE International Symposium on Parallel & Distributed Processing, pp.1-12, 2009.
DOI : 10.1109/IPDPS.2009.5161054

K. Trifunovic, D. Nuzman, A. Cohen, A. Zaks, and I. Rosen, Polyhedralmodel guided loop-nest auto-vectorization, Intl. Conf. on Parallel Architectures and Compilation Techniques (PACT'09), 2009.
URL : https://hal.archives-ouvertes.fr/hal-00645325

Y. Voronenko, F. De-mesmay, and M. Püschel, Computer Generation of General Size Linear Transform Libraries, 2009 International Symposium on Code Generation and Optimization, pp.102-113, 2009.
DOI : 10.1109/CGO.2009.33

R. Vuduc, J. W. Demmel, and J. A. Bilmes, Statistical Models for Empirical Search-Based Performance Tuning, International Journal of High Performance Computing Applications, vol.18, issue.1, pp.65-94, 2004.
DOI : 10.1177/1094342004041293

R. C. Whaley and J. J. Dongarra, Automatically Tuned Linear Algebra Software, Proceedings of the IEEE/ACM SC98 Conference, pp.1-27, 1998.
DOI : 10.1109/SC.1998.10004

R. C. Whaley, A. Petitet, and J. J. Dongarra, Automated empirical optimizations of software and the atlas project, Parallel Computing, 2000.

M. Wolfe, More iteration space tiling, Proceedings of the 1989 ACM/IEEE conference on Supercomputing , Supercomputing '89, pp.655-664, 1989.
DOI : 10.1145/76263.76337

K. Yotov, X. Li, G. Ren, M. Cibulskis, G. Dejong et al., A comparison of empirical and model-driven optimization, ACM SIGPLAN Conf. on Programming Language Design and Implementation (PLDI'03), 2003.