CLooG: This version may be installed using the following commands:

$ git clone https://github.com/periscop/cloog.git
$ cd cloog
$ git checkout cloog-0.18.3
$ ./get_submodules.sh
$ ./autogen.sh
$ ./configure --with-osl=system --with-osl-prefix=/usr/local
$ make
$ sudo make install

IBB Installation: Once downloaded and unpacked, IBB can be compiled by typing the following command in IBB's root directory:

$ make

R. T. Mullapudi, V. Vasista, and U. Bondhugula, PolyMage: Automatic Optimization for Image Processing Pipelines, Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS '15, pp.429-443, 2015.

J. Ragan-Kelley, C. Barnes, A. Adams, S. Paris, F. Durand et al., Halide: A Language and Compiler for Optimizing Parallelism, Locality, and Recomputation in Image Processing Pipelines, Proceedings of the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI '13, pp.519-530, 2013.

Y. Tang, R. A. Chowdhury, B. C. Kuszmaul, C. Luk, and C. E. Leiserson, The Pochoir stencil compiler, Proceedings of the 23rd ACM symposium on Parallelism in algorithms and architectures, SPAA '11, pp.117-128, 2011.
DOI : 10.1145/1989493.1989508

C. Lengauer, S. Apel, M. Bolten, A. Größlinger, F. Hannig et al., ExaStencils: Advanced Stencil-Code Engineering, Euro-Par 2014: Parallel Processing Workshops, Lecture Notes in Computer Science, pp.553-564, 2014.

C. Bastoul, OpenScop: A Specification and a Library for Data Exchange in Polyhedral Compilation Tools, tech. rep, vol.2, issue.109, pp.98-170, 2011.

P. Feautrier and C. Lengauer, Polyhedron model, Encyclopedia of Parallel Computing, pp.1581-1592, 2011.

C. Bastoul, Extracting polyhedral representation from high level languages, tech. rep, pp.2-39, 2008.

C. Bastoul, Clay: the chunky loop alteration wizardry, tech. rep, pp.2-41

C. Bastoul, Code generation in the polyhedral model is easier than you think, Proceedings. 13th International Conference on Parallel Architecture and Compilation Techniques, 2004. PACT 2004., pp.7-16, 2004.
DOI : 10.1109/PACT.2004.1342537

URL : https://hal.archives-ouvertes.fr/hal-00017260

N. Vasilache, C. Bastoul, and A. Cohen, Polyhedral Code Generation in the Real World, Lecture Notes in Computer Science, vol.3923, pp.185-201, 2006.
DOI : 10.1007/11688839_16

URL : https://hal.archives-ouvertes.fr/inria-00001106

C. Bastoul, Generating loops for scanning polyhedra, tech. rep., PRiSM, vol.2, issue.5, pp.39-42, 2002.

C. Bastoul and L. Pouchet, Candl: the chunky analyzer for dependences in loops, tech. rep, vol.2, pp.42-98

O. Zinenko, S. Huot, and C. Bastoul, Clint: A direct manipulation tool for parallelizing compute-intensive program parts, 2014 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC), pp.109-112, 2014.
DOI : 10.1109/VLHCC.2014.6883031

URL : https://hal.archives-ouvertes.fr/hal-01055788

U. Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan, A practical automatic polyhedral parallelizer and locality optimizer, ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), pp.101-113, 2008.

U. Bondhugula, M. Baskaran, S. Krishnamoorthy, J. Ramanujam, A. Rountev et al., Automatic Transformations for Communication-minimized Parallelization and Locality Optimization in the Polyhedral Model, Proceedings of the Joint European Conferences on Theory and Practice of Software 17th International Conference on Compiler Construction, CC'08/ETAPS'08, pp.132-146, 2008.

U. Bondhugula, O. Gunluk, S. Dash, and L. Renganarayanan, A model for fusion and code motion in an automatic parallelizing compiler, Proceedings of the 19th international conference on Parallel architectures and compilation techniques, PACT '10, pp.343-352, 2010.
DOI : 10.1145/1854273.1854317

A. Acharya and U. Bondhugula, PLUTO+: Near-complete Modeling of Affine Transformations for Parallelism and Locality, Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp.54-64, 2015.

K. A. Yelick, L. Semenzato, G. Pike, C. Miyamoto, B. Liblit et al., Titanium: a high-performance Java dialect, Concurrency: Practice and Experience, pp.11-13, 1998.
DOI : 10.1002/(SICI)1096-9128(199809/11)10:11/13<825::AID-CPE383>3.0.CO;2-H

B. Chamberlain, D. Callahan, and H. Zima, Parallel Programmability and the Chapel Language, International Journal of High Performance Computing Applications, vol.21, issue.3, pp.291-312, 2007.
DOI : 10.1177/1094342007078442

I. Fassi, P. Clauss, M. Kuhn, and Y. Slama, Multifor for Multicore, IMPACT 2013, Third International Workshop on Polyhedral Compilation Techniques Proceedings of the 3rd International Workshop on Polyhedral Compilation Techniques, pp.37-44, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00780748

I. Fassi and P. Clauss, XFOR: Filling the Gap between Automatic Loop Optimization and Peak Performance, 2015 14th International Symposium on Parallel and Distributed Computing, pp.62-86, 2015.
DOI : 10.1109/ISPDC.2015.19

URL : https://hal.archives-ouvertes.fr/hal-01155144

I. Fassi and P. Clauss, IBB : The Multifor Compiler, pp.2013-2014, 2015.

L. Dagum and R. Menon, OpenMP: an industry standard API for shared-memory programming, IEEE Computational Science and Engineering, vol.5, issue.1, pp.46-55, 1998.
DOI : 10.1109/99.660313

H. L. van der Spek, E. M. Bakker, and H. A. Wijshoff, SPARK00: A Benchmark Package for the Compiler Evaluation of Irregular/Sparse Codes, pp.20-171, 2008.

E. W. Dijkstra, A preliminary investigation into Computer Assisted Programming

L. Pouchet, C. Bastoul, A. Cohen, and J. Cavazos, Iterative optimization in the polyhedral model: part ii, multidimensional time, Proc. of the ACM SIGPLAN 2008 Conf. on Programming Language Design and Implementation, pp.90-100, 2008.
URL : https://hal.archives-ouvertes.fr/hal-01257273

E. Park, J. Cavazos, L. Pouchet, C. Bastoul, A. Cohen et al., Predictive Modeling in a Polyhedral Optimization Space, International Journal of Parallel Programming, vol.18, issue.1, pp.704-750, 2013.
DOI : 10.1007/s10766-013-0241-1

URL : https://hal.archives-ouvertes.fr/hal-00918653

Z. Wang, G. Tournavitis, B. Franke, M. F. O'Boyle, G. Fursin et al., Integrating profile-driven parallelism detection and machine-learning-based mapping, Milepost GCC: Machine Learning Enabled Self-tuning Compiler, pp.23-296, 2011.

M. Benabderrahmane, L. Pouchet, A. Cohen, and C. Bastoul, The Polyhedral Model Is More Widely Applicable Than You Think, Proceedings of the International Conference on Compiler Construction (ETAPS CC'10), pp.25-27, 2010.
DOI : 10.1007/978-3-642-11970-5_16

URL : https://hal.archives-ouvertes.fr/inria-00551087

R. M. Karp, R. E. Miller, and S. Winograd, The Organization of Computations for Uniform Recurrence Equations, Journal of the ACM, vol.14, issue.3, pp.563-590, 1967.
DOI : 10.1145/321406.321418

A. W. Appel, Modern Compiler Implementation in C: Basic Techniques, p.25, 1997.
DOI : 10.1017/CBO9781139174930

S. P. Midkiff, Automatic Parallelization: An Overview of Fundamental Compiler Techniques, Synthesis Lectures on Computer Architecture, vol.7, issue.1, p.25, 2012.
DOI : 10.2200/S00340ED1V01Y201201CAC019

C. Bastoul, Improving data locality in static control programs, pp.25-35, 2004.

C. Bastoul, Contributions to High-Level Program Optimization, p.25, 2012.

P. Feautrier, Some efficient solutions to the affine scheduling problem. Part II. Multidimensional time, International Journal of Parallel Programming, vol.2, issue.4, pp.389-420, 1992.
DOI : 10.1007/BF01379404

L. Pouchet, C. Bastoul, A. Cohen, and N. Vasilache, Iterative Optimization in the Polyhedral Model: Part I, One-Dimensional Time, International Symposium on Code Generation and Optimization (CGO'07), pp.144-156, 2007.
DOI : 10.1109/CGO.2007.21

URL : https://hal.archives-ouvertes.fr/hal-01257281

C. Bastoul, A. Cohen, S. Girbal, S. Sharma, and O. Temam, Putting Polyhedral Loop Transformations to Work, LCPC'16 International Workshop on Languages and Compilers for Parallel Computers, pp.209-225, 2003.
DOI : 10.1007/978-3-540-24644-2_14

URL : https://hal.archives-ouvertes.fr/inria-00071681

S. Verdoolaege and T. Grosser, Polyhedral extraction tool, Second International Workshop on Polyhedral Compilation Techniques, pp.2012-2039

T. Grosser, A. Größlinger, and C. Lengauer, Polly - Performing Polyhedral Optimizations on a Low-Level Intermediate Representation, Parallel Processing Letters, vol.22, issue.04, p.27, 2012.
DOI : 10.1142/S0129626412500107

L. Rauchwerger and D. Padua, The LRPD Test: Speculative Run-time Parallelization of Loops with Privatization and Reduction Parallelization, Proceedings of the ACM SIGPLAN 1995 Conference on Programming Language Design and Implementation, PLDI '95, pp.218-232, 1995.

D. A. Padua and M. J. Wolfe, Advanced compiler optimizations for supercomputers, Communications of the ACM, vol.29, issue.12, pp.1184-1201, 1986.
DOI : 10.1145/7902.7904

P. Feautrier, Array Expansion, ACM Int. Conf. on Supercomputing, pp.429-441, 1988.
URL : https://hal.archives-ouvertes.fr/hal-01099746

P. Feautrier, Dataflow analysis of array and scalar references, International Journal of Parallel Programming, vol.24, issue.4, p.33, 1991.
DOI : 10.1007/BF01407931

U. K. Banerjee, Loop Transformations for Restructuring Compilers: The Foundations, p.34, 1993.
DOI : 10.1007/b102311

L. Pouchet, U. Bondhugula, C. Bastoul, A. Cohen, J. Ramanujam et al., Loop Transformations: Convexity, Pruning and Optimization, Proceedings of the 38th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, pp.549-562, 2011.
DOI : 10.1145/1925844.1926449

URL : https://hal.archives-ouvertes.fr/inria-00551077

C. Bastoul, Efficient code generation for automatic parallelization and optimization, Second International Symposium on Parallel and Distributed Computing, 2003. Proceedings., pp.23-30, 2003.
DOI : 10.1109/ISPDC.2003.1267639

L. Pouchet, C. Bastoul, and A. Cohen, LetSee: the LEgal Transformation SpacE Explorator, Third International Summer School on Advanced Computer Architecture and Compilation for Embedded Systems (ACACES'07), L'Aquila, Italia, pp.247-251, 2007.

C. Bastoul and P. Feautrier, Improving Data Locality by Chunking, Lecture Notes in Computer Science, vol.2622, pp.320-334, 2003.
DOI : 10.1007/3-540-36579-6_23

URL : https://hal.archives-ouvertes.fr/inria-00001055

M. De Wael, S. Marr, B. De Fraine, T. Van Cutsem, and W. De Meuter, Partitioned Global Address Space Languages, ACM Computing Surveys, vol.47, issue.4, pp.1-6227, 2015.
DOI : 10.1145/2716320

URL : https://hal.archives-ouvertes.fr/hal-01109405

M. P. Forum, MPI: A Message-Passing Interface Standard, tech. rep, p.47, 1994.

P. Charles, C. Grothoff, V. Saraswat, C. Donawa, A. Kielstra et al., X10, ACM SIGPLAN Notices, vol.40, issue.10, pp.519-538, 2005.
DOI : 10.1145/1103845.1094852

URL : https://hal.archives-ouvertes.fr/in2p3-00166974

P. Feautrier, É. Violard, A. Ketterlin, V. A. Saraswat, O. Tardieu et al., Improving the Performance of X10 Programs by Clock Removal, Lecture Notes in Computer Science, vol.8409, pp.113-132, 2012.
DOI : 10.1007/978-3-642-54807-9_7

B. L. Chamberlain, S. Choi, S. J. Deitz, and A. Navarro, User-Defined Parallel Zippered Iterators in Chapel, PGAS 2011: Fifth Conf. on Partitioned Global Address Space Programming Models, p.49, 2011.

T. Henretty, R. Veras, F. Franchetti, L. Pouchet, J. Ramanujam et al., A stencil compiler for short-vector SIMD architectures, Proceedings of the 27th international ACM conference on International conference on supercomputing, ICS '13, pp.13-24, 2013.
DOI : 10.1145/2464996.2467268

K. Stock, M. Kong, T. Grosser, L. Pouchet, F. Rastello et al., A framework for enhancing data reuse via associative reordering, ACM SIGPLAN Conf. on Programming Language Design and Implementation, PLDI '14, pp.65-76, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01016093

I. Christadler, G. Erbacci, and A. D. Simpson, Performance and productivity of new programming languages, in Facing the Multicore-Challenge II, pp.24-35, 2012.

P. Clauss, I. Fassi, and A. Jimborean, Software-controlled processor stalls for time and energy efficient data locality optimization, 2014 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XIV), pp.199-206, 2014.
DOI : 10.1109/SAMOS.2014.6893212

URL : https://hal.archives-ouvertes.fr/hal-01003228

P. Clauss, Mind The Gap! A study of some pitfalls preventing peak performance in polyhedral compilation using a polyhedral antidote, IMPACT 2015, 5th International Workshop on Polyhedral Compilation Techniques, p.86, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01099583

Free Software Foundation, GNU General Public License, Online; accessed 14, pp.99-152, 2015.

A. Sukumaran-rajam, J. M. Martinez, W. Wolff, A. Jimborean, and P. Clauss, Speculative Program Parallelization with Scalable and Decentralized Runtime Verification, pp.124-139, 2014.
DOI : 10.1007/978-3-319-11164-3_11

URL : https://hal.archives-ouvertes.fr/hal-01070610

Free Software Foundation, The Flex Homepage, p.174, 2015.

Free Software Foundation, The Bison Homepage, p.174, 2015.

Free Software Foundation, The GNU Multiple Precision Arithmetic Library, p.175, 2015.
