Recursive functions symbolic expressions and their computation by machine, Part I, Communications of the ACM, vol.3, issue.4, pp.184-195, 1960. ,
DOI : 10.1145/367177.367199
OpenMP: an industry standard API for shared-memory programming, IEEE Computational Science and Engineering, vol.5, issue.1, pp.46-55, 1998. ,
DOI : 10.1109/99.660313
Speculative Program Parallelization with Scalable and Decentralized Runtime Verification, LNCS, vol.8734, pp.124-139, 2014. ,
DOI : 10.1007/978-3-319-11164-3_11
URL : https://hal.archives-ouvertes.fr/hal-01070610
The Parboil technical report, tech. rep., IMPACT Technical Report, pp.12-13, 2012. ,
Parsec 2.0: A new benchmark suite for chipmultiprocessors, Proceedings of the 5th Annual Workshop on Modeling, Benchmarking and Simulation, 2009. ,
Rodinia: A benchmark suite for heterogeneous computing, 2009 IEEE International Symposium on Workload Characterization (IISWC), pp.44-54, 2009. ,
DOI : 10.1109/IISWC.2009.5306797
SPARK00: A benchmark package for the compiler evaluation of irregular/sparse codes, p.805, 2008. ,
Polybench: The polyhedral benchmark suite, 2010. ,
Speculative Runtime Parallelization of Loop Nests: Towards Greater Scope and Efficiency, 20th International Workshop on High-level Parallel Programming Models and Supportive Environments, held in conjunction with 29th IEEE International Parallel & Distributed Processing Symposium, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01155172
Applied Multiple Regression/- Correlation Analysis for the Behavioral Sciences, Routledge, 2002. ,
Rodinia: A benchmark suite for heterogeneous computing, 2009 IEEE International Symposium on Workload Characterization (IISWC), pp.44-54, 2009. ,
DOI : 10.1109/IISWC.2009.5306797
Collective optimization, ACM Transactions on Architecture and Code Optimization, vol.7, issue.4, pp.1-2029, 2010. ,
DOI : 10.1145/1880043.1880047
URL : https://hal.archives-ouvertes.fr/inria-00445326
PERFECT (Power Efficiency Revolution For Embedded Computing Technologies) Benchmark Suite Manual, Pacific Northwest National Laboratory and Georgia Tech Research Institute, 2013. ,
A practical automatic polyhedral parallelizer and locality optimizer, PLDI '08, 2008. ,
Polyhedron model, Encyclopedia of Parallel Computing, pp.1581-1592, 2011. ,
The Organization of Computations for Uniform Recurrence Equations, Journal of the ACM, vol.14, issue.3, pp.563-590, 1967. ,
DOI : 10.1145/321406.321418
The systematic design of systolic arrays, Centre National De Recherche Scientifique on Automata Networks in Computer Science: Theory and Applications, pp.229-260, 1987. ,
URL : https://hal.archives-ouvertes.fr/inria-00076342
Dependence Analysis for Supercomputing, 1988. ,
DOI : 10.1007/978-1-4684-6894-6
The I test: an improved dependence test for automatic parallelization and vectorization, IEEE Transactions on Parallel and Distributed Systems, vol.2, issue.3, pp.342-349, 1991. ,
DOI : 10.1109/71.86109
A practical algorithm for exact array dependence analysis, Communications of the ACM, vol.35, issue.8, pp.102-114, 1992. ,
DOI : 10.1145/135226.135233
The power test for data dependence, IEEE Transactions on Parallel and Distributed Systems, vol.3, issue.5, pp.591-601, 1992. ,
DOI : 10.1109/71.159042
Some efficient solutions to the affine scheduling problem. Part II. Multidimensional time, International Journal of Parallel Programming, vol.2, issue.4, 1992. ,
DOI : 10.1007/BF01379404
Some efficient solutions to the affine scheduling problem. Part II. Multidimensional time, International Journal of Parallel Programming, vol.2, issue.4, pp.389-420, 1992. ,
DOI : 10.1007/BF01379404
Dataflow analysis of array and scalar references, International Journal of Parallel Programming, vol.24, issue.4, pp.23-53, 1991. ,
DOI : 10.1007/BF01407931
Semi-Automatic Composition of Loop Transformations for Deep Parallelism and Memory Hierarchies, International Journal of Parallel Programming, vol.20, issue.1, pp.261-317, 2006. ,
DOI : 10.1007/s10766-006-0012-3
URL : https://hal.archives-ouvertes.fr/hal-01257288
Advanced compiler optimizations for supercomputers, Communications of the ACM, vol.29, issue.12, pp.1184-1201, 1986. ,
DOI : 10.1145/7902.7904
Array expansion, Proceedings of the 2nd International Conference on Supercomputing, ICS '88, pp.429-441, 1988. ,
URL : https://hal.archives-ouvertes.fr/hal-01099746
Dataflow analysis of array and scalar references, International Journal of Parallel Programming, vol.24, issue.4, 1991. ,
DOI : 10.1007/BF01407931
A Polyhedral Approach to Ease the Composition of Program Transformations, Euro-Par 2004 Parallel Processing, pp.292-303, 2004. ,
DOI : 10.1007/978-3-540-27866-5_38
URL : https://hal.archives-ouvertes.fr/hal-01257301
Putting Polyhedral Loop Transformations to Work, LCPC'16 Intl. Workshop on Languages and Compilers for Parallel ComputersCollege Station), pp.209-225, 2003. ,
DOI : 10.1007/978-3-540-24644-2_14
URL : https://hal.archives-ouvertes.fr/inria-00071681
An automatic parallelizer and locality optimizer for multicores ,
Toward automatic partitioning of arrays on distributed memory computers, Proceedings of the 7th international conference on Supercomputing , ICS '93, pp.175-184, 1993. ,
DOI : 10.1145/165939.165968
Parametric integer programming, RAIRO - Operations Research, vol.22, issue.3, pp.243-268, 1988. ,
DOI : 10.1051/ro/1988220302431
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.30.9957
Parametric analysis of polyhedral iteration spaces, Proceedings of International Conference on Application Specific Systems, Architectures and Processors: ASAP '96, pp.179-194, 1998. ,
DOI : 10.1109/ASAP.1996.542833
URL : https://hal.archives-ouvertes.fr/inria-00534840
Computing the volume, counting integral points, and exponential sums, Discrete & Computational Geometry, vol.21, issue.4, pp.123-141, 1993. ,
DOI : 10.1007/BF02573970
isl: An Integer Set Library for the Polyhedral Model, Mathematical Software -ICMS 2010, pp.299-302, 2010. ,
DOI : 10.1007/978-3-642-15582-6_49
Extracting polyhedral representation from high level languages, tech. rep, 2008. ,
Openscop: A specification and a library for data exchange in polyhedral compilation tools, tech. rep, 2011. ,
Contributions to High-Level Program Optimization, 2012. ,
XFOR: Filling the Gap between Automatic Loop Optimization and Peak Performance, 2015 14th International Symposium on Parallel and Distributed Computing, 2015. ,
DOI : 10.1109/ISPDC.2015.19
URL : https://hal.archives-ouvertes.fr/hal-01155144
Software-controlled processor stalls for time and energy efficient data locality optimization, 2014 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XIV), pp.199-206, 2014. ,
DOI : 10.1109/SAMOS.2014.6893212
URL : https://hal.archives-ouvertes.fr/hal-01003228
PolyMage, ACM SIGARCH Computer Architecture News, vol.43, issue.1, pp.429-443, 2015. ,
DOI : 10.1145/2786763.2694364
Generating loops for scanning polyhedra, Tech. Rep, vol.23, 2002. ,
POLLY ??? PERFORMING POLYHEDRAL OPTIMIZATIONS ON A LOW-LEVEL INTERMEDIATE REPRESENTATION, Parallel Processing Letters, 2012. ,
DOI : 10.1142/S0129626412500107
GRAPHITE: Polyhedral analyses and optimizations for GCC, Contribution to the GNU Compilers Collection Developers Summit, 2006. ,
The loop parallelizer loopo, Proc. Sixth Workshop on Compilers for Parallel Computers, pp.311-320, 1996. ,
LetSee: the LEgal Transformation SpacE Explorator, Third International Summer School on Advanced Computer Architecture and Compilation for Embedded Systems (ACACES'07), L'Aquila, Italia, pp.247-251, 2007. ,
Chill: A framework for composing high-level loop transformations, 2008. ,
Polyhedral parallel code generation for CUDA, ACM Transactions on Architecture and Code Optimization, vol.9, issue.4, pp.1-5423, 2013. ,
DOI : 10.1145/2400682.2400713
URL : https://hal.archives-ouvertes.fr/hal-00786677
R-stream 3.0: Technologies for high level embedded application mapping, Proceedings of the 8th Annual High Performance Embedded Computing (HPEC) Workshops, 2004. ,
A parametric high level compiler, Proceedings of the High Performance Embedded Computing Workshop (HPEC), 2006. ,
Adaptive Runtime Selection for GPU, 2013 42nd International Conference on Parallel Processing, pp.70-79, 2013. ,
DOI : 10.1109/ICPP.2013.16
URL : https://hal.archives-ouvertes.fr/hal-00869652
CPU+GPU Load Balance Guided by Execution Time Prediction, Fifth International Workshop on Polyhedral Compilation Techniques, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01095890
Speculative synchronization: Applying threadlevel speculation to explicitly parallel applications, Proceedings of the Tenth Symposium on Architectural Support for Programming Languages and Operating Systems, 2002. ,
iWatcher, 31st Int. Symp. on Computer Architecture (ISCA), pp.224-237, 2004. ,
DOI : 10.1145/1028176.1006720
ReEnact, ACM SIGARCH Computer Architecture News, vol.31, issue.2, pp.110-121, 2003. ,
DOI : 10.1145/871656.859632
Intel® 64 and ia-32 architectures software developer's manual https:// www-ssl.intel.com/content/dam-ia-32-architectures-software-developer-instruction-set\ -reference-manual-325383, 2015. ,
The IBM Blue Gene/Q Compute Chip, IEEE Micro, vol.32, issue.2, pp.48-60, 2012. ,
DOI : 10.1109/MM.2011.108
Rock: A High-Performance Sparc CMT Processor, IEEE Micro, vol.29, issue.2, pp.6-16, 2009. ,
DOI : 10.1109/MM.2009.34
A scalable approach to thread-level speculation, ACM SIGARCH Computer Architecture News, vol.28, issue.2, pp.1-12, 2000. ,
DOI : 10.1145/342001.339650
Thread-Level Speculation on a CMP can be energy efficient, Proceedings of the 19th annual international conference on Supercomputing , ICS '05, pp.219-228, 2005. ,
DOI : 10.1145/1088149.1088178
The STAMPede approach to thread-level speculation, ACM Transactions on Computer Systems, vol.23, issue.3, pp.253-300, 2005. ,
DOI : 10.1145/1082469.1082471
A scalable method for run-time loop parallelization, International Journal of Parallel Programming, vol.4, issue.1, pp.26-32, 1995. ,
DOI : 10.1007/BF02577866
Unifying Thread-Level Speculation and Transactional Memory, Proceedings of the 13th International Middleware Conference, pp.187-207, 2012. ,
DOI : 10.1109/SRDS.2011.16
The LRPD Test: Speculative Run-time Parallelization of Loops with Privatization and Reduction Parallelization, Proceedings of the ACM SIGPLAN 1995 Conference on Programming Language Design and Implementation, PLDI '95, pp.218-232, 1995. ,
The R-LRPD test: speculative parallelization of partially parallel loops, Proceedings 16th International Parallel and Distributed Processing Symposium, 2002. ,
DOI : 10.1109/IPDPS.2002.1015493
Adapting the polytope model for dynamic and speculative parallelization, 2012. ,
URL : https://hal.archives-ouvertes.fr/tel-00733850
VMAD: An Advanced Dynamic Program Analysis and Instrumentation Framework, Proceedings of the 21st International Conference on Compiler Construction, pp.220-239, 2012. ,
DOI : 10.1007/978-3-642-28652-0_12
Support for Thread-Level Speculation into OpenMP, OpenMP in a Heterogeneous World, pp.275-278, 2012. ,
DOI : 10.1007/978-3-642-30961-8_25
Online Dynamic Dependence Analysis for Speculative Polyhedral Parallelization, Euro-Par 2013, pp.191-202, 2013. ,
DOI : 10.1007/978-3-642-40047-6_21
URL : https://hal.archives-ouvertes.fr/hal-00825744
POSH, Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming , PPoPP '06, 2006. ,
DOI : 10.1145/1122971.1122997
Speculative thread decomposition through empirical optimization, Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming , PPoPP '07, 2007. ,
DOI : 10.1145/1229428.1229474
Spice, Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization , CGO '08, 2008. ,
DOI : 10.1145/1356058.1356082
Code generation for parallel execution of a class of irregular loops on distributed memory systems, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis, p.12, 2012. ,
DOI : 10.1109/SC.2012.30
DieHard: probabilistic memory safety for unsafe languages, PLDI '06, pp.158-168, 2006. ,
DieHarder, Proceedings of the 17th ACM conference on Computer and communications security, CCS '10, pp.573-584, 2010. ,
DOI : 10.1145/1866307.1866371
Generation of synchronous code for automatic parallelization of while loops, Euro-Par '95 Parallel Processing, First International Euro-Par Conference Proceedings, pp.315-326, 1995. ,
DOI : 10.1007/BFb0020474
Automatic parallelization ofwhile-loops using speculative execution, International Journal of Parallel Programming, vol.634, issue.1, pp.191-219, 1995. ,
DOI : 10.1007/BF02577789
Parallelization of while loops in nested loop programs for shared-memory multiprocessor systems, 2011 Design, Automation & Test in Europe, pp.1-6, 2011. ,
DOI : 10.1109/DATE.2011.5763118
On the parallelization of loop nests containing while loops, Proceedings the First Aizu International Symposium on Parallel Algorithms/Architecture Synthesis, 1994. ,
DOI : 10.1109/AISPAS.1995.401360
Fuzzy Array Dataflow Analysis, Proceedings of the Fifth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP '95, pp.92-101, 1995. ,
Non-affine Extensions to Polyhedral Code Generation, Proceedings of Annual IEEE/ACM International Symposium on Code Generation and Optimization, CGO '14, pp.185185-185194, 2014. ,
DOI : 10.1145/2581122.2544141