A. Ali-ap, A Dynamic Feedback-Based Fittest Optimization Chooser, HPCS 2018 -16th International Conference on High Performance Computing & Simulation -Special Session on Compiler Architecture, Design and Optimization, 2018.

A. Arif, E. Ap, and . Rohou, Dynamic Function Specialization ». In: International Conference on Embedded Computer Systems: Architectures, MOdeling and Simulation. Pythagorion, 2017.

H. Amir and . Ashouri, « A survey on compiler autotuning using machine learning, 2018.

V. Aslot, A New Benchmark Suite for Measuring Parallel Computer Performance, OpenMP Shared Memory Parallel Programming: International Workshop on OpenMP Applications and Tools, 2001.

, , pp.1-10, 2001.

,

A. V. Aho, R. Sethi, and J. D. Ullman, Compilers: Principles, Techniques, and Tools. Addison-Wesley series in computer science / World student series edition, pp.0-201, 1986.

D. F. Bacon, S. L. Graham, and O. J. Sharp, « Compiler Transformations for High-performance Computing, ACM Comput. Surv, vol.26, pp.345-420, 1994.

D. Bruening, « Efficient, Transparent, and Comprehensive Runtime Code Manipulation, 2004.

D. Bruening, Q. Zhao, and S. Amarasinghe, « Transparent Dynamic Instrumentation, SIGPLAN Not. 47.7, pp.133-144, 2012.

M. P. João, J. Cardoso, F. Gabriel, P. C. Coutinho, M. P. Diniz-;-joão et al., Chapter 5 -Source code transformations and optimizations, pp.137-183, 2017.

B. Childers, J. W. Davidson, and M. L. Soffa, « Continuous compilation: a new approach to aggressive and adaptive code transformation, Proceedings International Parallel and Distributed Processing Symposium, vol.10, 2003.

D. Chen, Taming Hardware Event Samples for FDO Compilation, Proceedings of the 8th Annual IEEE/ACM International Symposium on Code Generation and Optimization. CGO '10, pp.42-52, 2010.

D. William and . Clinger, How to Read Floating Point Numbers Accurately, Proceedings of the ACM SIGPLAN 1990 Conference on Programming Language Design and Implementation. PLDI '90, pp.92-101, 1990.

C. Consel, J. L. Lawall, and A. Meur, « A tour of Tempo: a program specializer for the C language, Science of Computer Programming, vol.52, pp.167-6423, 2004.

J. J. Dongarra and A. R. Hinds, Unrolling loops in fortran, Software: Practice and Experience, vol.9, pp.219-226, 1979.

A. Dinaburg and A. Ruef, McSema: Static translation of x86 instructions to LLVM, pp.2016-2027

K. Ebcioglu, E. Altman, and E. Hokenek, « A JAVA ILP machine based on fast dynamic compilation, IEEE MASCOTS INTER-NATIONAL WORKSHOP ON SECURITY AND E CIENCY ASPECTS OF JAVA, 1997.

P. Ekemark, Static Multi-Versioning for Efficient Prefetching, 2016.

J. A. Fisher, « Trace Scheduling: A Technique for Global Microcode Compaction, IEEE Trans. Comput, vol.30, issue.7, 1981.

G. Goumas, A. Sotiropoulos, and N. Koziris, « Minimizing completion time for loop tiling with computation and communication overlapping, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS, vol.10, 2001.

N. Hallou, Dynamic Re-Vectorization of Binary Code, International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation -SAMOS XV, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01155207

B. Hawkins, Optimizing Binary Translation of Dynamically Generated Code, Proceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization. CGO '15, pp.68-78, 2015.

W. W. Hwu and P. P. Chang, « Achieving High Instruction Cache Performance with an Optimizing Compiler, SIGARCH Comput. Archit. News, vol.17, pp.163-5964, 1989.

B. Hawkins, B. Demsky, M. B. Taylor, and . Blackbox, Lightweight Security Monitoring for COTS Binaries, Proceedings of the 2016 International Symposium on Code Generation and Optimization. CGO '16, pp.261-272, 2016.

J. L. Henning, SPEC CPU2006 Benchmark Descriptions, vol.34, pp.163-5964, 2006.

D. Hong, A Multi-threaded and Retargetable Dynamic Binary Translator on Multicores, Proceedings of the Tenth International Symposium on Code Generation and Optimization. CGO '12, pp.104-113, 2012.

N. Hallou, E. Rohou, and P. Clauss, « Runtime Vectorization Transformations of Binary Code, International Journal of Parallel Programming, 2016.

E. Hodzic and W. Shang, On Time Optimal Supernode Shape, vol.12, pp.1045-9219, 2002.

T. Kisuki, Iterative Compilation in Program Optimization, 2000.

T. Kisuki, P. M. Knijnenburg, and M. F. O'boyle, « Combined selection of tile sizes and unroll factors using iterative compilation, Proceedings 2000 International Conference on Parallel Architectures and Compilation Techniques (Cat. No.PR00622), pp.237-246, 2000.

P. M. Knijnenburg, T. Kisuki, and M. F. O'boyle, « Iterative Compilation, Embedded Processor Design Challenges: Systems, Architectures, Modeling, and Simulation -SAMOS. Ed. by Ed F. Deprettere, Jürgen Teich, and Stamatis Vassiliadis, pp.171-187, 2002.

K. Kennedy and K. S. Mckinley, « Maximizing loop parallelism and improving data locality via loop fusion and distribution, Languages and Compilers for Parallel Computing, pp.978-981, 1994.

C. Lattner and V. Adve, « LLVM: A Compilation Framework for Lifelong Program Analysis & Transformation, International Symposium on Code Generation and Optimization (CGO'04), 2004.

J. Lehr, Jahrestagung der Gesellschaft für Informatik, 26.-30, Counting performance: hardware performance counter and compiler instrumentation, vol.46, pp.2187-2198, 2016.

L. Writing-an and . Pass,

C. Luk, Pin: building customized program analysis tools with dynamic instrumentation, 2005.

R. S. Machado, International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW), pp.25-30, 2017.

P. Michaud, A study of thread migration in temperature-constrained multicores », In: j-TACO, vol.4, pp.1544-3973, 2007.

S. S. Muchnick, Advanced Compiler Design and Implementation, 1997.

N. Nethercote, Dynamic binary analysis and instrumentation, 2004.

N. Nethercote and J. Seward, « Valgrind: a framework for heavyweight dynamic binary instrumentation, PLDI, pp.89-100, 2007.

D. Nuzman, JIT Technology with C/C++: Feedback-directed Dynamic Recompilation for Statically Compiled Languages

A. C. Optim, , vol.10, pp.1-59, 2013.

P. Péneau, PATMOS: Power and Timing Modeling, Optimization and Simulation. Proceedings of the 26th International Workshop on Power and Timing Modeling, Optimization and Simulation, pp.162-169, 2016.

K. Pettis and R. C. Hansen, « Profile Guided Code Positioning, SIGPLAN Not, vol.25, pp.16-27, 1990.

. Polybench/c-v4, 1: the Polyhedral Benchmark suite

M. Paleczny, C. Vick, and C. Click, « The Java HotSpot? Server Compiler, Proc. of the Java Virtual Machine Research and Technology Symposium, 2001.

E. Riou, PADRONE: a Platform for Online Profiling, Analysis, and Optimization, DCE 2014 -International workshop on Dynamic Compilation Everywhere, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00917950

M. Stephenson and S. Amarasinghe, « Predicting Unroll Factors Using Supervised Classification, Proceedings of the International Symposium on Code Generation and Optimization. CGO '05, pp.123-134, 2005.

H. Santos, Just-in-time Value Specialization, International Symposium on Code Generation and Optimization (CGO)

. Cgo-'13 and . Usa, , pp.1-11, 2013.

A. Sukumaran-rajam and P. Clauss, « The Polyhedral Model of Nonlinear Loops, ACM Trans. Archit. Code Optim, vol.12, 2015.

F. Schneider, R. Thomas, and . Gross, Using Platform-Specific Performance Counters for Dynamic Compilation, Languages and Compilers for Parallel Computing: 18th International Workshop, LCPC 2005, pp.334-346, 2005.

F. Schneider and T. R. Gross, « Using Platform-Specific Performance Counters for Dynamic Compilation, Languages and Compilers for Parallel Computing, pp.334-346, 2006.

M. D. Smith, « Overcoming the Challenges to Feedback-directed Optimization (Keynote Talk) ». In: SIGPLAN Not, vol.35, 2000.

, SPEC.org. SPEC CPU2017

A. Suresh, E. Rohou, and A. Seznec, « Compile-time Function Memoization, Proceedings of the 26th International Conference on Compiler Construction, pp.45-54, 2017.

,

A. Sukumaran-rajam, « Speculative Program Parallelization with Scalable and Decentralized Runtime Verification, pp.978-981, 2014.

A. Suresh, Intercepting Functions for Memoization: A Case Study Using Transcendental Functions, ACM Transactions on Architecture and Code Optimization (TACO), vol.12, issue.2, p.23, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01178085

L. Guy, J. L. Steele, and . White, « How to Print Floating-Point Numbers Accurately, 1990.

S. Sinnadurai, Q. Zhao, and W. Wong, Transparent Runtime Shadow Stack: Protection against malicious return address modifications, 2006.

N. Watkinson, Using Hardware Counters to Predict Vectorization, 2017.

J. Whaley, Dynamic Optimization through the use of Automatic Runtime Specialization, 1999.

B. Wicht, Hardware Counted Profile-Guided Optimization, 2014.

M. Zhou, Proceedings of the 2014 ACM International Conference on Object Oriented Programming Systems Languages & Applications. OOPSLA '14, pp.763-776, 2014.

. .. , 10 2 Séquence d'appel: Normal vs Spécialisation

. .. Kko02], Iterative Compilation Flow Chart given in, p.29

. .. Pass, 40 2.2 The shared Monitoring table is accessed by both the application and FITTCHOOSER

F. .. Overhead-of,

. Overall and . .. Fittchooser, , p.49

, Performance of critical function variations

, Call sequence: Normal vs Specialization

. .. , 2 OFSPER attached to the application process, p.64

, The shared monitoring table is accessed by both the application and OFSPER

, Impact of specialization on execution time of a function, p.69

. .. , , p.71

.. .. Speedups,

, Example of the shared Monitoring table

. .. Repeatability-of-arguments,

.. .. Monitoring-table, , p.61

, Monitoring table for the pure function exp2