A Dynamic Feedback-Based Fittest Optimization Chooser, HPCS 2018 -16th International Conference on High Performance Computing & Simulation -Special Session on Compiler Architecture, Design and Optimization, 2018. ,
, Dynamic Function Specialization ». In: International Conference on Embedded Computer Systems: Architectures, MOdeling and Simulation. Pythagorion, 2017.
« A survey on compiler autotuning using machine learning, 2018. ,
A New Benchmark Suite for Measuring Parallel Computer Performance, OpenMP Shared Memory Parallel Programming: International Workshop on OpenMP Applications and Tools, 2001. ,
, , pp.1-10, 2001.
,
Compilers: Principles, Techniques, and Tools. Addison-Wesley series in computer science / World student series edition, pp.0-201, 1986. ,
« Compiler Transformations for High-performance Computing, ACM Comput. Surv, vol.26, pp.345-420, 1994. ,
« Efficient, Transparent, and Comprehensive Runtime Code Manipulation, 2004. ,
« Transparent Dynamic Instrumentation, SIGPLAN Not. 47.7, pp.133-144, 2012. ,
, Chapter 5 -Source code transformations and optimizations, pp.137-183, 2017.
« Continuous compilation: a new approach to aggressive and adaptive code transformation, Proceedings International Parallel and Distributed Processing Symposium, vol.10, 2003. ,
Taming Hardware Event Samples for FDO Compilation, Proceedings of the 8th Annual IEEE/ACM International Symposium on Code Generation and Optimization. CGO '10, pp.42-52, 2010. ,
How to Read Floating Point Numbers Accurately, Proceedings of the ACM SIGPLAN 1990 Conference on Programming Language Design and Implementation. PLDI '90, pp.92-101, 1990. ,
« A tour of Tempo: a program specializer for the C language, Science of Computer Programming, vol.52, pp.167-6423, 2004. ,
Unrolling loops in fortran, Software: Practice and Experience, vol.9, pp.219-226, 1979. ,
McSema: Static translation of x86 instructions to LLVM, pp.2016-2027 ,
« A JAVA ILP machine based on fast dynamic compilation, IEEE MASCOTS INTER-NATIONAL WORKSHOP ON SECURITY AND E CIENCY ASPECTS OF JAVA, 1997. ,
Static Multi-Versioning for Efficient Prefetching, 2016. ,
« Trace Scheduling: A Technique for Global Microcode Compaction, IEEE Trans. Comput, vol.30, issue.7, 1981. ,
« Minimizing completion time for loop tiling with computation and communication overlapping, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS, vol.10, 2001. ,
Dynamic Re-Vectorization of Binary Code, International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation -SAMOS XV, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01155207
Optimizing Binary Translation of Dynamically Generated Code, Proceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization. CGO '15, pp.68-78, 2015. ,
« Achieving High Instruction Cache Performance with an Optimizing Compiler, SIGARCH Comput. Archit. News, vol.17, pp.163-5964, 1989. ,
Lightweight Security Monitoring for COTS Binaries, Proceedings of the 2016 International Symposium on Code Generation and Optimization. CGO '16, pp.261-272, 2016. ,
, SPEC CPU2006 Benchmark Descriptions, vol.34, pp.163-5964, 2006.
A Multi-threaded and Retargetable Dynamic Binary Translator on Multicores, Proceedings of the Tenth International Symposium on Code Generation and Optimization. CGO '12, pp.104-113, 2012. ,
« Runtime Vectorization Transformations of Binary Code, International Journal of Parallel Programming, 2016. ,
, On Time Optimal Supernode Shape, vol.12, pp.1045-9219, 2002.
Iterative Compilation in Program Optimization, 2000. ,
« Combined selection of tile sizes and unroll factors using iterative compilation, Proceedings 2000 International Conference on Parallel Architectures and Compilation Techniques (Cat. No.PR00622), pp.237-246, 2000. ,
« Iterative Compilation, Embedded Processor Design Challenges: Systems, Architectures, Modeling, and Simulation -SAMOS. Ed. by Ed F. Deprettere, Jürgen Teich, and Stamatis Vassiliadis, pp.171-187, 2002. ,
« Maximizing loop parallelism and improving data locality via loop fusion and distribution, Languages and Compilers for Parallel Computing, pp.978-981, 1994. ,
« LLVM: A Compilation Framework for Lifelong Program Analysis & Transformation, International Symposium on Code Generation and Optimization (CGO'04), 2004. ,
Jahrestagung der Gesellschaft für Informatik, 26.-30, Counting performance: hardware performance counter and compiler instrumentation, vol.46, pp.2187-2198, 2016. ,
,
Pin: building customized program analysis tools with dynamic instrumentation, 2005. ,
International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW), pp.25-30, 2017. ,
A study of thread migration in temperature-constrained multicores », In: j-TACO, vol.4, pp.1544-3973, 2007. ,
Advanced Compiler Design and Implementation, 1997. ,
Dynamic binary analysis and instrumentation, 2004. ,
« Valgrind: a framework for heavyweight dynamic binary instrumentation, PLDI, pp.89-100, 2007. ,
, JIT Technology with C/C++: Feedback-directed Dynamic Recompilation for Statically Compiled Languages
, , vol.10, pp.1-59, 2013.
, PATMOS: Power and Timing Modeling, Optimization and Simulation. Proceedings of the 26th International Workshop on Power and Timing Modeling, Optimization and Simulation, pp.162-169, 2016.
« Profile Guided Code Positioning, SIGPLAN Not, vol.25, pp.16-27, 1990. ,
1: the Polyhedral Benchmark suite ,
« The Java HotSpot? Server Compiler, Proc. of the Java Virtual Machine Research and Technology Symposium, 2001. ,
PADRONE: a Platform for Online Profiling, Analysis, and Optimization, DCE 2014 -International workshop on Dynamic Compilation Everywhere, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00917950
« Predicting Unroll Factors Using Supervised Classification, Proceedings of the International Symposium on Code Generation and Optimization. CGO '05, pp.123-134, 2005. ,
Just-in-time Value Specialization, International Symposium on Code Generation and Optimization (CGO) ,
, , pp.1-11, 2013.
« The Polyhedral Model of Nonlinear Loops, ACM Trans. Archit. Code Optim, vol.12, 2015. ,
Using Platform-Specific Performance Counters for Dynamic Compilation, Languages and Compilers for Parallel Computing: 18th International Workshop, LCPC 2005, pp.334-346, 2005. ,
« Using Platform-Specific Performance Counters for Dynamic Compilation, Languages and Compilers for Parallel Computing, pp.334-346, 2006. ,
« Overcoming the Challenges to Feedback-directed Optimization (Keynote Talk) ». In: SIGPLAN Not, vol.35, 2000. ,
, SPEC.org. SPEC CPU2017
« Compile-time Function Memoization, Proceedings of the 26th International Conference on Compiler Construction, pp.45-54, 2017. ,
,
« Speculative Program Parallelization with Scalable and Decentralized Runtime Verification, pp.978-981, 2014. ,
Intercepting Functions for Memoization: A Case Study Using Transcendental Functions, ACM Transactions on Architecture and Code Optimization (TACO), vol.12, issue.2, p.23, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01178085
« How to Print Floating-Point Numbers Accurately, 1990. ,
Transparent Runtime Shadow Stack: Protection against malicious return address modifications, 2006. ,
Using Hardware Counters to Predict Vectorization, 2017. ,
Dynamic Optimization through the use of Automatic Runtime Specialization, 1999. ,
Hardware Counted Profile-Guided Optimization, 2014. ,
, Proceedings of the 2014 ACM International Conference on Object Oriented Programming Systems Languages & Applications. OOPSLA '14, pp.763-776, 2014.
10 2 Séquence d'appel: Normal vs Spécialisation ,
Iterative Compilation Flow Chart given in, p.29 ,
40 2.2 The shared Monitoring table is accessed by both the application and FITTCHOOSER ,
,
, , p.49
, Performance of critical function variations
, Call sequence: Normal vs Specialization
2 OFSPER attached to the application process, p.64 ,
, The shared monitoring table is accessed by both the application and OFSPER
, Impact of specialization on execution time of a function, p.69
, , p.71
,
, Example of the shared Monitoring table
,
, , p.61
, Monitoring table for the pure function exp2