S. Browne, N. Dongarra, K. Garner, P. London, and . Mucci, A Portable Programming Interface for Performance Evaluation on Modern Processors, International Journal of High Performance Computing Applications, vol.14, issue.3, pp.189-204, 2000.
DOI : 10.1177/109434200001400303

C. Jacqmot, Load Management in Distributed Computing Systems: Towards Adaptive Strategies, 1996.

R. K. Jain, The Art of Computer Systems Performance Analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling, 1991.

J. L. Lo, S. J. Eggers, J. S. Emer, H. M. Levy, R. L. Stamm et al., Converting thread-level parallelism to instruction-level parallelism via simultaneous multithreading, ACM Transactions on Computer Systems, vol.15, issue.3, pp.322-354, 1997.
DOI : 10.1145/263326.263382

J. D. Mccalpin, Memory bandwidth and machine balance in current high performance computers, IEEE Technical Committee on Computer Architecture (TCCA) Newsletter, 1995.

C. Douglas and . Montgomery, Design and Analysis of Experiments, Student Solutions Manual, 2005.

T. Mytkowicz, A. Diwan, M. Hauswirth, and P. F. Sweeney, Producing wrong data without doing anything obviously wrong! SIGPLAN Not, pp.265-276, 2009.

A. Snavely, L. Carrington, N. Wolter, J. Labarta, R. Badia et al., A Framework for Performance Modeling and Prediction, ACM/IEEE SC 2002 Conference (SC'02), pp.1-17, 2002.
DOI : 10.1109/SC.2002.10004

M. Mustafa, L. Tikir, E. Carrington, A. Strohmaier, and . Snavely, A genetic algorithms approach to modeling the performance of memory-bound