S. Graham, P. Kessler, and M. Mckusick, Gprof, ACM SIGPLAN Notices, vol.17, issue.6, pp.120-126, 1982.
DOI : 10.1145/872726.806987

A. Knüpfer, H. Brunst, J. Doleschal, M. Jurenz, M. Lieber et al., The Vampir Performance Analysis Tool-Set, Tools for High Performance Computing, pp.139-155, 2008.
DOI : 10.1007/978-3-540-68564-7_9

V. Pillet, J. Labarta, T. Cortes, and S. Girona, Paraver: A tool to visualize and analyze parallel code, " in Transputer and occam developments: WoTUG-18: proceedings of the 187th world occam and Transputer User Group Technical Meeting, pp.9-13, 1995.

R. Danjean, P. Namyst, and . Wacrenier, Visual Trace Explorer Available: http://vite.gforge.inria.fr An efficient multi-level trace toolkit for multi-threaded applications, Euro-Par 2005 Parallel Processing, pp.166-175, 2005.

E. Karrels and E. Lusk, Performance analysis of MPI programs, Proceedings of the Workshop on Environments and Tools For Parallel Scientific Computing. SIAM Publications, pp.195-200, 1994.

J. Vetter and B. De-supinski, Dynamic Software Testing of MPI Applications with Umpire, ACM/IEEE SC 2000 Conference (SC'00), p.51, 2000.
DOI : 10.1109/SC.2000.10055

S. Bull, NPTL Stabilization Project, Linux Symposium, p.111

J. Caubet, J. Gimenez, J. Labarta, L. Derose, and J. Vetter, A Dynamic Tracing Mechanism for Performance Analysis of OpenMP Applications, OpenMP Shared Memory Parallel Programming, pp.53-67, 2001.
DOI : 10.1007/3-540-44587-0_6

M. Noeth, P. Ratn, F. Mueller, M. Schulz, and B. De-supinski, ScalaTrace: Scalable compression and replay of communication traces for high-performance computing, Journal of Parallel and Distributed Computing, vol.69, issue.8, pp.696-710, 2009.
DOI : 10.1016/j.jpdc.2008.09.001

K. Vijayakumar, F. Mueller, X. Ma, and P. Roth, Scalable I/O tracing and analysis, Proceedings of the 4th Annual Workshop on Petascale Data Storage, PDSW '09, pp.26-31, 2009.
DOI : 10.1145/1713072.1713080

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=

M. Geimer, F. Wolf, B. Wylie, E. Ábrahám, D. Becker et al., The Scalasca performance toolset architecture, Concurrency and Computation: Practice and Experience, pp.702-719, 2010.
DOI : 10.1002/cpe.1556

S. Shende and A. Malony, The Tau Parallel Performance System, International Journal of High Performance Computing Applications, vol.20, issue.2, p.287, 2006.
DOI : 10.1177/1094342006064482

M. Muller, A. Knupfer, M. Jurenz, M. Lieber, H. Brunst et al., Developing scalable applications with Vampir, VampirServer and VampirTrace, Proceedings of the Minisymposium on Scalability and Usability of HPC Programming Tools at PARCO, 2007.

B. Buck and J. Hollingsworth, An API for Runtime Code Patching, International Journal of High Performance Computing Applications, vol.14, issue.4, pp.317-329, 2000.
DOI : 10.1177/109434200001400404

S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci, A Portable Programming Interface for Performance Evaluation on Modern Processors, International Journal of High Performance Computing Applications, vol.14, issue.3, p.189, 2000.
DOI : 10.1177/109434200001400303

M. Schulz, J. Galarowicz, D. Maghrak, W. Hachfeld, D. Montoya et al., Open| SpeedShop: An open source infrastructure for parallel performance analysis, Scientific Programming, pp.105-121, 2008.

D. Reed, P. Roth, R. Aydt, K. Shields, L. Tavera et al., Scalable performance analysis: the Pablo performance analysis environment, Proceedings of Scalable Parallel Libraries Conference, pp.104-113, 1993.
DOI : 10.1109/SPLC.1993.365577

E. Agullo, J. Demmel, J. Dongarra, B. Hadri, J. Kurzak et al., Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects, Journal of Physics: Conference Series, p.12037, 2009.
DOI : 10.1088/1742-6596/180/1/012037

A. Knüpfer, R. Brendel, H. Brunst, H. Mix, and W. Nagel, Introducing the Open Trace Format (OTF), Computational Science?ICCS, pp.526-533, 2006.
DOI : 10.1007/11758525_71

J. Chassin-de-kergommeaux, B. De-oliveira-stein, and G. Mounié, Pajé input data Format, 2003.

G. Mercier and J. Clet-ortega, Towards an Efficient Process Placement Policy for MPI Applications in Multicore Environments, EuroPVM/MPI , ser, pp.104-115, 2009.
DOI : 10.1007/978-3-642-03770-2_17

URL : https://hal.archives-ouvertes.fr/inria-00392581

E. Gabriel, G. Fagg, G. Bosilca, T. Angskun, J. Dongarra et al., Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation, Recent Advances in Parallel Virtual Machine and Message Passing Interface, pp.353-377, 2004.
DOI : 10.1007/978-3-540-30218-6_19