G. Aguilera, P. Teller, M. Taufer, and F. Wolf, A systematic multi-step methodology for performance analysis of communication traces of distributed applications based on hierarchical clustering, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium, pp.388-395, 2006.
DOI : 10.1109/IPDPS.2006.1639645

M. Arnold and B. G. Ryder, A framework for reducing the cost of instrumented code, Proc. of the ACM SIGPLAN 2001 conference on Programming language design and implementation L. Soffa, PLDI '01, pp.168-179, 2001.

H. Brunst, D. Kranzlmüller, M. S. Muller, W. E. Vampir, N. et al., Tools for scalable parallel program analysis: Vampir NG, MARMOT, and DeWiz, International Journal of Computational Science and Engineering, vol.4, issue.3, pp.149-161, 2009.
DOI : 10.1504/IJCSE.2009.027377

B. Buck and J. K. Hollingsworth, An API for Runtime Code Patching, International Journal of High Performance Computing Applications, vol.14, issue.4, pp.317-329, 2000.
DOI : 10.1177/109434200001400404

M. Calzarossa, L. Massari, and D. Tessera, Workload Characterization Issues and Methodologies, Performance Evaluation: Origins and Directions, pp.459-481, 2000.
DOI : 10.1007/3-540-46506-5_20

H. Casanova, A. Legrand, and M. Quinson, SimGrid: A Generic Framework for Large-Scale Distributed Experiments, Tenth International Conference on Computer Modeling and Simulation (uksim 2008), pp.126-131, 2008.
DOI : 10.1109/UKSIM.2008.28

URL : https://hal.archives-ouvertes.fr/inria-00260697

M. Geimer, F. Wolf, B. Wylie, and B. Mohr, Scalable Parallel Trace-Based Performance Analysis, Recent Advances in Parallel Virtual Machine and Message Passing Interface, pp.303-312, 2006.
DOI : 10.1007/11846802_43

J. Himmelspach, R. Ewald, and A. Uhrmacher, A flexible and scalable experimentation layer, 2008 Winter Simulation Conference, pp.827-835, 2008.
DOI : 10.1109/WSC.2008.4736146

S. Hung, C. Tu, and T. Soon, Trace-based performance analysis framework for heterogeneous multicore systems, 2010 15th Asia and South Pacific Design Automation Conference (ASP-DAC), pp.19-24, 2010.
DOI : 10.1109/ASPDAC.2010.5419926

A. Knüpfer, H. Brunst, J. Doleschal, M. Jurenz, M. Lieber et al., The Vampir Performance Analysis Tool-Set, Tools for High Performance Computing, pp.139-155, 2008.
DOI : 10.1007/978-3-540-68564-7_9

J. Lawrence and X. Yuan, An MPI tool for automatically discovering the switch level topologies of Ethernet clusters, 2008 IEEE International Symposium on Parallel and Distributed Processing, pp.1-8, 2008.
DOI : 10.1109/IPDPS.2008.4536545

E. P. Mancini, M. Rak, R. Torella, and U. Villano, Predictive Autonomicity of Web Services in the MAWeS Framework, Journal of Computer Science, vol.2, issue.6, pp.513-520, 2006.
DOI : 10.3844/jcssp.2006.513.520

J. Milano, G. Muller-shultz, and G. Lakner, Blue Gene/L: Hardware Overview and Planning . IBM Corp. http://workshops.alcf.anl.gov/gs10/files, Architecture. pdf, vol.01, 2006.

D. L. Mills, Improved algorithms for synchronizing computer network clocks, IEEE/ACM Transactions on Networking, vol.3, issue.3, pp.245-254, 1995.
DOI : 10.1109/90.392384

B. Mohr and F. Wolf, KOJAK ??? A Tool Set for Automatic Performance Analysis of Parallel Programs, Euro-Par 2003 Parallel Processing, pp.1301-1304, 2003.
DOI : 10.1007/978-3-540-45209-6_177

J. Ribault, O. Dalle, D. Conan, and S. Leriche, OSIF: A Framework To Instrument, Validate, and Analyze Simulations, Proceedings of the 3rd International ICST Conference on Simulation Tools and Techniques, pp.56-65, 2010.
DOI : 10.4108/ICST.SIMUTOOLS2010.8729

URL : https://hal.archives-ouvertes.fr/inria-00465141

R. Rouvoy, D. Conan, and L. Seinturier, Software Architecture Patterns for a Context-Processing Middleware Framework, IEEE Distributed Systems Online, vol.9, issue.6, pp.1-12, 2008.
DOI : 10.1109/MDSO.2008.17

URL : https://hal.archives-ouvertes.fr/inria-00286616

M. Schulz, J. Snf-galarowicz, D. Maghrak, W. Hachfeld, D. Montoya et al., Analyzing the Performance of Scientific Applications with OpenSpeedShop, Parallel Computational Fluid Dynamics: Recent Advances and Future Directions, pp.151-159, 2009.

S. S. Shende and A. D. Malony, The Tau Parallel Performance System, International Journal of High Performance Computing Applications, vol.20, issue.2, pp.287-311, 2006.
DOI : 10.1177/1094342006064482

J. A. Smith, S. D. Hammond, G. R. Mudalige, J. A. Davis, A. B. Mills et al., hpsgprof: A New Profiling Tool for Large-Scale Parallel Scientific Code, Proc. of Uk Performance Engineering Workshop 2009 (UKPEW09), pp.1-11, 2009.

C. B. Stunkel, J. Herring, B. Abali, and R. Sivaram, A new switch chip for IBM RS/6000 SP systems, Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM) , Supercomputing '99, pp.1-16, 1999.
DOI : 10.1145/331532.331548

B. R. Supinski, F. Mueller, R. Fowler, P. Ratn, T. Gamblin et al., An open infrastructure for scalable, reconfigurable analysis, Proc. of International Workshop on Scalable Tools for High-End Computing (STHEC 2008) ACM/SIGARCH, pp.39-50, 2008.

F. Wolf, F. Freitag, B. Mohr, S. Moore, and B. J. Wylie, Large Event Traces in Parallel Performance Analysis, Proc. of 8th Workshop on Parallel Systems and Algorithms (PASA), 2006.

F. Wolf and B. Mohr, EARL???A programmable and extensible toolkit for analyzing event traces of message passing programs, High-Performance Computing and Networking, pp.503-512, 1999.
DOI : 10.1007/BFb0100611

C. E. Wu, A. Bolmarcich, M. Snir, D. Wootton, F. Parpia et al., From Trace Generation to Visualization: A Performance Framework for Distributed Parallel Systems, ACM/IEEE SC 2000 Conference (SC'00), pp.50-68, 2000.
DOI : 10.1109/SC.2000.10050

E. Zahavi, G. Johnson, D. J. Kerbyson, and M. Lang, Optimized InfiniBand fat-tree routing for shift all-to-all communication patterns, Concurrency and Computation: Practice and Experience, vol.22, issue.2, pp.217-231, 2010.

A. Biographies and O. Dalle-is-maitre-de-conférences, He received his B.Sc. from the University of Bordeaux 1 and his M.Sc. and Ph.D. from UNS. From 1999 to 2000, he was a postdoctoral fellow at the French Space Agency center in Toulouse (CNES-CST), where he started working on component-based discrete-event simulation of multi-media telecommunication systems he was appointed to UNS, and he joined the MASCOTTE research group, a joint team of UNS, CNRS and INRIA. His current research interests in discrete-event simulation are on methodology support, very large-scale networked systems, and wireless communication systems, 2000.

E. P. Mancini and . Post-doc-researcher-inria-sophia-antipolis-méditerranée, He received his Ph.D. in Information Engineering and his M.Sc. in Computer Engineering from the University of Sannio, Italy. His research interests are in the area of simulation, high performance computing, and autonomic computing