A. Agarwal and M. Cherian, Adaptive Backoff Synchronization Techniques, Proceedings of the 16th Annual International Symposium on Computer Architecture, ISCA '89, pp.396-406, 1989.

M. K. Aguilera, J. C. Mogul, J. L. Wiener, P. Reynolds, and A. Muthitacharoen, Performance Debugging for Distributed Systems of Black Boxes, Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles, p.3, 2003.

B. Alpern, S. Augart, S. M. Blackburn, M. Butrico, A. Cocchi et al., The Jikes Research Virtual Machine project: Building an open-source research community, IBM Systems Journal, vol.44, issue.2, pp.399-417, 2005.
DOI : 10.1147/sj.442.0399

E. Altman, M. Arnold, S. Fink, and N. Mitchell, Performance Analysis of Idle Programs, Proceedings of the ACM International Conference on Object Oriented Programming Systems Languages and Applications, OOPSLA '10, pp.739-753, 2010.

D. F. Bacon, R. Konuru, C. Murthy, and M. Serrano, Thin locks: featherweight synchronization for Java, Proceedings of the ACM SIGPLAN 1998 Conference on Programming Language Design and Implementation, PLDI '98, pp.258-268, 1998.

S. Benedict, V. Petkov, and M. Gerndt, PERISCOPE: An Online-Based Distributed Performance Analysis Tool, Tools for High Performance Computing 2009 -Proceedings of the 3rd International Workshop on Parallel Tools for High Performance Computing, pp.1-16, 2009.
DOI : 10.1007/978-3-642-11261-4_1

W. Binder, A Portable and Customizable Profiling Framework for Java Based on Bytecode Instruction Counting, Proceedings of the Third Asian Conference on Programming Languages and Systems, APLAS '05, pp.178-194, 2005.
DOI : 10.1007/11575467_13

W. Binder and J. Hulaas, A portable CPU-management framework for Java, IEEE Internet Computing, vol.8, issue.5, 2004.
DOI : 10.1109/MIC.2004.28

S. M. Blackburn, R. Garner, C. Hoffmann, A. M. Khang, K. S. Mckinley et al., The DaCapo Benchmarks: Java Benchmarking Development and Analysis, Proceedings of the 21st Annual ACM SIGPLAN Conference on Object-oriented Programming Systems, Languages, and Applications, OOPSLA '06, pp.169-190, 2006.

S. Boyd-wickizer, M. F. Kaashoek, R. Morris, and N. Zeldovich, Non-scalable Locks are Dangerous, Proceedings of the Linux Symposium, 2012.

R. Bryant and J. Hawkes, Lockmeter: Highly-informative Instrumentation for Spin Locks in the Linux Kernel, Proceedings of the 4th Annual Linux Showcase & Conference, pp.271-282, 2000.

A. Chanda, A. L. Cox, and W. Zwaenepoel, Whodunit: Transactional Profiling for Multitier Applications, Proceedings of the 2Nd ACM SIGOPS/EuroSys European Conference on Computer Systems, p.7, 2007.

F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach et al., Bigtable, Proceedings of the 7th Symposium on Operating Systems Design and Implementation, OSDI '06, pp.205-218, 2006.
DOI : 10.1145/1365815.1365816

M. Chow, D. Meisner, J. Flinn, D. Peek, and T. F. Wenisch, The Mystery Machine: Endto-end Performance Analysis of Large-scale Internet Services, Proceedings of the 11th USENIX Conference on Operating Systems Design and Implementation, OSDI '14, 2014.

K. Coulomb, A. Degomme, M. Faverge, and F. Trahay, An Open-Source Tool-Chain for Performance Analysis, Parallel Tools Workshop, pp.37-48, 2011.
DOI : 10.1007/978-3-642-31476-6_4

URL : https://hal.archives-ouvertes.fr/hal-00707236

M. Curtiss, I. Becker, T. Bosman, S. Doroshenko, L. Grijincu et al., Unicorn, Proc. VLDB Endow, pp.1150-1161, 2013.
DOI : 10.14778/2536222.2536239

M. Dashti, A. Fedorova, J. Funston, F. Gaud, R. Lachaize et al., Traffic Management: A Holistic Approach to Memory Placement on NUMA Systems, Proceedings of the Eighteenth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS '13, pp.381-394, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00945758

F. David, Profiler dynamique de contention pour les verrous des applications java. Com- PAS '13, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00937220

F. David, G. Thomas, J. Lawall, and G. Muller, Continuously Measuring Critical Section Pressure with the Free Lunch Profiler
URL : https://hal.archives-ouvertes.fr/hal-00957154

F. David, G. Thomas, J. Lawall, and G. Muller, Continuously Measuring Critical Section Pressure with the Free-Lunch Profiler, Proceedings of the 2014 ACM International Conference on Object Oriented Programming Systems Languages & Applications, OOPSLA '14, pp.291-307, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00957154

J. Davidson, B. Liebald, J. Liu, P. Nandy, T. Van-vleet et al., The YouTube video recommendation system, Proceedings of the fourth ACM conference on Recommender systems, RecSys '10, pp.293-296, 2010.
DOI : 10.1145/1864708.1864770

J. Dean and S. Ghemawat, MapReduce, Proceedings of the 6th Conference on Symposium on Operating System Design and Implementation, OSDI '04, pp.107-113, 2004.
DOI : 10.1145/1327452.1327492

G. Decandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman et al., Dynamo: Amazon's highly available key-value store, Proceedings of Twenty-first ACM SIGOPS Symposium on Operating Systems Principles, SOSP '07, pp.205-220, 2007.

J. Demme and S. Sethumadhavan, Rapid identification of architectural bottlenecks via precise event counting, Proceedings of the 38th Annual International Symposium on Computer Architecture, ISCA '11, pp.353-364, 2011.

D. Dice, Implementing fast Java TM monitors with relaxed-locks, Proceedings of the 2001 Symposium on JavaTM Virtual Machine Research and Technology Symposium - Volume 1, JVM '01, pp.79-90, 2001.

D. Dice, M. Moir, and W. Scherer, Quickly reacquirable locks, 2003.

J. Dongarra, K. London, S. Moore, P. Mucci, D. Terpstra et al., Experiences and lessons learned with a portable interface to hardware performance counters, Proceedings International Parallel and Distributed Processing Symposium, p.6, 2003.
DOI : 10.1109/IPDPS.2003.1213517

K. D. Bois, J. B. Sartor, S. Eyerman, and L. Eeckhout, Bottle graphs: visualizing scalability bottlenecks in multi-threaded applications, Proceedings of the 2013 ACM SIGPLAN International Conference on Object Oriented Programming Systems Languages and Applications , OOPSLA '13, pp.355-372, 2013.

. Extrae, http://www.bsc.es/computer-sciences/extrae, 2015.

S. Eyerman and L. Eeckhout, Modeling Critical Sections in Amdahl's Law and Its Implications for Multicore Design, Proceedings of the 37th Annual International Symposium on Computer Architecture, ISCA '10, pp.362-370, 2010.

N. Geoffray, G. Thomas, J. Lawall, G. Muller, and B. Folliot, VMKit, Proceedings of the 6th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments, VEE '10, pp.51-62, 2010.
DOI : 10.1145/1735997.1736006

URL : https://hal.archives-ouvertes.fr/inria-00354577

K. Glerum, K. Kinshumann, S. Greenberg, G. Aul, V. Orgovan et al., Debugging in the (very) large, Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles, SOSP '09, pp.103-116, 2009.
DOI : 10.1145/1629575.1629586

J. Gosling, B. Joy, G. Steele, and G. Bracha, The Java TM language specification, 2005.

Y. Huang, Z. Cui, L. Chen, W. Zhang, Y. Bao et al., HaLock, Proceedings of the 21st international conference on Parallel architectures and compilation techniques, PACT '12, pp.253-262, 2012.
DOI : 10.1145/2370816.2370854

H. Inoue and T. Nakatani, How a Java VM can get more from a hardware performance monitor, Proceedings of the 24th ACM SIGPLAN Conference on Object Oriented Programming Systems Languages and Applications, OOPSLA '09, pp.137-154, 2009.

R. Jones, A. Hosking, and E. Moss, The garbage collection handbook: the art of automatic memory management, 2011.
DOI : 10.1201/9781315388021

J. Java and T. , Virtual Machine Debug Interface, 2014.

J. Java and T. , Virtual Machine Tool Interface, 2014.

T. Kalibera, M. Mole, R. Jones, and J. Vitek, A Black-box Approach to Understanding Concurrency in DaCapo, Proceedings of the ACM International Conference on Object Oriented Programming Systems Languages and Applications, OOPSLA '12, pp.335-354, 2012.

K. Kawachiya, A. Koseki, and T. Onodera, Lock Reservation: Java Locks Can Mostly Do Without Atomic Operations, Proceedings of the 17th ACM SIGPLAN Conference on Object-oriented Programming, Systems, Languages, and Applications, OOPSLA '02, pp.130-141, 2002.

Y. Koren, Factorization meets the neighborhood, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD 08, pp.426-434, 2008.
DOI : 10.1145/1401890.1401944

Y. Koren, Collaborative filtering with temporal dynamics, Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '09, pp.447-456, 2009.

R. Lachaize, B. Lepers, and V. Quéma, MemProf: A Memory Profiler for NUMA Multicore Systems, Proceedings of the 2012 USENIX Conference on Annual Technical Conference, USENIX ATC'12, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00945731

A. Lakshman and P. Malik, Cassandra: A Structured Storage System on a P2P Network, Proceedings of the Twenty-first Annual Symposium on Parallelism in Algorithms and Architectures, SPAA '09, 2009.

D. Lea, The java.util.concurrent synchronizer framework, Science of Computer Programming, vol.58, issue.3, pp.293-309, 2005.
DOI : 10.1016/j.scico.2005.03.007

T. Liu and E. D. Berger, Sheriff: Precise detection and automatic mitigation of false sharing, Proceedings of the 2011 ACM International Conference on Object Oriented Programming Systems Languages and Applications, 2011.

J. Lozi, F. David, G. Thomas, J. Lawall, and G. Muller, Remote Core Locking: migrating critical-section execution to improve the performance of multithreaded applications, Proceedings of the 2012 USENIX Annual Technical Conference, USENIX ATC'12, pp.65-76, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00991709

J. Manson, W. Pugh, and S. V. Adve, The Java memory model, Proceedings of the 32Nd ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, POPL '05, pp.378-391, 2005.

M. Milenkovic, S. Jones, F. Levine, and E. Pineda, Performance inspector tools with instruction tracing and per-thread / function profiling, Linux Symposium, 2008.

N. Mitchell and P. F. Sweeney, On-the-fly Capacity Planning, Proceedings of the 2013 ACM SIGPLAN International Conference on Object Oriented Programming Systems Languages & Applications, OOPSLA '13, pp.849-866, 2013.

S. Multicore, https://www.ibm.com/developerworks/mydeveloperworks/groups/service, pp.9-29, 2014.

. Mutrace, Measuring Lock Contention, 2014.

M. S. Müller, A. Knüpfer, M. Jurenz, M. Lieber, H. Brunst et al., Developing scalable applications with vampir, vampirserver and vampirtrace, PARCO, volume 15 of Advances in Parallel Computing, pp.637-644, 2007.

K. Nagaraj, C. Killian, and J. Neville, Structured comparative analysis of systems logs to diagnose performance problems, Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation, NSDI '12, 2012.

W. E. Nagel, A. Arnold, M. Weber, H. Hoppe, and K. Solchenbach, Vampir: Visualization and analysis of mpi resources, pp.69-80, 1996.

T. Onodera and K. Kawachiya, A Study of Locking Objects with Bimodal Fields, Proceedings of the 14th ACM SIGPLAN Conference on Object-oriented Programming, Systems , Languages, and Applications, OOPSLA '99, pp.223-237, 1999.

T. Onodera, K. Kawachiya, and A. Koseki, Lock Reservation for Java Reconsidered, ECOOP, pp.559-583, 2004.
DOI : 10.1007/978-3-540-24851-4_26

K. Ousterhout, P. Wendell, M. Zaharia, and I. Stoica, Sparrow, Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, SOSP '13, pp.69-84, 2013.
DOI : 10.1145/2517349.2522716

S. Patil, M. Polte, K. Ren, W. Tantisiriroj, L. Xiao et al., YCSB++, Proceedings of the 2nd ACM Symposium on Cloud Computing, SOCC '11, pp.1-914, 2011.
DOI : 10.1145/2038916.2038925

A. Pesterev, N. Zeldovich, and R. T. Morris, Locating cache performance bottlenecks using data profiling, Proceedings of the 5th European conference on Computer systems, EuroSys '10, pp.335-348, 2010.
DOI : 10.1145/1755913.1755947

V. Pillet, J. Labarta, T. Cortes, and S. Girona, PARAVER: A Tool to Visualize and Analyze Parallel Code, Proceedings of WoTUG-18: Transputer and occam Developments, pp.17-31, 1995.

F. Pizlo, D. Frampton, and A. L. Hosking, Fine-grained adaptive biased locking, Proceedings of the 9th International Conference on Principles and Practice of Programming in Java, PPPJ '11, pp.171-181, 2011.
DOI : 10.1145/2093157.2093184

P. Reynolds, J. L. Wiener, J. C. Mogul, M. K. Aguilera, and A. Vahdat, WAP5, Proceedings of the 15th international conference on World Wide Web , WWW '06, p.6, 2006.
DOI : 10.1145/1135777.1135830

K. Russell and D. Detlefs, Eliminating Synchronization-related Atomic Operations with Biased Locking and Bulk Rebiasing, Proceedings of the 21st Annual ACM SIGPLAN Conference on Object-oriented Programming Systems, Languages, and Applications, OOP- SLA '06, pp.263-272, 2006.

M. Schulz, J. Galarowicz, D. Maghrak, W. Hachfeld, D. Montoya et al., Open | speedshop: An open source infrastructure for parallel performance analysis, Sci. Program, vol.16, issue.2-3, pp.105-121, 2008.

S. S. Shende and A. D. Malony, The Tau Parallel Performance System, International Journal of High Performance Computing Applications, vol.20, issue.2, pp.287-311, 2006.
DOI : 10.1177/1094342006064482

N. R. Tallent, J. M. Mellor-crummey, and A. Porterfield, Analyzing lock contention in multithreaded applications, Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP '10, pp.269-280, 2010.

F. Trahay, F. Rué, M. Faverge, Y. Ishikawa, R. Namyst et al., EZTrace: A Generic Framework for Performance Analysis, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, 2011.
DOI : 10.1109/CCGrid.2011.83

URL : https://hal.archives-ouvertes.fr/inria-00587216

F. Trahay, E. Brunet, M. M. Bouksiaa, and J. Liao, Selecting Points of Interest in Traces Using Patterns of Events, 2015 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, p.15, 2015.
DOI : 10.1109/PDP.2015.30

URL : https://hal.archives-ouvertes.fr/hal-01257904

N. Vasudevan, K. S. Namjoshi, and S. A. Edwards, Simple and fast biased locks, Proceedings of the 19th international conference on Parallel architectures and compilation techniques, PACT '10, pp.65-74, 2010.
DOI : 10.1145/1854273.1854287

F. Xian, W. , and H. Jiang, Contention-aware scheduler: unlocking execution parallelism in multithreaded Java programs, Proceedings of the 23rd ACM SIGPLAN Conference on Object-oriented Programming Systems Languages and Applications, OOP- SLA '08, pp.163-180, 2008.

W. Xiong, S. Park, J. Zhang, Y. Zhou, and Z. Ma, Ad Hoc Synchronization Considered Harmful, Proceedings of the 9th USENIX Conference on Operating Systems Design and Implementation, OSDI '10, pp.1-8, 2010.

. Yourkit, Yourkit home page, 2014.

Y. Yu, T. Rodeheffer, and W. Chen, Racetrack: Efficient detection of data race conditions via adaptive tracking, Proceedings of the Twentieth ACM Symposium on Operating Systems Principles, SOSP '05, pp.221-234, 2005.

M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica, Spark: Cluster Computing with Working Sets, Proceedings of the 2Nd USENIX Conference on Hot Topics in Cloud Computing, 2010.

X. Zhao, Y. Zhang, D. Lion, M. F. Ullah, Y. Luo et al., lprof: A Non-intrusive Request Flow Profiler for Distributed Systems, Proceedings of the 11th USENIX Conference on Operating Systems Design and Implementation, OSDI '14, pp.629-644, 2014.