M. Arce-acuna and T. Aoki, Multi-gpu computing and scalability for real-time tsunami simulation, HPCS '10: Proceedings of the International Conference on High Performance Computing & Simulation, pp.125-132, 2010.

L. A. Bautista-gomez, N. Maruyama, F. Cappello, and S. Matsuoka, Distributed diskless checkpoint for large scale systems, Cluster, Cloud and Grid Computing (CCGrid) 10th IEEE/ACM International Conference on, pp.63-72, 2010.

L. A. Bautista-gomez, S. Tsuboi, D. Komatitsch, F. Cappello, N. Maruyama et al., FTI, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '11, p.32, 2011.
DOI : 10.1145/2063384.2063427

URL : https://hal.archives-ouvertes.fr/hal-00721216

A. Bhatelé, L. V. Kalé, and S. Kumar, Dynamic topology aware load balancing algorithms for molecular dynamics applications, Proceedings of the 23rd international conference on Conference on Supercomputing, ICS '09, pp.110-116, 2009.
DOI : 10.1145/1542275.1542295

S. Borkar, Designing Reliable Systems from Unreliable Components: The Challenges of Transistor Variability and Degradation, IEEE Micro, vol.25, issue.6, pp.10-16, 2005.
DOI : 10.1109/MM.2005.110

A. Bouteiller, T. Herault, G. Bosilca, and J. Dongarra, Correlated Set Coordination in Fault Tolerant Message Logging Protocols, Euro-Par 2011, pp.51-64, 2011.
DOI : 10.1007/978-3-642-23397-5_6

Z. Chen and J. Dongarra, A Scalable Checkpoint Encoding Algorithm for Diskless Checkpointing, 2008 11th IEEE High Assurance Systems Engineering Symposium, pp.71-79, 2008.
DOI : 10.1109/HASE.2008.13

Z. J. Chen, Y. He, P. Rosa-neto, J. Germann, and A. C. Evans, Revealing Modular Architecture of Human Brain Structural Networks by Using Cortical Thickness from MRI, Cerebral Cortex, vol.18, issue.10, pp.2374-2381, 2008.
DOI : 10.1093/cercor/bhn003

C. Da-lu, C. Da-lu, and D. A. Reed, Scalable diskless checkpointing for large parallel systems, 2005.

E. N. Elnozahy, System Resilience at Extreme Scale, 2008.

E. N. Elnozahy, L. Alvisi, Y. Wang, and D. B. Johnson, A survey of rollback-recovery protocols in message-passing systems, ACM Computing Surveys, vol.34, issue.3, pp.375-408, 2002.
DOI : 10.1145/568522.568525

A. Guermouche, T. Ropars, M. Snir, and F. Cappello, Hydee: An energy and memory efficient cluster-based hybrid checkpointing protocol for mpi applications, 2011.

A. Guermouche, T. Ropars, M. Snir, and F. Cappello, HydEE: Failure Containment without Event Logging for Large Scale Send-Deterministic MPI Applications, 2012 IEEE 26th International Parallel and Distributed Processing Symposium, 2012.
DOI : 10.1109/IPDPS.2012.111

URL : https://hal.archives-ouvertes.fr/hal-01121941

D. B. Johnson and W. Zwaenepoel, Sender-Based Message Logging, Digest of Papers: The 17th Annual International Symposium on Fault- Tolerant Computing, pp.14-19, 1987.

S. Kamil, J. Shalf, L. Oliker, and D. Skinner, Understanding ultra-scale application communication requirements, Proceedings of the 2005 IEEE International Symposium on Workload Characterization, pp.178-187, 2005.

N. Maruyama, T. Nomura, K. Sato, and S. Matsuoka, Physis, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '11, pp.1-11, 2011.
DOI : 10.1145/2063384.2063398

E. Meneses, C. L. Mendes, and L. V. Kale, Team-Based Message Logging: Preliminary Results, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, 2010.
DOI : 10.1109/CCGRID.2010.110

D. Meunier, R. Lambiotte, and E. T. Bullmore, Modular and Hierarchically Modular Organization of Brain Networks, Frontiers in Neuroscience, vol.4, p.11, 2010.
DOI : 10.3389/fnins.2010.00200

A. Moody, G. Bronevetsky, K. Mohror, and B. R. Supinski, Design, Modeling, and Evaluation of a Scalable Multi-level Checkpointing System, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-11, 2010.
DOI : 10.1109/SC.2010.18

R. A. Oldfield, S. Arunagiri, P. J. Teller, S. Seelam, M. R. Varela et al., Modeling the Impact of Checkpoints on Next-Generation Systems, 24th IEEE Conference on Mass Storage Systems and Technologies (MSST 2007), pp.30-46, 2007.
DOI : 10.1109/MSST.2007.4367962

J. S. Plank, K. Li, and M. A. Puening, Diskless checkpointing, IEEE Transactions on Parallel and Distributed Systems, vol.9, issue.10, pp.972-986, 1998.
DOI : 10.1109/71.730527

S. Rao, L. Alvisi, and H. M. Vin, The Cost of Recovery in Message Logging Protocols, Symposium on Reliable Distributed Systems, pp.10-18, 1998.

T. Ropars, A. Guermouche, B. Uçar, E. Meneses, L. V. Kalé et al., On the Use of Cluster-Based Partial Message Logging to Improve Fault Tolerance for MPI HPC Applications, Euro-Par 2011, pp.567-578, 2011.
DOI : 10.1002/cpe.1364

URL : https://hal.archives-ouvertes.fr/hal-00786558

M. Rubinov and O. Sporns, Complex network measures of brain connectivity: Uses and interpretations, NeuroImage, vol.52, issue.3, pp.1059-1069, 2010.
DOI : 10.1016/j.neuroimage.2009.10.003

E. Solomonik, A. Bhatele, and J. Demmel, Improving communication performance in dense linear algebra via topology aware collectives, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '11, pp.1-77, 2011.
DOI : 10.1145/2063384.2063487

J. Yang, K. F. Li, W. Li, and D. Zhang, Trading off logging overhead and coordinating overhead to achieve efficient rollback recovery, Concurrency and Computation : Practice and Experience, pp.819-853, 2009.
DOI : 10.1002/cpe.1364

C. Zhou, L. Zemanová, G. Zamora, C. C. Hilgetag, and J. Kurths, Hierarchical Organization Unveiled by Functional Connectivity in Complex Brain Networks, Physical Review Letters, vol.97, issue.23, p.97238103, 2006.
DOI : 10.1103/PhysRevLett.97.238103