C. Chilan, M. Yang, A. Cheng, and L. Arber, Parallel I/O Performance Study with HDF5, a Scientific Data Package, 2006.

H. Abbasi, M. Wolf, G. Eisenhauer, S. Klasky, K. Schwan et al., DataStager, Proceedings of the 18th ACM international symposium on High performance distributed computing, HPDC '09, pp.39-48, 2009.
DOI : 10.1145/1551609.1551618

M. Dorier, G. Antoniu, F. Cappello, M. Snir, and L. Orf, Damaris: How to Efficiently Leverage Multicore Parallelism to Achieve Scalable, Jitter-free I/O, 2012 IEEE International Conference on Cluster Computing, pp.155-163, 2012.
DOI : 10.1109/CLUSTER.2012.26
URL : https://hal.archives-ouvertes.fr/hal-00715252

J. Bent, G. Gibson, G. Grider, B. Mcclelland, P. Nowoczynski et al., PLFS, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC '09, pp.1-12, 2009.
DOI : 10.1145/1654059.1654081

X. Zhang, K. Davis, and S. Jiang, IOrchestrator: Improving the Performance of Multi-node I/O Systems via Inter-Server Coordination, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-11, 2010.
DOI : 10.1109/SC.2010.30

A. Batsakis, R. Burns, A. Kanevsky, J. Lentini, and T. Talpey, CA-NFS, FAST '09, pp.99-110, 2009.
DOI : 10.1145/1629080.1629085

H. Shan and J. Shalf, Using IOR to Analyze the I/O Performance for HPC Platforms, Cray User Group Conference, 2007.

R. Nathuji, A. Kansal, and A. Ghaffarkhah, Q-clouds, Proceedings of the 5th European conference on Computer systems, EuroSys '10, pp.237-250, 2010.
DOI : 10.1145/1755913.1755938

X. Pu, L. Liu, Y. Mei, S. Sivathanu, Y. Koh et al., Understanding Performance Interference of I/O Workload in Virtualized Cloud Environments, 2010 IEEE 3rd International Conference on Cloud Computing, pp.51-58, 2010.
DOI : 10.1109/CLOUD.2010.65

D. Skinner and W. Kramer, Understanding the causes of performance variability in HPC workloads, IEEE International. 2005 Proceedings of the IEEE Workload Characterization Symposium, 2005., pp.137-149, 2005.
DOI : 10.1109/IISWC.2005.1526010

A. Uselton, M. Howison, N. Wright, D. Skinner, N. Keen et al., Parallel I/O performance: From events to ensembles, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), pp.1-11, 2010.
DOI : 10.1109/IPDPS.2010.5470424

J. A. Zounmevo, D. Kimpe, R. Ross, and A. Afsahi, Using MPI in high-performance computing services, Proceedings of the 20th European MPI Users' Group Meeting on, EuroMPI '13, pp.43-48, 2013.
DOI : 10.1145/2488551.2488556

S. Lang, P. Carns, R. Latham, R. Ross, K. Harms et al., I/O performance challenges at leadership scale, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC '09, pp.1-12, 2009.
DOI : 10.1145/1654059.1654100

R. Thakur, W. Gropp, and E. Lusk, Data sieving and collective I/O in ROMIO, Proceedings. Frontiers '99. Seventh Symposium on the Frontiers of Massively Parallel Computation, pp.182-182, 1999.
DOI : 10.1109/FMPC.1999.750599

J. Prost, R. Treumann, R. Hedges, B. Jia, and A. Koniges, MPI-IO/GPFS, an optimized implementation of MPI-IO on top of GPFS, Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM) , Supercomputing '01, pp.17-17, 2001.
DOI : 10.1145/582034.582051

P. Dickens and J. Logan, Towards a High Performance Implementation of MPI-IO on the Lustre File System, On the Move to Meaningful Internet Systems OTM '08, 2008.
DOI : 10.1007/978-3-540-88871-0_61

K. Ohta, H. Matsuba, and Y. Ishikawa, Improving Parallel Write by Node-Level Request Scheduling, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, pp.196-203, 2009.
DOI : 10.1109/CCGRID.2009.71

X. Zhang, K. Davis, and S. Jiang, Opportunistic Data-driven Execution of Parallel Programs for Efficient I/O Services, 2012 IEEE 26th International Parallel and Distributed Processing Symposium, pp.330-341, 2012.
DOI : 10.1109/IPDPS.2012.39

J. Lofstead, F. Zheng, Q. Liu, S. Klasky, R. Oldfield et al., Managing Variability in the IO Performance of Petascale Storage Systems, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-12, 2010.
DOI : 10.1109/SC.2010.32

M. Li, S. Vazhkudai, A. Butt, F. Meng, X. Ma et al., Functional Partitioning to Optimize End-to-End Performance on Many-core Architectures, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-12, 2010.
DOI : 10.1109/SC.2010.28

A. Moody, G. Bronevetsky, K. Mohror, and B. R. De-supinski, Design, Modeling, and Evaluation of a Scalable Multi-level Checkpointing System, ACM/IEEE SC '10, pp.1-11, 2010.

R. Prabhakar, S. Vazhkudai, Y. Kim, A. Butt, M. Li et al., Provisioning a Multi-tiered Data Staging Area for Extreme-Scale Machines, 2011 31st International Conference on Distributed Computing Systems, pp.1-12, 2011.
DOI : 10.1109/ICDCS.2011.33

J. Fu, R. Latham, M. Min, and C. D. Carothers, I/O threads to reduce checkpoint blocking for an electromagnetics solver on Blue Gene/P and Cray XK6, Proceedings of the 2nd International Workshop on Runtime and Operating Systems for Supercomputers, ROSS '12, pp.1-8, 2012.
DOI : 10.1145/2318916.2318919

X. Ouyang, K. Gopalakrishnan, T. Gangadharappa, and D. Panda, Fast checkpointing by Write Aggregation with Dynamic Buffer and Interleaving on multicore architecture, 2009 International Conference on High Performance Computing (HiPC), pp.99-108, 2009.
DOI : 10.1109/HIPC.2009.5433218

X. Ma, J. Lee, and M. Winslett, High-level buffering for hiding periodic output cost in scientific simulations, IEEE Transactions on Parallel and Distributed Systems, vol.17, issue.3, pp.193-204, 2006.
DOI : 10.1109/TPDS.2006.36

A. Nisar, W. Liao, and A. Choudhary, Scaling parallel I/O performance through I/O delegate and caching system, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis, 2008.
DOI : 10.1109/SC.2008.5214358
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.369.8862

Y. Qian, E. Barton, T. Wang, N. Puntambekar, and A. Dilger, A??Novel network request scheduler for a??large scale storage system, Computer Science - Research and Development, vol.7, issue.10, pp.143-148, 2009.
DOI : 10.1007/s00450-009-0073-9

S. Donovan, G. Huizenga, A. J. Hutton, C. C. Ross, M. K. Petersen et al., Lustre: Building a File System for 1000-Node Clusters, 2003.

H. Song, Y. Yin, X. Sun, R. Thakur, and S. Lang, Server-side I/O coordination for parallel file systems, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '11, pp.1-11, 2011.
DOI : 10.1145/2063384.2063407
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.228.7941

X. Zhang, K. Davis, and S. Jiang, QoS support for end users of I/O-intensive applications using shared storage systems, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '11, pp.1-12, 2011.
DOI : 10.1145/2063384.2063408

A. Lebre, G. Huard, Y. Denneulin, and P. Sowa, I/O Scheduling Service for Multi-Application Clusters, 2006 IEEE International Conference on Cluster Computing, pp.1-8, 2006.
DOI : 10.1109/CLUSTR.2006.311854
URL : https://hal.archives-ouvertes.fr/hal-00486929

Y. Tanimura, R. Filgueira, I. Kojima, and M. Atkinson, Poster: Reservation-Based I/O Performance Guarantee for MPI-IO Applications Using Shared Storage Systems, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, pp.1384-1384, 2012.
DOI : 10.1109/SC.Companion.2012.204