S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci, A Portable Programming Interface for Performance Evaluation on Modern Processors, International Journal of High Performance Computing Applications, vol.14, issue.3, pp.189-204, 2000.
DOI : 10.1177/109434200001400303

J. Chen, W. Watson III, R. Edwards, and W. Mao, Message Passing for Linux Clusters with Gigabit Ethernet Mesh Connections, IPDPS'05: Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 9, 2005.

H. K. Chu, Zero-Copy TCP in Solaris, Proceedings of the USENIX Annual Technical Conference, pp.253-264, 1996.

M. Frigo, C. E. Leiserson, H. Prokop, and S. Ramachandran, Cache-Oblivious Algorithms, Proceedings of the 40th Annual Symposium on Foundations of Computer Science, 1999.

E. Gabriel, G. E. Fagg, G. Bosilca, T. Angskun, J. J. Dongarra et al., Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation, Proceedings, 11th European PVM/MPI Users' Group Meeting, pp.97-104, 2004.

B. Goglin, Design and implementation of Open-MX: High-performance message passing over generic Ethernet hardware, 2008 IEEE International Symposium on Parallel and Distributed Processing, 2008.
DOI : 10.1109/IPDPS.2008.4536140

URL : https://hal.archives-ouvertes.fr/inria-00210704

B. Goglin, Improving message passing over Ethernet with I/OAT copy offload in Open-MX, 2008 IEEE International Conference on Cluster Computing, pp.223-231, 2008.
DOI : 10.1109/CLUSTR.2008.4663775

URL : https://hal.archives-ouvertes.fr/inria-00288757

B. Goglin, NIC-Assisted Cache-Efficient Receive Stack for Message Passing over Ethernet, Proceedings of the 15th International Euro-Par Conference, pp.1065-1077, 2009.
DOI : 10.1145/1080695.1069976

URL : https://hal.archives-ouvertes.fr/inria-00379168

L. Grossman, Large Receive Offload Implementation in Neterion 10GbE Ethernet Driver, Proceedings of the Linux Symposium (OLS2005), pp.195-200, 2005.

R. Huggahalli, R. Iyer, and S. Tetrick, Direct Cache Access for High Bandwidth Network I/O, SIGARCH Computer Architecture News, pp.50-59, 2005.

S. Karlsson, S. Passas, G. Kotsis, and A. Bilas, MultiEdge: An Edge-based Communication Subsystem for Scalable Commodity Servers, 2007 IEEE International Parallel and Distributed Processing Symposium, p.28, 2007.
DOI : 10.1109/IPDPS.2007.370218

Myricom, Inc., Myrinet Express (MX): A High Performance, Low-Level, Message-Passing Interface for Myrinet, 2006.

S. Passas, K. Magoutis, and A. Bilas, Towards 100 Gbit/s Ethernet, Proceedings of the 23rd International Conference on Supercomputing, ICS '09, pp.214-224, 2009.
DOI : 10.1145/1542275.1542308

M. J. Rashti and A. Afsahi, 10-Gigabit iWARP Ethernet: Comparative Performance Analysis with InfiniBand and Myrinet-10G, Proceedings of the International Workshop on Communication Architecture for Clusters (CAC), held in conjunction with IPDPS'07, p.234, 2007.

P. Shivam, P. Wyckoff, and D. K. Panda, EMP: Zero-copy OS-bypass NIC-driven Gigabit Ethernet Message Passing, Proceedings of the 2001 ACM/IEEE Conference on Supercomputing (CDROM), Supercomputing '01, p.57, 2001.
DOI : 10.1145/582034.582091

S. Sumimoto, K. Ooe, K. Kumon, T. Boku, M. Sato et al., A Scalable Communication Layer for Multi-Dimensional Hyper Crossbar Network Using Multiple Gigabit Ethernet, Proceedings of the 20th Annual International Conference on Supercomputing, ICS '06, pp.107-115, 2006.
DOI : 10.1145/1183401.1183418

P. Willmann, S. Rixner, and A. L. Cox, An Evaluation of Network Stack Parallelization Strategies in Modern Operating Systems, Proceedings of the USENIX Technical Conference, pp.91-96, 2006.

Z. Yi and P. P. Waskiewicz, Enabling Linux Network Support of Hardware Multiqueue Devices, Proceedings of the Linux Symposium (OLS2007), pp.305-310, 2007.