M. Frigo and S. G. Johnson, The Design and Implementation of FFTW3, Proceedings of the IEEE, pp.216-231, 2005.
DOI : 10.1109/JPROC.2004.840301

M. Jeffrey, A. Squyres, and . Lumsdaine, The Component Architecture of Open MPI : Enabling Third-Party Collective Algorithms, Component Models and Systems for Grid Applications, pp.167-185, 2005.

C. Szyperski, Component Software : Beyond Object-Oriented Programming, 2002.

J. Bigot and C. Pérez, High Performance Composition Operators in Component Models In High Performance Computing : From Grids and Clouds to Exascale, of Advances in Parallel Computing, pp.182-201, 2011.

M. Bozga, M. Jaber, and J. Sifakis, Source-to-Source Architecture Transformation for Performance Optimization in BIP, IEEE Transactions on Industrial Informatics, vol.6, issue.4, pp.708-718, 2010.
DOI : 10.1109/TII.2010.2069102

URL : https://hal.archives-ouvertes.fr/hal-00722520

J. Bigot, Z. Hou, C. Pérez, and V. Pichon, A low level component model easing performance portability of HPC applications. Computing, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00911231

J. Michael and . Flynn, Some Computer Organizations and Their Effectiveness, IEEE Trans. Comput, vol.21, issue.9, pp.948-960, 1972.

R. Duncan, A survey of parallel computer architectures, Computer, vol.23, issue.2, pp.5-16, 1990.
DOI : 10.1109/2.44900

L. Dagum and R. Menon, OpenMP: an industry standard API for shared-memory programming, IEEE Computational Science and Engineering, vol.5, issue.1, pp.46-55, 1998.
DOI : 10.1109/99.660313

V. Laxmikant, S. Kale, and . Krishnan, CHARM++ : A Portable Concurrent Object Oriented System Based on C++, SIGPLAN Not, vol.28, issue.10, pp.91-108, 1993.

J. E. Stone, D. Gohara, and G. Shi, OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems, Computing in Science & Engineering, vol.12, issue.3, pp.66-73, 2010.
DOI : 10.1109/MCSE.2010.69

S. Wienke, P. Springer, C. Terboven, and D. Mey, OpenACC ??? First Experiences with Real-World Applications, Euro-Par 2012 Parallel Processing, pp.859-870, 2012.
DOI : 10.1007/978-3-642-32820-6_85

D. Pekurovsky, P3DFFT: A Framework for Parallel Computations of Fourier Transforms in Three Dimensions, SIAM Journal on Scientific Computing, vol.34, issue.4, p.2012
DOI : 10.1137/11082748X

R. Schulz, 3D FFT with 2D decomposition. CS project report http://cmb.ornl.gov/ Members, 2008.

R. C. Le and . Bail, Use of Fast Fourier Transforms for Solving Partial Differential Equations in Physics, J. Comput. Phys, vol.9, issue.3, pp.440-65, 1972.

D. Guinier, The multiplication of very large integers using the discrete fast Fourier transform, ACM SIGSAC Review, vol.9, issue.3, pp.26-27, 1991.
DOI : 10.1145/127024.127033

J. Cooley and J. Tukey, An algorithm for the machine calculation of complex Fourier series, Mathematics of Computation, vol.19, issue.90, pp.297-301, 1965.
DOI : 10.1090/S0025-5718-1965-0178586-1

J. Bruck, C. Ho, S. Kipnis, and D. Weathersby, Efficient algorithms for all-to-all communications in multi-port message-passing systems, Proceedings of the sixth annual ACM symposium on Parallel algorithms and architectures , SPAA '94, pp.298-309, 1994.
DOI : 10.1145/181014.181756

B. Prisacari, G. Rodriguez, C. Minkenberg, and T. Hoefler, Bandwidth-optimal all-to-all exchanges in fat tree networks, Proceedings of the 27th international ACM conference on International conference on supercomputing, ICS '13, pp.139-148, 2013.
DOI : 10.1145/2464996.2465434

M. Jeffrey, A. Squyres, and . Lumsdaine, The Component Architecture of Open MPI : Enabling Third-Party Collective Algorithms, Proceedings, 18th ACM International Conference on Supercomputing, Workshop on Component Models and Systems for Grid Applications, pp.167-185, 2004.

R. Thakur and R. Rabenseifner, Optimization of Collective Communication Operations in MPICH, International Journal of High Performance Computing Applications, vol.19, issue.1, pp.49-66, 2005.
DOI : 10.1177/1094342005051521

N. Li and S. Laizet, 2DECOMP&FFT ? A highly scalable 2D decomposition library and FFT interface, Cray User Group 2010 conference, 2010.

K. Chaitanya-kandalla, H. Subramoni, K. A. Tomko, D. Pekurovsky, S. Sur et al., High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters : a study with parallel 3D FFT, Computer Science -R&D, vol.26, pp.3-4237, 2011.

R. Agarwal and J. Cooley, New algorithms for digital convolution, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.25, issue.5, pp.392-410, 1977.
DOI : 10.1109/TASSP.1977.1162981

G. Guennebaud and B. Jacob, Eigen v3, 2010.

B. Gough, GNU Scientific Library Reference Manual -Third Edition, Network Theory Ltd, 2009.

J. Xiong, J. Johnson, R. W. Johnson, and D. Padua, SPL : A Language and Compiler for DSP Algorithms, Programming Languages Design and Implementation (PLDI), pp.298-308, 2001.

A. Bonelli, F. Franchetti, J. Lorenz, M. Püschel, and C. W. Ueberhuber, Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers, International Symposium on Parallel and Distributed Processing and Application (ISPA), pp.818-832, 2006.
DOI : 10.1007/11946441_74

M. D. Mcilroy, Mass-produced Software Components, Proc. NATO Conf. on Software Engineering, 1968.

D. Batory, The design and implementation of hierarchical software systems with reusable components, ACM Transactions on Software Engineering and Methodology, vol.1, issue.4, pp.355-398, 1992.
DOI : 10.1145/136586.136587

E. Bruneton, T. Coupaye, M. Leclercq, V. Quéma, and J. Stefani, The FRACTAL component model and its support in Java, Software: Practice and Experience, vol.6, issue.11-12, pp.11-121257, 2006.
DOI : 10.1002/spe.767

J. Bigot, Du support générique d'opérateurs de composition dans les modèles de composants logiciels, application au calcul scientifique, 2010.

J. Bigot and C. Pérez, Increasing Reuse in Component Models through Genericity, Proceedings of the 11th International Conference on Software Reuse, ICSR '09, pp.21-30, 2009.
DOI : 10.1007/3-540-51084-2_2

URL : https://hal.archives-ouvertes.fr/inria-00457905

J. Boldt, The Common Object Request Broker : Architecture and Specification, 1995.

F. Baude, D. Caromel, C. Dalmasso, M. Danelutto, V. Getov et al., GCM: a grid extension to Fractal for autonomous distributed components, annals of telecommunications - annales des t??l??communications, vol.36, issue.1, pp.5-24, 2009.
DOI : 10.1007/s12243-008-0068-8

URL : https://hal.archives-ouvertes.fr/inria-00323919

R. Armstrong, D. Gannon, A. Geist, K. Keahey, S. Kohn et al., Toward a common component architecture for high-performance scientific computing, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469), p.13, 1999.
DOI : 10.1109/HPDC.1999.805289

M. Imbert, L. Pouilloux, J. Rouzaud-cornabas, A. Lèbre, and T. Hirofuchi, Using the EXECO toolbox to perform automatic and reproducible cloud experiments, 1st International Workshop on UsiNg and building ClOud Testbeds (UNICO, collocated with IEEE CloudCom 2013, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00861886