The Design and Implementation of FFTW3, Proceedings of the IEEE, pp.216-231, 2005. ,
DOI : 10.1109/JPROC.2004.840301
The Component Architecture of Open MPI : Enabling Third-Party Collective Algorithms, Component Models and Systems for Grid Applications, pp.167-185, 2005. ,
Component Software : Beyond Object-Oriented Programming, 2002. ,
High Performance Composition Operators in Component Models In High Performance Computing : From Grids and Clouds to Exascale, of Advances in Parallel Computing, pp.182-201, 2011. ,
Source-to-Source Architecture Transformation for Performance Optimization in BIP, IEEE Transactions on Industrial Informatics, vol.6, issue.4, pp.708-718, 2010. ,
DOI : 10.1109/TII.2010.2069102
URL : https://hal.archives-ouvertes.fr/hal-00722520
A low level component model easing performance portability of HPC applications. Computing, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00911231
Some Computer Organizations and Their Effectiveness, IEEE Trans. Comput, vol.21, issue.9, pp.948-960, 1972. ,
A survey of parallel computer architectures, Computer, vol.23, issue.2, pp.5-16, 1990. ,
DOI : 10.1109/2.44900
OpenMP: an industry standard API for shared-memory programming, IEEE Computational Science and Engineering, vol.5, issue.1, pp.46-55, 1998. ,
DOI : 10.1109/99.660313
CHARM++ : A Portable Concurrent Object Oriented System Based on C++, SIGPLAN Not, vol.28, issue.10, pp.91-108, 1993. ,
OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems, Computing in Science & Engineering, vol.12, issue.3, pp.66-73, 2010. ,
DOI : 10.1109/MCSE.2010.69
OpenACC ??? First Experiences with Real-World Applications, Euro-Par 2012 Parallel Processing, pp.859-870, 2012. ,
DOI : 10.1007/978-3-642-32820-6_85
P3DFFT: A Framework for Parallel Computations of Fourier Transforms in Three Dimensions, SIAM Journal on Scientific Computing, vol.34, issue.4, p.2012 ,
DOI : 10.1137/11082748X
3D FFT with 2D decomposition. CS project report http://cmb.ornl.gov/ Members, 2008. ,
Use of Fast Fourier Transforms for Solving Partial Differential Equations in Physics, J. Comput. Phys, vol.9, issue.3, pp.440-65, 1972. ,
The multiplication of very large integers using the discrete fast Fourier transform, ACM SIGSAC Review, vol.9, issue.3, pp.26-27, 1991. ,
DOI : 10.1145/127024.127033
An algorithm for the machine calculation of complex Fourier series, Mathematics of Computation, vol.19, issue.90, pp.297-301, 1965. ,
DOI : 10.1090/S0025-5718-1965-0178586-1
Efficient algorithms for all-to-all communications in multi-port message-passing systems, Proceedings of the sixth annual ACM symposium on Parallel algorithms and architectures , SPAA '94, pp.298-309, 1994. ,
DOI : 10.1145/181014.181756
Bandwidth-optimal all-to-all exchanges in fat tree networks, Proceedings of the 27th international ACM conference on International conference on supercomputing, ICS '13, pp.139-148, 2013. ,
DOI : 10.1145/2464996.2465434
The Component Architecture of Open MPI : Enabling Third-Party Collective Algorithms, Proceedings, 18th ACM International Conference on Supercomputing, Workshop on Component Models and Systems for Grid Applications, pp.167-185, 2004. ,
Optimization of Collective Communication Operations in MPICH, International Journal of High Performance Computing Applications, vol.19, issue.1, pp.49-66, 2005. ,
DOI : 10.1177/1094342005051521
2DECOMP&FFT ? A highly scalable 2D decomposition library and FFT interface, Cray User Group 2010 conference, 2010. ,
High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters : a study with parallel 3D FFT, Computer Science -R&D, vol.26, pp.3-4237, 2011. ,
New algorithms for digital convolution, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.25, issue.5, pp.392-410, 1977. ,
DOI : 10.1109/TASSP.1977.1162981
Eigen v3, 2010. ,
GNU Scientific Library Reference Manual -Third Edition, Network Theory Ltd, 2009. ,
SPL : A Language and Compiler for DSP Algorithms, Programming Languages Design and Implementation (PLDI), pp.298-308, 2001. ,
Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers, International Symposium on Parallel and Distributed Processing and Application (ISPA), pp.818-832, 2006. ,
DOI : 10.1007/11946441_74
Mass-produced Software Components, Proc. NATO Conf. on Software Engineering, 1968. ,
The design and implementation of hierarchical software systems with reusable components, ACM Transactions on Software Engineering and Methodology, vol.1, issue.4, pp.355-398, 1992. ,
DOI : 10.1145/136586.136587
The FRACTAL component model and its support in Java, Software: Practice and Experience, vol.6, issue.11-12, pp.11-121257, 2006. ,
DOI : 10.1002/spe.767
Du support générique d'opérateurs de composition dans les modèles de composants logiciels, application au calcul scientifique, 2010. ,
Increasing Reuse in Component Models through Genericity, Proceedings of the 11th International Conference on Software Reuse, ICSR '09, pp.21-30, 2009. ,
DOI : 10.1007/3-540-51084-2_2
URL : https://hal.archives-ouvertes.fr/inria-00457905
The Common Object Request Broker : Architecture and Specification, 1995. ,
GCM: a grid extension to Fractal for autonomous distributed components, annals of telecommunications - annales des t??l??communications, vol.36, issue.1, pp.5-24, 2009. ,
DOI : 10.1007/s12243-008-0068-8
URL : https://hal.archives-ouvertes.fr/inria-00323919
Toward a common component architecture for high-performance scientific computing, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469), p.13, 1999. ,
DOI : 10.1109/HPDC.1999.805289
Using the EXECO toolbox to perform automatic and reproducible cloud experiments, 1st International Workshop on UsiNg and building ClOud Testbeds (UNICO, collocated with IEEE CloudCom 2013, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00861886