J. W. Cooley and J. W. Tukey, An algorithm for the machine calculation of complex Fourier series, Mathematics of Computation, vol.19, issue.90, pp.297-301, 1965.
DOI : 10.1090/S0025-5718-1965-0178586-1

O. Ayala, W. W. Grabowski, and L. P. Wang, A hybrid approach for simulating turbulent collisions of hydrodynamically-interacting particles, Journal of Computational Physics, vol.225, issue.1, pp.51-73, 2007.
DOI : 10.1016/j.jcp.2006.11.016

K. Laasonen, A. Pasquarello, R. Car, C. Lee, and D. Vanderbilt, Car-Parrinello molecular dynamics with Vanderbilt ultrasoft pseudopotentials, Physical Review B, vol.47, issue.16, p.10142, 1993.
DOI : 10.1103/PhysRevB.47.10142

E. J. Bylaska, M. Valiev, R. Kawai, and J. H. Weare, Parallel implementation of the projector augmented plane wave method for charged systems, Computer Physics Communications, vol.143, issue.1, pp.11-28, 2002.
DOI : 10.1016/S0010-4655(01)00413-1

H. Calandra, F. Bothorel, and P. Vezolle, A massively parallel implementation of the common azimuth pre-stack depth migration, IBM Journal of Research and Development, vol.52, issue.1.2, pp.83-91, 2008.
DOI : 10.1147/rd.521.0083

S. Stellmach and U. Hansen, An efficient spectral method for the simulation of dynamos in Cartesian geometry and its implementation on massively parallel computers, Geochemistry, Geophysics, Geosystems, vol.9, issue.5, p.Q05003, 2008.
DOI : 10.1029/2007GC001778

L. Wang, O. Ayala, H. Parishani, W. Grabowski, A. Wyszogrodzki et al., Towards an integrated multiscale simulation of turbulent clouds on PetaScale computers, Journal of Physics: Conference Series, vol.318, issue.7, p.072021, 2011.
DOI : 10.1088/1742-6596/318/7/072021

P. Dmitruk, L. P. Wang, W. Matthaeus, R. Zhang, and D. Seckel, Scalable parallel FFT for spectral simulations on a Beowulf cluster, Parallel Computing, vol.27, issue.14, p.1921, 2001.
DOI : 10.1016/S0167-8191(01)00120-X

N. Li and S. Laizet, 2DECOMP&FFT: a highly scalable 2D decomposition library and FFT interface, 2010.

D. Pekurovsky, Ultrascalable Fourier transforms in three dimensions, Proceedings of the 2011 TeraGrid Conference: Extreme Digital Discovery, ser. TG '11, pp.9:1-9:2, 2011.
DOI : 10.1145/2016741.2016751

O. Ayala and L. P. Wang, Parallel implementation and scalability analysis of 3D fast Fourier transform using 2D domain decomposition, Parallel Computing, under review, 2012.

M. Frigo and S. Johnson, FFTW: an adaptive software architecture for the FFT, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181), pp.1381-1384, 1998.
DOI : 10.1109/ICASSP.1998.681704