E. Baysal, D. D. Kosloff, and J. W. Sherwood, Reverse time migration, GEOPHYSICS, vol.48, issue.11, pp.1514-1524, 1983.

S. Brandsberg-Dahl, High-performance computing for seismic imaging, pp.5273-5277, 2017.

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience, vol.23, issue.2, pp.187-198, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00550877

The Top500 List

D. Michéa and D. Komatitsch, Accelerating a 3D finite-difference wave propagation code using GPU graphics cards, Geophysical Journal International, vol.182, issue.1, pp.389-402, 2010.

K. Datta, Auto-tuning stencil codes for cache-based multicore platforms, 2009.

A. Nguyen, N. Satish, J. Chhugani, C. Kim, and P. Dubey, 3.5-D blocking optimization for stencil computations on modern CPUs and GPUs, Int. Conf. for High Performance Computing, Networking, Storage and Analysis, pp.1-13, 2010.
URL : https://hal.archives-ouvertes.fr/hal-01793035

V. Etienne, T. Tonellot, T. Malas, H. Ltaief, S. Kortas et al., High-Performance Seismic Modeling with Finite-Difference Using Spatial and Temporal Cache Blocking, 3rd EAGE Workshop High Performance Computing for Upstream, 2017.

S. Titarenko and M. Hildyard, Hybrid multicore/vectorisation technique applied to the elastic wave equation on a staggered grid, Computer Physics Communications, vol.216, pp.53-62, 2017.

U. Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan, A practical automatic polyhedral parallelizer and locality optimizer, ACM SIGPLAN Notices, vol.43, issue.6, pp.101-113, 2008.

D. Orozco, E. Garcia, and G. Gao, Locality optimization of stencil applications using data dependency graphs, Languages and Compilers for Parallel Computing, pp.77-91, 2011.

T. Henretty, R. Veras, F. Franchetti, L. N. Pouchet, J. Ramanujam et al., A stencil compiler for short-vector SIMD architectures, 27th ACM Int. Conf. on Supercomputing, pp.13-24, 2013.

J. Holewinski, L. N. Pouchet, and P. Sadayappan, High-performance code generation for stencil computations on GPU architectures, 26th ACM Int. Conf. on Supercomputing, pp.311-320, 2012.

D. G. Wonnacott and M. M. Strout, On the scalability of loop tiling techniques, 3rd Int. Workshop on Polyhedral Compilation Techniques, pp.3-11, 2013.

T. Grosser, A. Cohen, J. Holewinski, P. Sadayappan, and S. Verdoolaege, Hybrid hexagonal/classical tiling for GPUs, IEEE/ACM Int. Symposium on Code Generation and Optimization, p.66, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00911177

D. Orozco and G. Gao, Diamond tiling: A tiling framework for time-iterated scientific applications, CAPSL Technical Memo, vol.091, 2009.

G. Wellein, G. Hager, T. Zeiser, M. Wittmann, and H. Fehske, Efficient temporal blocking for stencil computations by multicore-aware wavefront parallelization, 33rd Annual IEEE Int. Computer Software and Applications Conference, vol.1, pp.579-586, 2009.

R. Strzodka, M. Shaheen, D. Pajak, and H. Seidel, Cache accurate time skewing in iterative stencil computations, Int. Conf. on Parallel Processing, pp.571-581, 2011.

D. G. Wonnacott, Using time skewing to eliminate idle time due to memory bandwidth and network limitations, International Parallel and Distributed Processing Symposium, pp.171-180, 2000.

V. Bandishti, I. Pananilath, and U. Bondhugula, Tiling stencil computations to maximize parallelism, Int. Conf. for High Performance Computing, Networking, Storage and Analysis, pp.1-11, 2012.

X. Zhou, Tiling optimizations for stencil computations, 2013.

T. Grosser, S. Verdoolaege, A. Cohen, and P. Sadayappan, The relation between diamond tiling and hexagonal tiling, Parallel Processing Letters, vol.24, issue.03, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01257248

T. Malas, G. Hager, H. Ltaief, H. Stengel, G. Wellein et al., Multicore Optimized Wavefront Diamond Blocking for Optimizing Stencil Updates, SIAM Journal on Scientific Computing, vol.37, issue.4, pp.439-464, 2015.

S. Moustafa, W. Kirschenmann, F. Dupros, and H. Aochi, Task-Based Programming on Emerging Parallel Architectures for Finite-Differences Seismic Numerical Kernel, Euro-Par 2018: Parallel Processing, pp.764-777, 2018.

T. M. Malas, G. Hager, H. Ltaief, and D. E. Keyes, Multidimensional intratile parallelization for memory-starved stencil computations, TOPC, vol.4, p.32, 2017.

L. Yuan, Y. Zhang, P. Guo, and S. Huang, Tessellating stencils, Int. Conf. for High Performance Computing, Networking, Storage and Analysis, ser. SC '17, pp.49:1-49:13, 2017.

R. Abdelkhalak, K. Akbudak, V. Etienne, H. Ltaief, T. Tonellot et al., Application of high performance asynchronous acoustic wave equation stencil solver into a land survey, 2019.

D. Komatitsch and R. Martin, An Unsplit Convolutional Perfectly Matched Layer Improved at Grazing Incidence for the Seismic Wave Equation, GEOPHYSICS, vol.72, issue.5, pp.155-167, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00528418

M. Louboutin, M. Lange, F. Luporini, N. Kukreja, P. A. Witte et al., Devito: an embedded domain-specific language for finite differences and geophysical exploration, CoRR, 2018.

F. Luporini, M. Lange, M. Louboutin, N. Kukreja, J. Hückelheim et al., Architecture and performance of Devito, a system for automated stencil computation, CoRR, 2018.

OpenMP Application Program Interface, 2018.

W. W. Symes, Reverse time migration with optimal checkpointing, GEOPHYSICS, vol.72, issue.5, pp.213-221, 2007.

T. Okamoto, H. Takenaka, T. Nakamura, and T. Aoki, Accelerating Large-Scale Simulation of Seismic Wave Propagation by Multi-GPUs and Three-Dimensional Domain Decomposition, pp.375-389, 2013.

D. Imbert, K. Imadoueddine, P. Thierry, H. Chauris, and L. Borgues, Tips and tricks for Finite Difference and I/O-less FWI, SEG Annual Meeting, Expanded Abstracts, pp.3174-3178, 2011.

A. Sena, A. Nascimento, C. Boeres, V. Rebello, and A. Bulcao, An Approach to Optimise the Execution of RTM Algorithm in Multicore Machines, Seventh IEEE International Conference on eScience, pp.403-410, 2011.

V. Etienne, T. Tonellot, P. Thierry, V. Berthoumieux, and C. Andreolli, Optimization of the seismic modeling with the time-domain finite-difference method, SEG Annual Meeting, Expanded Abstracts, pp.3536-3540, 2014.

S. Toledo, A Survey of Out-of-core Algorithms in Numerical Linear Algebra, pp.161-179, 1999.

J. S. Vitter, External memory algorithms and data structures: Dealing with massive data, ACM Computing Surveys, vol.33, issue.2, pp.209-271, 2001.

J. S. Vitter, Algorithms and data structures for external memory, Found. Trends Theor. Comput. Sci, vol.2, issue.4, pp.305-474, 2008.

T. Joffrain, E. S. Quintana-Ortí, and R. A. van de Geijn, Rapid development of high-performance out-of-core solvers, Applied Parallel Computing. State of the Art in Scientific Computing, pp.413-422, 2006.

K. Kabir, A. Haidar, S. Tomov, A. Bouteiller, and J. Dongarra, A framework for out of memory SVD algorithms, pp.158-178, 2017.

S. Kumar, V. Vishwanath, P. Carns, B. Summa, G. Scorzelli et al., PIDX: Efficient parallel I/O for multi-resolution multi-dimensional scientific datasets, 2011 IEEE International Conference on Cluster Computing, pp.103-111, 2011.

S. Papadopoulos, K. Datta, S. Madden, and T. Mattson, The TileDB Array Data Storage Manager, Proc. VLDB Endow, vol.10, issue.4, pp.349-360, 2016.

R. Kosloff and D. Kosloff, Absorbing Boundaries for Wave Propagation Problems, J. Comput. Phys, vol.63, issue.2, pp.363-376, 1986.

E. Agullo, O. Aumage, M. Faverge, N. Furmento, F. Pruvost et al., Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model, IEEE Transactions on Parallel and Distributed Systems, pp.1-1, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01618526

SEG/EAGE Salt C3 Dataset

SEG/EAGE Salt Model, 1996.

H. Topcuoglu, S. Hariri, and M. Wu, Task scheduling algorithms for heterogeneous processors, Proceedings of the Eighth Heterogeneous Computing Workshop, ser. HCW '99, p.3, 1999.