R. Allen and K. Kennedy, Optimizing Compilers for Modern Architectures: A Dependence-based Approach, 2001.

S. Campanoni, T. Jones, G. Holloway, V. J. Reddi, G. Y. Wei et al., HE- LIX: Automatic Parallelization of Irregular Programs for Chip Multiprocessing, Proc. Int'1 Symp. on Code Generation and Optimization, pp.84-93, 2012.

J. Ceng, J. Castrillón, W. Sheng, H. Scharwächter, R. Leupers et al., MAPS, Proceedings of the 45th annual conference on Design automation, DAC '08, pp.754-759, 2008.
DOI : 10.1145/1391469.1391663

R. Cytron, Doacross: Beyond Vectorization for Multiprocessors, Proc. Int'l Conf. on Parallel Processing, pp.836-844, 1986.

J. Huang, A. Raman, T. B. Jablin, Y. Zhang, T. H. Hung et al., Decoupled software pipelining creates parallelization opportunities, Proceedings of the 8th annual IEEE/ ACM international symposium on Code generation and optimization, CGO '10, pp.121-130, 2010.
DOI : 10.1145/1772954.1772973

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.299.5232

Y. Liu, P. Liu, Y. Jiang, M. Yang, K. Wu et al., Building a multi-FPGA-based emulation framework to support networks-on-chip design and verification, International Journal of Electronics, vol.97, issue.10, pp.1241-1262, 2010.
DOI : 10.1504/IJHPSA.2010.034538

N. Nethercote and J. Seward, Valgrind: A Framework for Heavyweight Dynamic Binary Instrumentation, Proc. of the ACM SIGPLAN Conf. on Programming Language Design and Implementation, pp.89-100, 2007.

G. Ottoni, R. Rangan, A. Stoler, and D. I. August, Automatic Thread Extraction with Decoupled Software Pipelining, 38th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'05), pp.105-116, 2005.
DOI : 10.1109/MICRO.2005.13

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.131.4010

R. Rangan, N. Vachharajani, M. Vachharajani, and D. I. August, Decoupled software pipelining with the synchronization array, Proceedings. 13th International Conference on Parallel Architecture and Compilation Techniques, 2004. PACT 2004., pp.177-188, 2004.
DOI : 10.1109/PACT.2004.1342552

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.74.1006

O. Sinnen, Task Scheduling for Parallel Systems, 2007.
DOI : 10.1002/0470121173

W. Thies, V. Chandrasekhar, and S. Amarasinghe, A Practical Approach to Exploiting Coarse-Grained Pipeline Parallelism in C Programs, 40th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2007), pp.356-369, 2007.
DOI : 10.1109/MICRO.2007.38

N. Vachharajani, R. Rangan, E. Raman, M. J. Bridges, G. Ottoni et al., Speculative Decoupled Software Pipelining, 16th International Conference on Parallel Architecture and Compilation Techniques (PACT 2007), pp.49-59, 2007.
DOI : 10.1109/PACT.2007.4336199

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.73.9040

H. Vandierendonck, S. Rul, and K. D. Bosschere, The Paralax infrastructure, Proceedings of the 19th international conference on Parallel architectures and compilation techniques, PACT '10, pp.389-400, 2010.
DOI : 10.1145/1854273.1854322

URL : http://pure.qub.ac.uk/ws/files/5240407/postprint.pdf