G. Kadi and A. S. Terechko, A hardware task scheduler for embedded video processing, Proc. of the 4th Intl. Conf. on High Performance and Embedded Architectures and Compilers (HiPEAC'09), 2009.
URL : https://hal.archives-ouvertes.fr/inria-00445874

M. Aldinucci, M. Meneghin, and M. Torquati, Efficient Smith-Waterman on Multi-core with FastFlow, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, pp.195-199, 2010.
DOI : 10.1109/PDP.2010.93

R. S. Arvind, K. Nikhil, and . Pingali, I-structures: data structures for parallel computing, ACM Transactions on Programming Languages and Systems, vol.11, issue.4, pp.598-632, 1989.
DOI : 10.1145/69558.69562

C. Augonnet, S. Thibault, R. Namyst, and M. Nijhuis, Exploiting the Cell/BE Architecture with the StarPU Unified Runtime System, Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS'09), pp.329-339, 2009.
DOI : 10.1007/978-3-642-03138-0_36

URL : https://hal.archives-ouvertes.fr/inria-00378705

A. Azevedo, C. Meenderinck, B. H. Juurlink, A. Terechko, J. Hoogerbrugge et al., Parallel H.264 Decoding on an Embedded Multicore Processor, Proc. of the 4th Intl. Conf. on High Performance and Embedded Architectures and Compilers (HiPEAC'09), 2009.
DOI : 10.1007/978-3-540-92990-1_29

URL : https://hal.archives-ouvertes.fr/inria-00446428

P. M. Carpenter, D. Ródenas, X. Martorell, A. Ramírez, and E. Ayguadé, A Streaming Machine Description and Programming Model, Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS'07), pp.107-116, 2007.
DOI : 10.1007/978-3-540-73625-7_13

P. Caspi and M. Pouzet, Synchronous Kahn networks, ACM Intl. Conf. on Functional programming (ICFP'96), pp.226-238, 1996.

A. Cohen, L. Mandel, F. Plateau, and M. Pouzet, Abstraction of Clocks in Synchronous Data-Flow Systems, 6th Asian Symp. on Programming Languages and Systems (APLAS 08), 2008.
DOI : 10.1016/0167-6423(91)90001-E

URL : https://hal.archives-ouvertes.fr/hal-01257274

I. Corp, Occam Programming Manual, 1984.

D. E. Culler and A. , Resource requirements of dataflow programs, ISCA, pp.141-150, 1988.

J. B. Dennis and G. R. Gao, An efficient pipelined dataflow processor architecture, Proceedings. SUPERCOMPUTING '88, pp.368-373, 1988.
DOI : 10.1109/SUPERC.1988.44674

F. L. Fessant and L. Maranget, Compiling join-patterns, Electr. Notes Theor. Comput. Sci, vol.16, issue.3, 1998.

C. Fournet and G. Gonthier, The reflexive chemical abstract machine and the join-calculus, In ACM Symp. on Principles of Programming Languages, pp.372-385, 1996.

J. Giacomoni, T. Moseley, and M. Vachharajani, FastForward for efficient pipeline parallelism, Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming , PPoPP '08, pp.43-52, 2008.
DOI : 10.1145/1345206.1345215

R. Gupta, Exploiting parallelism on a fine-grained MIMD architecture based upon channel queues, International Journal of Parallel Programming, vol.2, issue.3, pp.169-192, 1992.
DOI : 10.1007/BF01408554

W. Haid, L. Schor, K. Huang, I. Bacivarov, and L. Thiele, Efficient execution of Kahn process networks on multi-processor systems using protothreads and windowed FIFOs, 2009 IEEE/ACM/IFIP 7th Workshop on Embedded Systems for Real-Time Multimedia, pp.35-44, 2009.
DOI : 10.1109/ESTMED.2009.5336828

N. Halbwachs, P. Caspi, P. Raymond, and D. Pilaud, The synchronous dataflow programming language Lustre, Proc. of the IEEE, pp.1305-1320, 1991.

R. H. Halstead and J. , MULTILISP: a language for concurrent symbolic computation, ACM Transactions on Programming Languages and Systems, vol.7, issue.4, pp.501-538, 1985.
DOI : 10.1145/4472.4478

T. Henriksson, P. Van, and . Wolf, TTL Hardware Interface: A High-Level Interface for Streaming Multiprocessor Architectures, 2006 IEEE/ACM/IFIP Workshop on Embedded Systems for Real Time Multimedia, pp.107-112, 2006.
DOI : 10.1109/ESTMED.2006.321282

C. A. Hoare, Communicating Sequential Processes, 1985.

G. Kahn, The semantics of a simple language for parallel programming, Information processing, pp.471-475, 1974.

C. Kim, J. Gaudiot, and W. Proskurowski, Parallel computing with the sisal applicative language: Programmability and performance issues. Software, Practice and Experience, pp.1025-1051, 1996.

C. Kyriacou, P. Evripidou, and P. Trancoso, Data-Driven Multithreading Using Conventional Microprocessors, IEEE Transactions on Parallel and Distributed Systems, vol.17, issue.10, pp.1176-1188, 2006.
DOI : 10.1109/TPDS.2006.136

E. A. Lee and D. G. Messerschmitt, Static Scheduling of Synchronous Data Flow Programs for Digital Signal Processing, IEEE Transactions on Computers, vol.36, issue.1, pp.24-25, 1987.
DOI : 10.1109/TC.1987.5009446

E. A. Lee and A. L. Sangiovanni-vincentelli, A framework for comparing models of computation, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol.17, issue.12
DOI : 10.1109/43.736561

K. H. Frigo and C. E. Leiserson, The implementation of the Cilk-5 multithreaded language, ACM Symp. on Programming Language Design and Implementation (PLDI'98), pp.212-223, 1998.

V. Marjanovic, J. Labarta, E. Ayguadé, and M. Valero, Effective communication and computation overlap with hybrid MPI/SMPSs, PPOPP, 2010.

R. Milner, J. Parrow, and D. Walker, A calculus of mobile processes, II, Information and Computation, vol.100, issue.1, pp.1-40, 1992.
DOI : 10.1016/0890-5401(92)90009-5

M. Olszewski, J. Ansel, and S. Amarasinghe, Kendo: Efficient deterministic multithreading in software, The Intl. Conf. on Architectural Support for Programming Languages and Operating Systems, 2009.

G. Ottoni, R. Rangan, A. Stoler, and D. I. , Automatic Thread Extraction with Decoupled Software Pipelining, 38th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'05), pp.105-118, 2005.
DOI : 10.1109/MICRO.2005.13

J. M. Pérez, P. Bellens, R. M. Badia, and J. Labarta, CellSs: Making it easier to program the Cell Broadband Engine processor, IBM Journal of Research and Development, vol.51, issue.5, pp.593-604, 2007.
DOI : 10.1147/rd.515.0593

J. Planas, R. M. Badia, E. Ayguadé, and J. Labarta, Hierarchical Task-Based Programming With StarSs, International Journal of High Performance Computing Applications, vol.23, issue.3, pp.284-299, 2009.
DOI : 10.1177/1094342009106195

A. Pop and A. Cohen, A stream-comptuting extension to OpenMP, Proc. of the 4th Intl. Conf. on High Performance and Embedded Architectures and Compilers (HiPEAC'11), 2011.
URL : https://hal.archives-ouvertes.fr/inria-00551507

A. Pop, S. Pop, and J. Sjödin, Automatic streamization in GCC, GCC Developer's Summit, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00817455

M. C. Rinard and M. S. Lam, The design, implementation, and evaluation of Jade, ACM Transactions on Programming Languages and Systems, vol.20, issue.3, pp.483-545, 1998.
DOI : 10.1145/291889.291893

M. Själander, A. Terechko, and M. Duranton, A Look-Ahead Task Management Unit for Embedded Multi-Core Architectures, 2008 11th EUROMICRO Conference on Digital System Design Architectures, Methods and Tools, 2008.
DOI : 10.1109/DSD.2008.45

K. Stavrou, M. Nikolaides, D. Pavlou, S. Arandi, P. Evripidou et al., TFlux: A Portable Platform for Data-Driven Multithreading on Commodity Multicore Systems, 2008 37th International Conference on Parallel Processing, pp.25-34, 2008.
DOI : 10.1109/ICPP.2008.74

S. Stuijk, Concurrency in computational networks Master's thesis, 2002.

W. Thies and S. Amarasinghe, An empirical characterization of stream programs and its implications for language and compiler design, Proceedings of the 19th international conference on Parallel architectures and compilation techniques, PACT '10, 2010.
DOI : 10.1145/1854273.1854319

W. Thies, M. Karczmarek, and S. Amarasinghe, StreamIt: A Language for Streaming Applications, Intl. Conf. on Compiler Construction, 2002.
DOI : 10.1007/3-540-45937-5_14

I. Watson and J. R. Gurd, A Practical Data Flow Computer, Computer, vol.15, issue.2, pp.51-57, 1982.
DOI : 10.1109/MC.1982.1653941