Intel, Threading Building Blocks, 2009.

NVIDIA, Compute Unified Device Architecture Programming Guide, 2007.

J. A. Kahle, M. N. Day, H. P. Hofstee, C. R. Johns, T. R. Maeurer et al., Introduction to the Cell multiprocessor, IBM Journal of Research and Development, vol.49, issue.4-5, pp.589-604, 2005.
DOI : 10.1147/rd.494.0589

H. Pan, B. Hindman, and K. Asanović, Composing Parallel Software Efficiently with Lithe, Proceedings of the 2010 ACM SIGPLAN Conference on Programming Language Design and Implementation, ser. PLDI '10, pp.376-387, 2010.

J. Ansel, C. P. Chan, Y. L. Wong, M. Olszewski, Q. Zhao et al., PetaBricks: A Language and Compiler for Algorithmic Choice, Proceedings of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation, pp.38-49, 2009.

J. R. Wernsing and G. Stitt, Elastic Computing: A Framework for Transparent, Portable, and Adaptive Multi-core Heterogeneous Computing, Proceedings of the ACM SIGPLAN/SIGBED 2010 Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES), pp.115-124, 2010.

H. Vandierendonck, P. Pratikakis, and D. S. Nikolopoulos, Parallel Programming of General-Purpose Programs using Task-based Programming Models, Proceedings of the 3rd USENIX Conference on Hot Topics in Parallelism, ser. HotPar'11, pp.13-13, 2011.

S. Benkner, S. Pllana, J. Traff, P. Tsigas, U. Dolinsky et al., PEPPHER: Efficient and Productive Usage of Hybrid Computing Systems, IEEE Micro, vol.31, issue.5, pp.28-41, 2011.
DOI : 10.1109/MM.2011.67
URL : https://hal.archives-ouvertes.fr/hal-00648480

M. Sandrieser, S. Benkner, and S. Pllana, Using explicit platform descriptions to support programming of heterogeneous many-core systems, Parallel Computing, vol.38, issue.1-2, pp.52-65, 2012.
DOI : 10.1016/j.parco.2011.10.008

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures, Concurrency and Computation: Practice and Experience, pp.187-198, 2011.

D. Quinlan, ROSE: Compiler Support for Object-Oriented Frameworks, Parallel Processing Letters, vol.10, issue.2-3, pp.215-226, 2000.

H. Topcuoglu, S. Hariri, and M. Wu, Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, 2002.
DOI : 10.1109/71.993206

M. Burrows and D. J. Wheeler, A Block-Sorting Lossless Data Compression Algorithm, Digital Systems Research Center, 1994.

Intel, Intel Threading Building Blocks - Pipeline Documentation.

J. Seward, BZIP2 Library Utility Function Documentation, 2011.

J. Gilchrist, Parallel Data Compression with bzip2, Proceedings of the 16th IASTED International Conference on Parallel and Distributed Computing and Systems, pp.559-564, 2004.

G. Bradski and A. Kaehler, Learning OpenCV: Computer Vision with the OpenCV Library, 2008.

A. Benoit and Y. Robert, Mapping Pipeline Skeletons onto Heterogeneous Platforms, ICCS '07: Proceedings of the 7th international conference on Computational Science, 2007.
DOI : 10.1016/j.jpdc.2007.11.004
URL : https://hal.archives-ouvertes.fr/hal-00118921

M. Cole, Bringing skeletons out of the closet: a pragmatic manifesto for skeletal parallel programming, Parallel Computing, vol.30, issue.3, 2004.
DOI : 10.1016/j.parco.2003.12.002

A. Pop and A. Cohen, A stream-computing extension to OpenMP, Proceedings of the 6th International Conference on High Performance and Embedded Architectures and Compilers, HiPEAC '11, 2011.
DOI : 10.1145/1944862.1944867
URL : https://hal.archives-ouvertes.fr/hal-00659411

W. Thies, M. Karczmarek, and S. Amarasinghe, StreamIt: A Language for Streaming Applications, CC '02: Proceedings of the 11th International Conference on Compiler Construction, 2002.
DOI : 10.1007/3-540-45937-5_14

J. Sermulins, W. Thies, R. Rabbah, and S. Amarasinghe, Cache Aware Optimization of Stream Programs, ACM SIGPLAN Notices, vol.40, issue.7, 2005.

C. Schaefer, V. Pankratius, and W. Tichy, Engineering parallel applications with tunable architectures, Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering, ICSE '10, 2010.
DOI : 10.1145/1806799.1806859

F. Otto, C. Schaefer, M. Dempe, and W. Tichy, A Language-Based Tuning Mechanism for Task and Pipeline Parallelism, Euro-Par'10: Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II, 2010.
DOI : 10.1007/978-3-642-15291-7_30

M. Suleman, M. Qureshi, Khubaib, and Y. Patt, Feedback-directed pipeline parallelism, Proceedings of the 19th international conference on Parallel architectures and compilation techniques, PACT '10, 2010.
DOI : 10.1145/1854273.1854296

E. Ayguadé, R. M. Badia, D. Cabrera, A. Duran, M. González et al., A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures, Evolving OpenMP in an Age of Extreme Parallelism, 5th International Workshop on OpenMP (IWOMP), pp.154-167, 2009.
DOI : 10.1007/978-3-540-79561-2_10

M. Wolfe, Implementing the PGI Accelerator model, Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units, GPGPU '10, 2010.
DOI : 10.1145/1735688.1735697

F. Bodin and S. Bihan, Heterogeneous Multicore Parallel Programming for Graphics Processing Units, Scientific Programming, pp.325-335, 2009.
DOI : 10.1155/2009/784893