Skip to Main content Skip to Navigation
Conference papers

Efficient Parallel Selective Separable-kernel Convolution on Heterogeneous Processors

Abstract : Separable kernel convolution is a fundamental operation for image processing and computer vision applications. Existing parallel implementations of convolution operation convolve a whole image with a separable kernel. While such algorithms are efficient, not all the computed convolutions are necessarily utilized by the underlying application. In this paper we introduce an integrated parallel selective separable-kernel convolution method with adaptive load balancing, for both homogeneous and heterogeneous multi-core processors.. The method allows for only computing convolutions at selected points efficiently, making use of partial convolution results computed previously. The method also automatically load-balances the convolution computation among core/processors with different processing speeds. The load-balancing is a hybrid dynamic/static one that adapts load-balancing for sequences of video frames, allowing for fast convolution operation on video streams. Our method is studied on the computation of the Difference of Gaussian operation. Performance assessment results show a remarkable load-balancing reaching 80% of optimum load balance for both heterogeneous and homogeneous processor systems.
Document type :
Conference papers
Complete list of metadata
Contributor : Ist Rennes Connect in order to contact the contributor
Submitted on : Thursday, June 17, 2010 - 2:03:45 PM
Last modification on : Thursday, October 10, 2019 - 2:38:11 PM


  • HAL Id : inria-00492909, version 1



Ahmed El-Mahdy, Hisham El-Shishiny. Efficient Parallel Selective Separable-kernel Convolution on Heterogeneous Processors. IFMT'10 - Second International Forum on Next Generation Multicore/Manycore Technologies, Jun 2010, Saint Malo, France. ⟨inria-00492909⟩



Record views