GPU Implementation of Image Convolution Using Sparse Model with Efficient Storage Format
With the growth of data parallel computing, role of GPU computing in non-graphic applications such as image processing becomes a focus in research fields. Convolution is an integral operation in filtering, smoothing and edge detection. In this article, the process of convolution is realized as a sparse linear system and is solved using Sparse Matrix Vector Multiplication (SpMV). The Compressed Sparse Row (CSR) format of SPMV shows better CPU performance compared to normal convolution. To overcome the stalling of threads for short rows in the GPU implementation of CSR SpMV, a more efficient model is proposed, which uses the Adaptive-Compressed Row Storage (A-CSR) format to implement the same. Using CSR in the convolution process achieves a 1.45x and a 1.159x increase in speed compared to the normal convolution of image smoothing and edge detection operations, respectively. An average speedup of 2.05x is achieved for image smoothing technique and 1.58x for edge detection technique in GPU platform usig adaptive CSR format.
Year of publication: |
2018
|
---|---|
Authors: | Babu, M. Rajasekhara ; Mohammed, Saira Banu Jamal ; Sriram, Sumithra |
Published in: |
International Journal of Grid and High Performance Computing (IJGHPC). - IGI Global, ISSN 1938-0267, ZDB-ID 2703335-1. - Vol. 10.2018, 1 (01.01.), p. 54-70
|
Publisher: |
IGI Global |
Subject: | Convolution | CSR | Edge Detection and Image Smoothening | GPU | SpMV |
Saved in:
Saved in favorites
Similar items by subject
-
Parallel Sequential Monte Carlo for Efficient Density Combination: The Deco Matlab Toolbox
Casarin, Roberto, (2013)
-
Parallel Sequential Monte Carlo for Efficient Density Combination: The DeCo MATLAB Toolbox
Casarin, Roberto, (2014)
-
Pricing options and computing implied volatilities using neural networks
Liu, Shuaiqiang, (2019)
- More ...
Similar items by person