A Parallel Error Diffusion Implementation on a GPU

被引:7
|
作者
Zhang, Yao [1 ]
Recker, John Ludd [2 ]
Ulichney, Robert [3 ]
Beretta, Giordano B. [2 ]
Tastl, Ingeborg [2 ]
Lin, I-Jong [2 ]
Owens, John D. [1 ]
机构
[1] Univ Calif Davis, 1 Shields Ave, Davis, CA 95616 USA
[2] Hewlett Packard Labs, Palo Alto, CA 94304 USA
[3] Hewlett Packard Corp, Andover, MA 94304 USA
关键词
Halftoning; Error Diffusion; Parallel Processing; GPU Computing;
D O I
10.1117/12.872616
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we investigate the suitability of the GPU for a parallel implementation of the pinwheel error diffusion. We demonstrate a high-performance GPU implementation by efficiently parallelizing and unrolling the image processing algorithm. Our GPU implementation achieves a 10 - 30x speedup over a two-threaded CPU error diffusion implementation with comparable image quality. We have conducted experiments to study the performance and quality tradeoffs for differences in image block sizes. We also present a performance analysis at assembly level to understand the performance bottlenecks.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Boundary stitching algorithm for parallel implementation of error diffusion
    He, Z
    Chang, TC
    Allebach, JP
    Bouman, CA
    [J]. COLOR IMAGING IX: PROCESSING, HARDCOPY, AND APPLICATIONS, 2004, 5293 : 344 - 355
  • [2] Implementation of Direct Binary Search and Error Diffusion Algorithm Using GPU
    Zhang, Jiajin
    Wei, Qun
    Chen, Lichang
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL AND INFORMATION SCIENCES (ICCIS 2014), 2014, : 1163 - 1170
  • [3] A Parallel Implementation of RFT on GPU
    Shang, Zhe-ran
    Tan, Xian-si
    Qu, Zhi-guo
    Wang, Hong
    [J]. 2016 CIE INTERNATIONAL CONFERENCE ON RADAR (RADAR), 2016,
  • [4] Fast and flexible GPU implementation of the view-dependent error diffusion algorithm
    Lagrange, Antoine
    Gilles, Antonin
    Heggarty, Kevin
    Fracasso, Bruno
    [J]. OPTICS, PHOTONICS, AND DIGITAL TECHNOLOGIES FOR IMAGING APPLICATIONS VIII, 2024, 12998
  • [5] Parallel error diffusion
    Kang, HR
    [J]. COLOR IMAGING: DEVICE-INDEPENDENT COLOR, COLOR HARDCOPY, AND APPLICATIONS VII, 2002, 4663 : 360 - 369
  • [6] Parallel design and implementation of Error Diffusion Algorithm and IP core for FPGA
    Pengfei Yang
    Quan Wang
    Jiyang Zhang
    [J]. Multimedia Tools and Applications, 2016, 75 : 4723 - 4733
  • [7] Parallel design and implementation of Error Diffusion Algorithm and IP core for FPGA
    Yang, Pengfei
    Wang, Quan
    Zhang, Jiyang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (08) : 4723 - 4733
  • [8] Implementation of a parallel tree method on a GPU
    Nakasato, Naohito
    [J]. JOURNAL OF COMPUTATIONAL SCIENCE, 2012, 3 (03) : 132 - 141
  • [9] A parallel Bees Algorithm implementation on GPU
    Luo, Guo-Heng
    Huang, Sheng-Kai
    Chang, Yue-Shan
    Yuan, Shyan-Ming
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2014, 60 (03) : 271 - 279
  • [10] GPU-Based Parallel Implementation of 5-Layer Thermal Diffusion Scheme
    Huang, Melin
    Mielikainen, Jarno
    Huang, Bormin
    Huang, H. -L. Allen
    Goldberg, Mitchell D.
    [J]. HIGH-PERFORMANCE COMPUTING IN REMOTE SENSING II, 2012, 8539