Optimizing image processing on multi-core CPUs with Intel parallel programming technologies

Authors
Cheong Ghil Kim
Jeom Goo Kim
Do Hyeon Lee
Affiliations
[1] Namseoul University,Department of Computer Science
[2] Namseoul University,IT Convergence Technology Research & Education Center
Keywords
Multi-core; Streaming SIMD extension; Threading building block; Sobel operator; Sub-word parallelism; Task-level parallelism; Multimedia
Abstract
Rapid advances in computer hardware and the popularity of multimedia applications have made multi-core processors with sub-word parallelism instructions a dominant market trend in desktop PCs as well as high-end mobile devices. This paper presents an efficient parallel implementation of the 2D convolution algorithm, which demands high-performance computing power, on multi-core desktop PCs. 2D convolution is a representative computation-intensive algorithm in image and signal processing applications; it involves heavy memory access, although its computational complexity is relatively low. The purpose of this study is to explore the effectiveness of the Streaming SIMD (Single Instruction, Multiple Data) Extensions (SSE) technology and the TBB (Threading Building Blocks) run-time library on Intel multi-core processors. Using both lets an implementation exploit the hardware features of a multi-core processor concurrently for data-level and task-level parallelism. For the performance evaluation, we implemented a 3 × 3 kernel-based convolution algorithm using SSE2 and TBB in different combinations and compared their processing speeds. The experimental results show that both technologies have a significant effect on performance and that processing speed improves greatly when the two are used together: the reported speedups over implementations using only one of them are 6.2, 6.1, and 1.4 times for the 256 × 256, 512 × 512, and 1024 × 1024 data sets, respectively.
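The decomposition the abstract describes can be illustrated with a minimal sketch. It is not the authors' SSE2/TBB implementation. It is a plain Python stand-in that applies a 3 × 3 Sobel operator (named in the keywords) and splits the image into row bands, one task per band, which is the kind of task-level partitioning a TBB `parallel_for` would perform. The function names `sobel_rows` and `sobel_parallel`, the `|Gx| + |Gy|` magnitude, the clamp to 255, and the zeroed borders are all assumptions made for illustration. Python threads will not reproduce the reported speedups; the sketch only shows that the bands are independent and can be computed concurrently.

```python
from concurrent.futures import ThreadPoolExecutor

# 3 x 3 Sobel kernels for the horizontal and vertical gradients.
GX = ((-1, 0, 1), (-2, 0, 2), (-1, 0, 1))
GY = ((-1, -2, -1), (0, 0, 0), (1, 2, 1))


def sobel_rows(img, y0, y1):
    """Compute |Gx| + |Gy| for rows y0..y1-1; border pixels stay 0 (assumption)."""
    h, w = len(img), len(img[0])
    out = []
    for y in range(y0, y1):
        row = [0] * w
        if 0 < y < h - 1:
            for x in range(1, w - 1):
                gx = gy = 0
                # Accumulate the 3 x 3 neighbourhood against both kernels.
                for j in range(3):
                    for i in range(3):
                        p = img[y + j - 1][x + i - 1]
                        gx += GX[j][i] * p
                        gy += GY[j][i] * p
                row[x] = min(255, abs(gx) + abs(gy))  # clamp to 8-bit range
        out.append(row)
    return out


def sobel_parallel(img, workers=4):
    """Split the image into row bands and process each band as a separate task."""
    h = len(img)
    step = max(1, (h + workers - 1) // workers)
    bands = [(y, min(y + step, h)) for y in range(0, h, step)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Each band only reads the shared input, so no locking is needed.
        parts = list(pool.map(lambda b: sobel_rows(img, b[0], b[1]), bands))
    return [row for part in parts for row in part]


if __name__ == "__main__":
    # A 5 x 5 image with a vertical edge between columns 1 and 2.
    image = [[0, 0, 10, 10, 10] for _ in range(5)]
    for line in sobel_parallel(image, workers=2):
        print(line)
```

In a native implementation, the inner neighbourhood loop is where sub-word (SIMD) parallelism would apply, processing several adjacent pixels per instruction, while the band split maps onto cores.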
Pages: 237-251
Page count: 14