Energy and Computing Assessment of Video Processing Kernels on CPU and FPGA platforms

被引:0
|
作者
Mangrich, Fillipi [2 ]
Foes, Joao Gabriel Firta [2 ]
Correa, Guilherme [1 ]
Seidel, Ismael [2 ]
Grellert, Mateus [3 ]
机构
[1] Fed Univ Pelotas PPGC UFPel, Pelotas, RS, Brazil
[2] Fed Univ Santa Catarina UFSC, Embedded Comp Lab ECL, Florianopolis, SC, Brazil
[3] Fed Univ Rio Grande Do Sul UFRGS, Porto Alegre, RS, Brazil
关键词
video coding; similarity metrics; energy comparison; CPU; FPGA;
D O I
10.1109/SBCCI60457.2023.10261966
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous architectures are becoming increasingly common, allowing the acceleration of smaller modules that compose complex systems. This is specially beneficial when said systems contain mixed data-flow and control-flow algorithms, in which the former can be hardware-optimized whereas the latter can still execute in a CPU. In video encoders, the intra- and interprediction are typical examples of data-flow operations. These steps involve block-matching searches that aim at finding the most similar pair of blocks, one being encoded and one that is generated during prediction. The similarity can be measured in different ways, but the most common ones are the Sum of Absolute Differences (SAD), the Sum of Absolute Transformed Differences (SATD), and the Sum of Squared Differences (SSD). All of these distortion metrics are executed several times for each block being encoded, so reducing the time or energy required to compute them is extremely beneficial. This paper presents a comparison of the energy costs of the SAD and SSD operations on a CPU and on dedicated VLSI designs. The experiments were conducted in an Artix-7 based FPGA component. The VLSI architectures and simulation routines were designed with VHDL, and the software versions were described in C. To optimize throughput and resource utilization, the dedicated units were designed using pipeline and resource sharing when possible. Our results show that, as expected, FPGA has a great gain of energy efficiency over CPU, with power efficiency gains in the range of 100 times.
引用
收藏
页码:89 / 94
页数:6
相关论文
共 50 条
  • [1] Analyzing the Energy-Efficiency of Vision Kernels on Embedded CPU, GPU and FPGA Platforms
    Qasaimeh, Murad
    Zambreno, Joseph
    Jones, Phillip H.
    Denolf, Kristof
    Lo, Jack
    Vissers, Kees
    2019 27TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2019, : 336 - 336
  • [2] Comparing Energy Efficiency of CPU, GPU and FPGA Implementations for Vision Kernels
    Qasaimeh, Murad
    Denolf, Kristof
    Lo, Jack
    Vissers, Kees
    Zambreno, Joseph
    Jones, Phillip H.
    2019 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2019,
  • [3] Exploration of OpenCL 2D Convolution Kernels on Intel FPGA, CPU, and GPU Platforms
    Jin, Zheming
    Finkel, Hal
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4460 - 4465
  • [4] Exploring Slice-Energy Saving on an Video Processing FPGA Platform with Approximate Computing
    Zhang, Yunxiang
    Yang, Xiaokun
    Wu, Lei
    Lu, Jiang
    Sha, Kewei
    Gajjar, Archit
    He, Han
    PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND SYSTEMS (ICACS 2018), 2018, : 138 - 143
  • [5] Computing to the Limit with Heterogeneous CPU-FPGA Devices in a Video Fusion Application
    Nunez-Yanez, Jose
    APPLIED RECONFIGURABLE COMPUTING, ARC 2016, 2016, : 41 - 53
  • [6] SQL2FPGA: Automated Acceleration of SQL Query Processing on Modern CPU-FPGA Platforms
    Lu, Alec
    Narendra, Jahanvi
    Fang, Zhenman
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2024, 17 (03)
  • [7] Energy Efficient Video Fusion with Heterogeneous CPU-FPGA Devices
    Sun, Peng
    Achim, Alin
    Hasler, Ian
    Hill, Paul
    Nunez-Yanez, Jose
    PROCEEDINGS OF THE 2016 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2016, : 1399 - 1404
  • [8] SQL2FPGA: Automatic Acceleration of SQL Query Processing on Modern CPU-FPGA Platforms
    Lu, Alec
    Fang, Zhenman
    2023 IEEE 31ST ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, FCCM, 2023, : 184 - 194
  • [9] Energy-Efficient Algebra Kernels in FPGA for High Performance Computing
    Favaro, Federico
    Dufrechou, Ernesto
    Ezzatti, Pablo
    Oliver, Juan P.
    JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2021, 21 (02): : 80 - 92
  • [10] A Quantitative Analysis on Microarchitectures of Modern CPU-FPGA Platforms
    Choi, Young-Kyu
    Cong, Jason
    Fang, Zhenman
    Hao, Yuchen
    Reinman, Glenn
    Wei, Peng
    2016 ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2016,