Tailored AVX2 Transform Kernels for Versatile Video Coding

被引:0
|
作者
Siivonen, Kari [1 ]
Sainio, Joose [1 ]
Mercat, Alexandre [1 ]
Vanne, Jarno [1 ]
机构
[1] Tampere Univ, Ultra Video Grp, Tampere, Finland
基金
芬兰科学院;
关键词
Versatile Video Coding (VVC); transform; complexity reduction; Advanced Vector Extensions 2 (AVX2); practical encoder implementation;
D O I
10.1109/NorCAS58970.2023.10305449
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Transform coding tools play an integral part in video codecs due to their substantial impact on coding efficiency. The latest video coding standard, Versatile Video Coding (VVC), makes the most of these tools by introducing new DST7, DCT8, and non-square transforms alongside the conventional DCT2 transform. This paper proposes optimized AVX2 kernels for all these transforms to speed up VVC coding. Unlike existing solutions, our kernels are specially tailored for each VVC transform type and block size. Accelerating our open-source uvg266 VVC encoder with the proposed kernels yields up to a 1.1x speedup under all intra (AI) coding condition without any coding overhead. Our implementations make forward DCT2 and DST7/DCT8 transforms 4.0x and 6.7x as fast as their respective scalar implementations in the VTM reference encoder. They also outpace the AVX2 kernels of the practical VVenC encoder by factors of 3.0x and 2.8x. The respective speedups rise up to 5.3x, 11.1x, 3.4x, and 3.0x with inverse transforms.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Accelerating Stereo Vision Algorithm using SSE3, AVX2, and CUDA
    Kokhazadeh, M.
    Kokhazad, Z.
    Dehyadegari, M.
    Daneshtalab, M.
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 2194 - 2199
  • [42] Study on versatile video coding multiple transform selection of hardware architecture based on FPGA
    Zhang, Jun
    Shi, Wenchen
    Zhang, Hao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (10) : 14929 - 14944
  • [43] An FPGA-Based Architecture for the Versatile Video Coding Multiple Transform Selection Core
    Garrido, Matias J.
    Pescador, Fernando
    Chavarrias, Miguel
    Lobo, Pedro J.
    Sanz, Cesar
    Paz, Pedro
    IEEE ACCESS, 2020, 8 : 81887 - 81903
  • [44] Study on versatile video coding multiple transform selection of hardware architecture based on FPGA
    Jun Zhang
    Wenchen Shi
    Hao Zhang
    Multimedia Tools and Applications, 2023, 82 : 14929 - 14944
  • [45] Versatile video coding: A Next-generation Video Coding Standard
    Takamura, Seishi
    NTT Technical Review, 2019, 17 (06): : 49 - 52
  • [46] Transform selection for video coding
    Kim, Nam Uk
    Lee, Yung-Lyul
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2020, 2020, 11515
  • [47] RATE CONTROL FOR VERSATILE VIDEO CODING
    Li, Yiming
    Liu, Zizheng
    Chen, Zhenzhong
    Liu, Shan
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1176 - 1180
  • [48] MEMORY ASSESSMENT OF VERSATILE VIDEO CODING
    Cerveira, Arthur
    Agostini, Luciano
    Zatt, Bruno
    Sampaio, Felipe
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1186 - 1190
  • [49] 基于Intel AVX2的LTE DFT和IDFT快速算法
    曹如球
    信息通信, 2014, (08) : 11 - 12
  • [50] Lightweight Hardware Transform Design for the Versatile Video Coding 4K ASIC Decoders
    Farhat, Ibrahim
    Hamidouche, Wassim
    Grill, Adrien
    Menard, Daniel
    Deforges, Olivier
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2021, 67 (04) : 329 - 340