Fast VVC Intra Encoding for Video Coding for Machines

被引:2
|
作者
Gou, Aorui [1 ]
Sun, Heming [2 ,3 ]
Zeng, Xiaoyang [1 ]
Fan, Yibo [1 ]
机构
[1] Fudan Univ, Shanghai 200433, Peoples R China
[2] Waseda Univ, Waseda Res Inst Sci & Engn, Tokyo, Japan
[3] JST, PRESTO, Tokyo, Japan
基金
日本学术振兴会; 中国国家自然科学基金;
关键词
Video coding for machines; fast intra encoding; versatile video coding; histogram of oriented gradient; semantic segmentation;
D O I
10.1109/ISCAS46773.2023.10181507
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional video coding technologies compress and reconstruct the video frames, which focus on human perception. However, video coding for machines (VCM) uses the feature stream to bridge the correlation between human perception and machine intelligence for vision tasks. We extract the features for the CU with different shapes with part of resnet architecture for VCM. However, the feature-based methods use the model to complete the forward process, which is very time-consuming for its complex architecture and parameter size. The CU architecture for the feature extraction further increases the operation times. A fast algorithm based on the Histogram of oriented gradient (HOG) is proposed for the video coding for machines with VVC intra to overcome the time-consuming problems while maintaining the performance for the vision tasks with codec. The correlation of the mode decision with the VCM performance is discussed to motivate the fast intra coding for VCM. Moreover, the VTM and VVenc are used to verify the universality of the proposed method. The proposed methods can speed up the fast encoding for 35.21% time saving with 0.26 increment for AP50 for the cityscapes dataset compared with the VTM10.0.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Efficient VVC Intra Coding for 360° Video with Residual Weighting and Adaptive Quantization
    Adhuran, Jayasingam
    Galkandage, Chathura
    Kulupana, Gosala
    Fernando, Anil
    2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2020, : 144 - 148
  • [22] A Fast Encoding Algorithm for Multiview Video Coding
    Peng, Zongju
    Yu, Mei
    Jiang, Gangyi
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 1, PROCEEDINGS, 2009, : 497 - 500
  • [23] Fast encoding techniques for Multiview Video Coding
    Khattak, S.
    Hamzaoui, R.
    Ahmad, S.
    Frossard, P.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (06) : 569 - 580
  • [24] Intra Prediction and Mode Coding in VVC
    Pfaff, Jonathan
    Filippov, Alexey
    Liu, Shan
    Zhao, Xin
    Chen, Jianle
    De-Luxan-Hernandez, Santiago
    Wiegand, Thomas
    Rufitskiy, Vasily
    Ramasubramonian, Adarsh Krishnan
    Van der Auwera, Geert
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (10) : 3834 - 3847
  • [25] AN INTRA SUBPARTITION CODING MODE FOR VVC
    De-Luxan-Hernandez, Santiago
    George, Valeri
    Ma, Jackie
    Tung Nguyen
    Schwarz, Heiko
    Marpe, Detlev
    Wiegand, Thomas
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1203 - 1207
  • [26] COMPLEXITY ANALYSIS OF VVC INTRA CODING
    Saldanha, Mario
    Sanchez, Gustavo
    Marcon, Cesar
    Agostini, Luciano
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3119 - 3123
  • [27] Texture-based fast QTMT partition algorithm in VVC intra coding
    Qiang Li
    Hui Meng
    Ya Li
    Signal, Image and Video Processing, 2023, 17 : 1581 - 1589
  • [28] Fast H.266/VVC intra-coding by mode inheritance
    Chen, Jiann-Jone
    Su, Jiann-Ann
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (23) : 36041 - 36065
  • [29] CNN-BASED FAST CU PARTITIONING ALGORITHM FOR VVC INTRA CODING
    Xu, Jun
    Wu, Guoqing
    Zhu, Chen
    Huang, Yan
    Song, Li
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2706 - 2710
  • [30] Fast CU Partition Method Based on Extra Trees for VVC Intra Coding
    Wang, Kaijie
    Liang, Hong
    Zhang, Saiping
    Yang, Fuzheng
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,