Deep learning-based Feature compression for Video Coding for Machine

被引:0
|
作者
Do, Jihoon [1 ]
Lee, Jooyoung [1 ]
Kim, Younhee [1 ]
Jeong, Se Yoon [1 ]
Choi, Jin Soo [1 ]
机构
[1] Elect & Telecommun Res Inst, 218 Gajeong Ro, Daejeon, South Korea
关键词
video data; feature map; VVC; compression network; feature domain MSE; machine vision task loss;
D O I
10.1117/12.2626099
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We previously trained the compression network via optimization of bit-rate and distortion (feature domain MSE) [1]. In this paper, we propose feature map compression method for Video Coding for Machine (VCM) based on deep learning-based compression network that joint training for optimizing both compressed bit rate and machine vision task performance. We use bmshij2018-hyperporior model in the CompressAI [2] as the compression network and compress the feature map which is the output of stem layer in the Faster R-CNN X101-FPN network of Detectron2 [3]. We evaluated the proposed method by Evaluation Framework for MPEG VCM. The proposed method shows the better results than VVC of MPEG VCM Anchor.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Learning-Based Video Coding with Joint Deep Compression and Enhancement
    Zhao, Tiesong
    Feng, Weize
    Zeng, Hongji
    Xu, Yiwen
    Niu, Yuzhen
    Liu, Jiaying
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3045 - 3054
  • [2] Deep video compression with conditional feature coding
    Pientka, Sophie
    Pfaff, Jonathan
    Schwarz, Heiko
    Marpe, Detlev
    Wiegand, Thomas
    [J]. 2024 PICTURE CODING SYMPOSIUM, PCS 2024, 2024,
  • [3] A Comparison of Machine Learning-Based and Conventional Technologies for Video Compression
    Mochurad, Lesia
    [J]. TECHNOLOGIES, 2024, 12 (04)
  • [4] Deep Learning-Based Video Coding: A Review and a Case Study
    Liu, Dong
    Li, Yue
    Lin, Jianping
    Li, Houqiang
    Wu, Feng
    [J]. ACM COMPUTING SURVEYS, 2020, 53 (01)
  • [5] Deep learning-based video quality enhancement for the new versatile video coding
    Soulef Bouaafia
    Randa Khemiri
    Seifeddine Messaoud
    Olfa Ben Ahmed
    Fatma Ezahra Sayadi
    [J]. Neural Computing and Applications, 2022, 34 : 14135 - 14149
  • [6] Deep learning-based video quality enhancement for the new versatile video coding
    Bouaafia, Soulef
    Khemiri, Randa
    Messaoud, Seifeddine
    Ben Ahmed, Olfa
    Sayadi, Fatma Ezahra
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17): : 14135 - 14149
  • [7] Deep learning-based video coding optimisation of H.265
    Karthikeyan, C.
    Vivek, Tammineedi Venkata Satya
    Narayanan, S. Lakshmi
    Markkandan, S.
    Babu, D. Vijendra
    Laddha, Shilpa
    [J]. INTERNATIONAL JOURNAL OF ENGINEERING SYSTEMS MODELLING AND SIMULATION, 2023, 14 (01) : 52 - 57
  • [8] Deep Learning-Based Chroma Prediction for Intra Versatile Video Coding
    Zhu, Linwei
    Zhang, Yun
    Wang, Shiqi
    Kwong, Sam
    Jin, Xin
    Qiao, Yu
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3168 - 3181
  • [9] Deep Learning-Based Intra Mode Derivation for Versatile Video Coding
    Zhu, Linwei
    Zhang, Yun
    Li, Na
    Jiang, Gangyi
    Kwong, Sam
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [10] Deep Learning-Based Luma and Chroma Fractional Interpolation in Video Coding
    Pham, Chi Do-Kim
    Zhou, Jinjia
    [J]. IEEE ACCESS, 2019, 7 : 112535 - 112543