Shortcut-V2V: Compression Framework for Video-to-Video Translation based on Temporal Redundancy Reduction

被引:0
|
作者
Chung, Chaeyeon [1 ]
Park, Yeojeong [1 ,2 ]
Choi, Seunghwan [1 ]
Ganbat, Munkhsoyol [1 ]
Choo, Jaegul [1 ]
机构
[1] IKAIST AI, Daejeon, South Korea
[2] KT Corp, KT Res & Dev Ctr, Seongnam Si, South Korea
关键词
D O I
10.1109/ICCV51070.2023.00700
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video-to-video translation aims to generate video frames of a target domain from an input video. Despite its usefulness, the existing networks require enormous computations, necessitating their model compression for wide use. While there exist compression methods that improve computational efficiency in various image/video tasks, a generally-applicable compression method for video-to-video translation has not been studied much. In response, we present Shortcut-V2V, a general-purpose compression framework for video-to-video translation. Shortcut-V2V avoids full inference for every neighboring video frame by approximating the intermediate features of a current frame from those of the previous frame. Moreover, in our framework, a newly-proposed block called AdaBD adaptively blends and deforms features of neighboring frames, which makes more accurate predictions of the intermediate features possible. We conduct quantitative and qualitative evaluations using well-known video-to-video translation models on various tasks to demonstrate the general applicability of our framework. The results show that Shortcut-V2V achieves comparable performance compared to the original video-to-video translation model while saving 3.2-5.7x computational cost and 7.8-44x memory at test time. Our code and videos are available at https://shortcut-v2v.github.io/.
引用
收藏
页码:7578 / 7588
页数:11
相关论文
共 24 条
  • [1] Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis
    Zhuo, Long
    Wang, Guangcong
    Li, Shikai
    Wu, Wayne
    Liu, Ziwei
    COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 289 - 305
  • [2] An Efficient Temporal Redundancy Transformation for Wavelet Based Video Compression
    Sowmyayani, S.
    Rani, P. Arockia Jansi
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2016, 16 (03)
  • [3] Reduction of Video Compression Artifacts Based on Deep Temporal Networks
    Soh, Jae Woong
    Park, Jaewoo
    Kim, Yoonsik
    Ahn, Byeongyong
    Lee, Hyun-Seung
    Moon, Young-Su
    Cho, Nam Ik
    IEEE ACCESS, 2018, 6 : 63094 - 63106
  • [4] I2V-GAN: Unpaired Infrared-to-Visible Video Translation
    Li, Shuang
    Han, Bingfeng
    Yu, Zhenjie
    Liu, Chi Harold
    Chen, Kai
    Wang, Shuigen
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3061 - 3069
  • [5] Deepfake Video Detection Based on EfficientNet-V2 Network
    Deng, Liwei
    Suo, Hongfei
    Li, Dongjie
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [6] ED-T2V: An Efficient Training Framework for Diffusion-based Text-to-Video Generation
    Liu, Jiawei
    Wang, Weining
    Liu, Wei
    He, Qian
    Liu, Jing
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [7] Neural Network-Based Video Compression Artifact Reduction Using Temporal Correlation and Sparsity Prior Predictions
    Chen, Wei-Gang
    Yu, Runyi
    Wang, Xun
    IEEE ACCESS, 2020, 8 : 162479 - 162490
  • [8] Cluster Based V2V Communications for Enhanced QoS of SVC Video Streaming over Vehicular Networks
    Yaacoub, Elias
    Filali, Fethi
    2014 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING CONFERENCE (IWCMC), 2014, : 678 - 683
  • [9] Consortium blockchain-based secure cross-operator V2V video content distribution
    Shen, Hang
    Zhang, Beining
    Wang, Tianjing
    Liu, Xin
    Bai, Guangwei
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2024, 17 (03) : 1631 - 1644
  • [10] Blockchain-Based Dashcam Video Management Method for Data Sharing and Integrity in V2V Network
    Na, Dongjun
    Park, Sejin
    IEEE ACCESS, 2022, 10 : 3307 - 3319