VMG: Rethinking U-Net Architecture for Video Super-Resolution

Cited by: 0
Authors
Tang, Jun [1]
Niu, Lele [1]
Liu, Linlin [1]
Dai, Hang [2]
Ding, Yong [1]
Affiliations
[1] Zhejiang Univ, Coll Integrated Circuits, Hangzhou 310000, Peoples R China
[2] Univ Glasgow, Sch Comp Sci, Glasgow G12 8QQ, Scotland
Keywords
Computer architecture; Data mining; Superresolution; Mixers; Feature extraction; Computational modeling; Transformers; Logic gates; Correlation; Decoding; Video super-resolution; U-Net architecture; spatial-temporal; complexity; MLP
DOI
10.1109/TBC.2024.3486967
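Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract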
The U-Net architecture has exhibited significant efficacy across various vision tasks, yet its adaptation for Video Super-Resolution (VSR) remains underexplored. While the Video Restoration Transformer (VRT) introduced U-Net into the VSR domain, it poses challenges due to intricate design and substantial computational overhead. In this paper, we present VMG, a streamlined framework tailored for VSR. Through empirical analysis, we identify the crucial stages of the U-Net architecture contributing to performance enhancement in VSR tasks. Our optimized architecture substantially reduces model parameters and complexity while improving performance. Additionally, we introduce two key modules, namely the Gated MLP-like Mixer (GMM) and the Flow-Guided cross-attention Mixer (FGM), designed to enhance spatial and temporal feature aggregation. GMM dynamically encodes spatial correlations with linear complexity in space and time, and FGM leverages optical flow to capture motion variation and implement sparse attention to efficiently aggregate temporally related information. Extensive experiments demonstrate that VMG achieves nearly 70% reduction in GPU memory usage, 30% fewer parameters, and 10% lower computational complexity (FLOPs) compared to VRT, while yielding highly competitive or superior results across four benchmark datasets. Qualitative assessments reveal VMG's ability to preserve remarkable details and sharp structures in the reconstructed videos.
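The abstract's claim that gating can encode spatial correlations at linear cost can be illustrated with a small sketch. This is not the paper's GMM: the function names, the 1-D token layout, the windowed-mean context, and the `radius` parameter are all assumptions made for illustration. It only shows why a gate computed from a fixed-size neighborhood scales linearly with the number of tokens, whereas dense attention scales quadratically.

```python
import math


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def gated_local_mix(tokens, radius=1):
    """Gate each token by a sigmoid of its local neighborhood mean.

    Hypothetical illustration, not the GMM module from the paper.
    tokens: list of feature vectors (lists of floats), one per position.
    Each output reads at most 2*radius+1 neighbors, so the total cost is
    O(N * radius * d) for N tokens of width d, i.e. linear in N.
    """
    n = len(tokens)
    out = []
    for i, x in enumerate(tokens):
        lo, hi = max(0, i - radius), min(n, i + radius + 1)
        neighbors = tokens[lo:hi]
        # Channel-wise mean over the local window gives the spatial context.
        context = [sum(col) / len(neighbors) for col in zip(*neighbors)]
        # Element-wise gate: the context decides how much of each channel passes.
        out.append([xi * sigmoid(ci) for xi, ci in zip(x, context)])
    return out


def pairwise_interactions(n, radius=None):
    """Count token pairs touched: dense attention (radius=None) vs. a local window."""
    if radius is None:
        return n * n
    return sum(min(n, i + radius + 1) - max(0, i - radius) for i in range(n))
```

For a fixed `radius`, `pairwise_interactions(n, radius)` grows proportionally to `n`, while `pairwise_interactions(n)` grows as `n * n`; that gap is the motivation for gated or sparse mixing on high-resolution video frames.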
Pages: 334-349
Page count: 16