An Efficient Multi-Scale Attention Feature Fusion Network for 4K Video Frame Interpolation

被引:1
|
作者
Ning, Xin [1 ]
Li, Yuhang [1 ]
Feng, Ziwei [1 ]
Liu, Jinhua [1 ]
Ding, Youdong [1 ,2 ]
机构
[1] Shanghai Univ, Coll Shanghai Film, 788 Guangzhong Rd, Shanghai 200072, Peoples R China
[2] Shanghai Engn Res Ctr Mot Picture Special Effects, 788 Guangzhong Rd, Shanghai 200072, Peoples R China
基金
中国国家自然科学基金;
关键词
4K video frame interpolation; 4K video dataset; self-attention; multi-scale; high frame rate;
D O I
10.3390/electronics13061037
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video frame interpolation aims to generate intermediate frames in a video to showcase finer details. However, most methods are only trained and tested on low-resolution datasets, lacking research on 4K video frame interpolation problems. This limitation makes it challenging to handle high-frame-rate video processing in real-world scenarios. In this paper, we propose a 4K video dataset at 120 fps, named UHD4K120FPS, which contains large motion. We also propose a novel framework for solving the 4K video frame interpolation task, based on a multi-scale pyramid network structure. We introduce self-attention to capture long-range dependencies and self-similarities in pixel space, which overcomes the limitations of convolutional operations. To reduce computational cost, we use a simple mapping-based approach to lighten self-attention, while still allowing for content-aware aggregation weights. Through extensive quantitative and qualitative experiments, we demonstrate the excellent performance achieved by our proposed model on the UHD4K120FPS dataset, as well as illustrate the effectiveness of our method for 4K video frame interpolation. In addition, we evaluate the robustness of the model on low-resolution benchmark datasets.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] SSD with multi-scale feature fusion and attention mechanism
    Liu, Qiang
    Dong, Lijun
    Zeng, Zhigao
    Zhu, Wenqiu
    Zhu, Yanhui
    Meng, Chen
    SCIENTIFIC REPORTS, 2023, 13 (01):
  • [22] SSD with multi-scale feature fusion and attention mechanism
    Qiang Liu
    Lijun Dong
    Zhigao Zeng
    Wenqiu Zhu
    Yanhui Zhu
    Chen Meng
    Scientific Reports, 13 (1)
  • [23] An Efficient Video Coding System With an Adaptive Overfitted Multi-Scale Attention Network
    He, Gang
    Wu, Chang
    Xu, Li
    Li, Lei
    Xu, Ziyao
    Xie, Weiying
    Li, Yunsong
    IEEE ACCESS, 2021, 9 : 64022 - 64032
  • [24] Video Frame Interpolation via Multi-scale Expandable Deformable Convolution
    Zhang, Dengyong
    Huang, Pu
    Ding, Xiangling
    Li, Feng
    Yang, Gaobo
    PROCEEDINGS OF THE 2023 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH&MMSEC 2023, 2023, : 19 - 28
  • [25] MLANet: multi-level attention network with multi-scale feature fusion for crowd counting
    Xiong, Liyan
    Zeng, Yijuan
    Huang, Xiaohui
    Li, Zhida
    Huang, Peng
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (05): : 6591 - 6608
  • [26] Multi-scale high and low feature fusion attention network for intestinal image classification
    Li, Sheng
    Zhu, Beibei
    Guo, Xinran
    Ye, Shufang
    Ye, Jietong
    Zhuang, Yongwei
    He, Xiongxiong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (06) : 2877 - 2886
  • [27] Collaborative Attention Guided Multi-Scale Feature Fusion Network for Medical Image Segmentation
    Xu, Zhenghua
    Tian, Biao
    Liu, Shijie
    Wang, Xiangtao
    Yuan, Di
    Gu, Junhua
    Chen, Junyang
    Lukasiewicz, Thomas
    Leung, Victor C. M.
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02): : 1857 - 1871
  • [28] Siamese Network with Multi-scale Feature Fusion and Dual Attention Mechanism for Template Matching
    Zhao, Kai
    He, Binbing
    Pan, Shiju
    Zhu, Yuan
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6588 - 6592
  • [29] Multi-scale high and low feature fusion attention network for intestinal image classification
    Sheng Li
    Beibei Zhu
    Xinran Guo
    Shufang Ye
    Jietong Ye
    Yongwei Zhuang
    Xiongxiong He
    Signal, Image and Video Processing, 2023, 17 : 2877 - 2886
  • [30] Multi-Scale Feature Fusion Attention Network for Building Extraction in Remote Sensing Images
    Liu, Jia
    Gu, Hang
    Li, Zuhe
    Chen, Hongyang
    Chen, Hao
    ELECTRONICS, 2024, 13 (05)