Feature-Space Optimization-Inspired and Multi-Hypothesis Cross-Attention Reconstruction Neural Network for Video Compressive Sensing

被引:0
|
作者
Yang, Chunling [1 ]
Chen, Wenjun [1 ]
Liu, Jiahui [1 ]
机构
[1] School of Electronic and Information Engineering, South China University of Technology, Guangdong, Guangzhou,510640, China
关键词
Building materials - Convolutional neural networks - Electric towers - Image compression - Image reconstruction - Linear programming - Motion compensation - Motion estimation - Optical correlation - Religious buildings - Shape optimization - Water towers;
D O I
10.12141/j.issn.1000-565X.230578
中图分类号
学科分类号
摘要
The existing video compressive sensing reconstruction network usually uses the optical flow network to achieve pixel domain motion estimation and motion compensation. However, during the reconstruction process, the input of the optical flow network is the estimated frame with poor quality, resulting in inaccurate optical flow. The optical flow-based pixel domain alignment and fusion operation will cause noise accumulation, lead to obvious artificial effects in video reconstruction frames and affect the reconstruction quality. Based on the fact that multichannel information in the feature space has strong robustness to interference noise, this paper applied the idea of feature space optimization to the design of the video compressive sensing reconstruction neural network, and proposed a feature-space optimization-inspired and flow-guided multi-hypothesis cross-attention network (FOFMCNet). To avoid the image structure destruction caused by the noise in the optical flow when warping the image, the study designed multi-hypothesis motion estimation module guided by optical flow and the motion compensation module based on cross-attention to realize the motion estimation and motion compensation of inter-frame in feature space, so as to make full use of inter-frame correlation to assist non-key frame reconstruction. In order to strengthen the reuse of effective information in the process of feature optimization, improve the learning ability of the network and alleviate the problem of gradient explosion, this paper designed a feature-space optimization-inspired u-shape network (FOUNet) as a sub-network of FOFMCNet. Through the cascade of multiple FOUNets, the FOFMCNet realizes the optimization and reconstruction of non-key frames in the feature space. Experimental results show that the reconstruction results of the proposed algorithm are obviously better than those of the existing video compression sensing algorithms on the classical low-resolution dataset (UCF-101 and QCIF) and new high-resolution dataset (REDS4). © 2024 South China University of Technology. All rights reserved.
引用
收藏
页码:9 / 21
相关论文
共 8 条
  • [1] Feature-Space Optimization-Inspired and Self-Attention Enhanced Neural Network Reconstruction Algorithm for Image Compressive Sensing
    Chen W.-J.
    Yang C.-L.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (11): : 2629 - 2637
  • [2] FSOINET: FEATURE-SPACE OPTIMIZATION-INSPIRED NETWORK FOR IMAGE COMPRESSIVE SENSING
    Chen, Wenjun
    Yang, Chunling
    Yang, Xin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2460 - 2464
  • [3] Optimization-Inspired Cross-Attention Transformer for Compressive Sensing
    Song, Jiechong
    Mou, Chong
    Wang, Shiqi
    Ma, Siwei
    Zhang, Jian
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6174 - 6184
  • [4] Feature-Domain Multi-Hypothesis Prediction Neural Network for Compressed Video Sensing Reconstruction
    Yang C.
    Ling X.
    Lü Z.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2022, 50 (06): : 80 - 90
  • [5] Residual Reconstruction Algorithm Based on Half-Pixel Multi-Hypothesis Prediction for Distributed Compressive Video Sensing
    Tong, Ying
    Chen, Rui
    Yang, Jie
    Wu, Minghu
    INTERNATIONAL JOURNAL OF MOBILE COMPUTING AND MULTIMEDIA COMMUNICATIONS, 2018, 9 (04) : 16 - 33
  • [6] Residual Reconstruction Algorithm Based on Sub-pixel Multi-hypothesis Prediction for Distributed Compressive Video Sensing
    Chen, Rui
    Tong, Ying
    Yang, Jie
    Wu, Minghu
    COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS, 2019, 772 : 599 - 605
  • [7] Two-Stage Multi-Hypothesis Network for Compressed Video Sensing Reconstruction Algorithms Based on Deep Learning
    Yang C.
    Ling X.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2021, 49 (06): : 88 - 99
  • [8] Adaptive Multi-Feature Fusion Visual Target Tracking Based on Siamese Neural Network with Cross-Attention Mechanism
    Zhou, Qian
    Xia, Haoran
    Yan, Hongzheng
    Yang, Ming
    Chen, Shidong
    2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 307 - 316