Feature-Space Optimization-Inspired and Multi-Hypothesis Cross-Attention Reconstruction Neural Network for Video Compressive Sensing

Cited by: 0

Authors:

Yang, Chunling [1]

Chen, Wenjun [1]

Liu, Jiahui [1]

Affiliation:

[1] School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510640, Guangdong, China

Source:

Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science) | 2024, Vol. 52, No. 10

Keywords:

Building materials - Convolutional neural networks - Electric towers - Image compression - Image reconstruction - Linear programming - Motion compensation - Motion estimation - Optical correlation - Religious buildings - Shape optimization - Water towers;

DOI:

10.12141/j.issn.1000-565X.230578

CLC number:

Subject classification number:

Abstract:

Existing video compressive sensing reconstruction networks usually rely on an optical flow network to perform motion estimation and motion compensation in the pixel domain. During reconstruction, however, the input to the optical flow network is a poor-quality estimated frame, so the estimated optical flow is inaccurate. Optical flow-based alignment and fusion in the pixel domain then accumulate noise, which produces obvious artifacts in the reconstructed video frames and degrades reconstruction quality. Because multichannel information in the feature space is highly robust to interference noise, this paper applies the idea of feature-space optimization to the design of a video compressive sensing reconstruction neural network and proposes a feature-space optimization-inspired and flow-guided multi-hypothesis cross-attention network (FOFMCNet). To avoid the destruction of image structure caused by optical flow noise when warping images, the study designs an optical flow-guided multi-hypothesis motion estimation module and a cross-attention-based motion compensation module. These modules perform inter-frame motion estimation and motion compensation in the feature space, making full use of inter-frame correlation to assist non-key frame reconstruction. To strengthen the reuse of effective information during feature optimization, improve the learning ability of the network, and alleviate the problem of gradient explosion, the paper designs a feature-space optimization-inspired U-shaped network (FOUNet) as a sub-network of FOFMCNet. By cascading multiple FOUNets, FOFMCNet optimizes and reconstructs non-key frames in the feature space. Experimental results show that the reconstructions of the proposed algorithm are clearly better than those of existing video compressive sensing algorithms on the classical low-resolution datasets (UCF-101 and QCIF) and the new high-resolution dataset (REDS4). © 2024 South China University of Technology. All rights reserved.
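The abstract does not specify how the cross-attention motion compensation module is built, so the following is only a minimal sketch of the general idea, under stated assumptions: features of the non-key frame act as queries, candidate (hypothesis) features from a key frame act as keys and values, and each query position receives a softmax-weighted blend of the hypotheses. The function names `softmax` and `cross_attention_compensate` are hypothetical, and the learned projections, optical flow guidance, and FOUNet stages of FOFMCNet are deliberately omitted.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention_compensate(query_feats, ref_feats):
    """Illustrative feature-space compensation (not the paper's module).

    query_feats: feature vectors of the non-key frame (queries).
    ref_feats:   hypothesis feature vectors from a key frame (keys and values).
    Returns the blended features and the attention weights per query.
    """
    dim = len(query_feats[0])
    scale = 1.0 / math.sqrt(dim)  # standard scaled dot-product factor
    compensated, weights = [], []
    for q in query_feats:
        # Similarity between this query and every hypothesis.
        scores = [scale * sum(a * b for a, b in zip(q, r)) for r in ref_feats]
        w = softmax(scores)
        # Weighted sum of hypothesis features, channel by channel.
        blended = [
            sum(w[j] * ref_feats[j][c] for j in range(len(ref_feats)))
            for c in range(dim)
        ]
        compensated.append(blended)
        weights.append(w)
    return compensated, weights


# Toy data: one query position and three hypotheses with two channels each.
queries = [[4.0, 0.0]]
hypotheses = [[4.0, 0.0], [0.0, 4.0], [-4.0, 0.0]]
compensated, weights = cross_attention_compensate(queries, hypotheses)
```

In this toy case the first hypothesis matches the query and therefore dominates the blend. Because the output is a soft mixture of several hypotheses, a single noisy match has limited influence, which is in line with the abstract's motivation for avoiding hard pixel-domain warping.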

Citations

Pages: 9 / 21

8 entries in total

[1] Feature-Space Optimization-Inspired and Self-Attention Enhanced Neural Network Reconstruction Algorithm for Image Compressive Sensing
Chen W.-J.
Yang C.-L.
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (11): 2629-2637
[2] FSOINET: FEATURE-SPACE OPTIMIZATION-INSPIRED NETWORK FOR IMAGE COMPRESSIVE SENSING
Chen, Wenjun
Yang, Chunling
Yang, Xin
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022: 2460-2464
[3] Optimization-Inspired Cross-Attention Transformer for Compressive Sensing
Song, Jiechong
Mou, Chong
Wang, Shiqi
Ma, Siwei
Zhang, Jian
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023: 6174-6184
[4] Feature-Domain Multi-Hypothesis Prediction Neural Network for Compressed Video Sensing Reconstruction
Yang C.
Ling X.
Lü Z.
Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2022, 50 (06): 80-90
[5] Residual Reconstruction Algorithm Based on Half-Pixel Multi-Hypothesis Prediction for Distributed Compressive Video Sensing
Tong, Ying
Chen, Rui
Yang, Jie
Wu, Minghu
INTERNATIONAL JOURNAL OF MOBILE COMPUTING AND MULTIMEDIA COMMUNICATIONS, 2018, 9 (04): 16-33
[6] Residual Reconstruction Algorithm Based on Sub-pixel Multi-hypothesis Prediction for Distributed Compressive Video Sensing
Chen, Rui
Tong, Ying
Yang, Jie
Wu, Minghu
COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS, 2019, 772: 599-605
[7] Two-Stage Multi-Hypothesis Network for Compressed Video Sensing Reconstruction Algorithms Based on Deep Learning
Yang C.
Ling X.
Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2021, 49 (06): 88-99
[8] Adaptive Multi-Feature Fusion Visual Target Tracking Based on Siamese Neural Network with Cross-Attention Mechanism
Zhou, Qian
Xia, Haoran
Yan, Hongzheng
Yang, Ming
Chen, Shidong
2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022: 307-316
