From Patch to Pixel: A Transformer-Based Hierarchical Framework for Compressive Image Sensing

被引:12
|
作者
Gan, Hongping [1 ,2 ]
Shen, Minghe [1 ,2 ]
Hua, Yi [3 ]
Ma, Chunyan [1 ]
Zhang, Tao [4 ]
机构
[1] Northwestern Polytech Univ, Sch Software, Taicang 215400, Peoples R China
[2] Northwestern Polytech Univ, Yangtze River Delta Res Inst, Taicang 215400, Peoples R China
[3] Northwestern Polytech Univ, Sch Aeronaut, Xian 710072, Peoples R China
[4] Shanghai Jiao Tong Univ, Shanghai Key Lab Intelligent Sensing & Recognit, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Image reconstruction; Transformers; Imaging; Decoding; Computer architecture; Sensors; Image coding; Compressive sensing; image reconstruction; patch-to-pixel; transformer; RECONSTRUCTION; NETWORK;
D O I
10.1109/TCI.2023.3244396
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The convolutional neural network (CNN)-based reconstruction methods have dominated the compressive sensing (CS) in recent years. However, existing CNN-based approaches show potential restrictions in capturing non-local similarity of images, because of the intrinsic characteristic of convolutional layers, i.e., locality and weight sharing. In parallel, the emerging Transformer architecture shows fine capacity in modeling long-distance correlations onto embedded tokens for language and images. Yet vanilla Transformer does not exceed CNN-based networks considerably but shows roughly comparable performance, and the culprit can be the missing of sophisticated inductive bias regarding the local image structures. In this article, to eliminate the restrictions of the aforementioned paradigms, we propose a Transformer-based hierarchical framework, dubbed TCS-Net, for compressive image sensing (or image compressive sensing) with a patch-to-pixel manner. Concretely, the proposed TCS-Net consists of an image acquisition module and a reconstruction module (includes two key decoding phases: a patch-wise decoding phase and a pixel-wise decoding phase). The acquisition module can implement data-driven image sampling by jointly learning with the decoding phases. By adjusting the Transformer architecture to the patch-to-pixel multi-stage pattern, our reconstruction module can gradually decode the CS measurements from the patch-wise outlines to the pixel-wise textures, thereby building a high-precision mapping for image reconstruction. Extensive experiments on several datasets verify that the proposed TCS-Net outperforms existing state-of-the-art image CS methods by considerable margins.
引用
收藏
页码:133 / 146
页数:14
相关论文
共 50 条
  • [1] Hierarchical Image Generation via Transformer-Based Sequential Patch Selection
    Xu, Xiaogang
    Xu, Ning
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2938 - 2945
  • [2] A transformer-based hierarchical registration framework for multimodality deformable image registration
    Zhao, Yao
    Chen, Xinru
    Mcdonald, Brigid
    Yu, Cenji
    Mohamed, Abdalah S. R.
    Fuller, Clifton D.
    Court, Laurence E.
    Pan, Tinsu
    Wang, He
    Wang, Xin
    Phan, Jack
    Yang, Jinzhong
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2023, 108
  • [3] FusionOpt-Net: A Transformer-Based Compressive Sensing Reconstruction Algorithm
    Zhang, Honghao
    Chen, Bi
    Gao, Xianwei
    Yao, Xiang
    Hou, Linyu
    [J]. SENSORS, 2024, 24 (18)
  • [4] A truncated patch-group-based hierarchical reconstruction model for color image compressive sensing
    Jain, Abhishek
    Swami, Preety D.
    Datar, Ashutosh
    [J]. SOFT COMPUTING, 2023,
  • [5] TransCS: A Transformer-Based Hybrid Architecture for Image Compressed Sensing
    Shen, Minghe
    Gan, Hongping
    Ning, Chao
    Hua, Yi
    Zhang, Tao
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6991 - 7005
  • [6] Image Reconstruction from Patch Compressive Sensing Measurements
    Wang, Yahan
    Bai, Huihui
    Zhao, Yao
    [J]. 2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [7] Transformer-based contrastive learning framework for image anomaly detection
    Fan, Wentao
    Shangguan, Weimin
    Chen, Yewang
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (10) : 3413 - 3426
  • [8] Transformer-based contrastive learning framework for image anomaly detection
    Wentao Fan
    Weimin Shangguan
    Yewang Chen
    [J]. International Journal of Machine Learning and Cybernetics, 2023, 14 : 3413 - 3426
  • [9] Transformer-based Image Compression
    Lu, Ming
    Guo, Peiyao
    Shi, Huiqing
    Cao, Chuntong
    Ma, Zhan
    [J]. DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 469 - 469
  • [10] A hierarchical framework for recovery in compressive sensing
    Colbourn, Charles J.
    Horsley, Daniel
    Syrotiuk, Violet R.
    [J]. DISCRETE APPLIED MATHEMATICS, 2018, 236 : 96 - 107