Multi-modal deep convolutional dictionary learning for image denoising

被引:4
|
作者
Sun, Zhonggui [1 ,2 ]
Zhang, Mingzhu [1 ]
Sun, Huichao [1 ]
Li, Jie [2 ]
Liu, Tingting [3 ]
Gao, Xinbo [3 ]
机构
[1] Liaocheng Univ, Sch Math Sci, Liaocheng 252000, Peoples R China
[2] Xidian Univ, Sch Elect Engn, Video & Image Proc Syst Lab, Xian 710071, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep convolutional dictionary learning; Multi-modal; Channel attention; Image denoising; SPARSE; REMOVAL;
D O I
10.1016/j.neucom.2023.126918
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Leveraging the capabilities of traditional dictionary learning (DicL) and drawing upon the success of deep neural networks (DNNs), the recently proposed framework of deep convolutional dictionary learning (DCDicL) has exhibited remarkable behaviours in image denoising. Note that, the application of the DCDicL method is confined to single modality scenarios, whereas the images in practice often originate from diverse modalities. In this paper, to broaden the application scope of the DCDicL method, we design a multi-modal version of it, dubbed MMDCDicL. Specifically, within the mathematical model of MMDCDicL, we adopt an analytical approach to tackle the sub-problem linked to the guidance modality, harnessing its inherent reliability. Meanwhile, like in DCDicL, we utilize a network-based learning approach for the noisy modality to extract trustworthy information from the data. Based on the solution, we establish an interpretable network structure for MMDCDicL. Additionally, wherein, we design a multi-kernel channel attention block (MKCAB) in the structure to efficiently integrate the information from diverse modalities. Experimental results suggest that MMDCDicL can reconstruct higher-quality outcomes both quantitatively and perceptually. Code is available at http://www.diplab.net/lunwen/mmdcdicl.htm.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] CIRF: Coupled Image Reconstruction and Fusion Strategy for Deep Learning Based Multi-Modal Image Fusion
    Zheng, Junze
    Xiao, Junyan
    Wang, Yaowei
    Zhang, Xuming
    SENSORS, 2024, 24 (11)
  • [42] Image Denoising using Deep Learning: Convolutional Neural Network
    Ghose, Shreyasi
    Singh, Nishi
    Singh, Prabhishek
    PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, 2020, : 511 - 517
  • [43] Multi-Modal Convolutional Parameterisation Network for Guided Image Inverse Problems
    Czerkawski, Mikolaj
    Upadhyay, Priti
    Davison, Christopher
    Atkinson, Robert
    Michie, Craig
    Andonovic, Ivan
    Macdonald, Malcolm
    Cardona, Javier
    Tachtatzis, Christos
    JOURNAL OF IMAGING, 2024, 10 (03)
  • [44] Splenomegaly Segmentation on Multi-Modal MRI Using Deep Convolutional Networks
    Huo, Yuankai
    Xu, Zhoubing
    Bao, Shunxing
    Bermudez, Camilo
    Moon, Hyeonsoo
    Parvathaneni, Prasanna
    Moyo, Tamara K.
    Savona, Michael R.
    Assad, Albert
    Abramson, Richard G.
    Landman, Bennett A.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (05) : 1185 - 1196
  • [45] Dictionary-Induced Manifold Learning for Incomplete Multi-modal Fusion
    Xu, Bingliang
    Ye, Haizhou
    Zhang, Zheng
    Zhang, Daoqiang
    Zhu, Qi
    WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422 : 529 - 537
  • [46] Deep Multi-Modal Metric Learning with Multi-Scale Correlation for Image-Text Retrieval
    Hua, Yan
    Yang, Yingyun
    Du, Jianhe
    ELECTRONICS, 2020, 9 (03)
  • [47] Three-dimensional seismic denoising based on deep convolutional dictionary learning
    Li, Yuntong
    Liu, Lina
    RESULTS IN APPLIED MATHEMATICS, 2024, 24
  • [48] Multi-modal anchor adaptation learning for multi-modal summarization
    Chen, Zhongfeng
    Lu, Zhenyu
    Rong, Huan
    Zhao, Chuanjun
    Xu, Fan
    NEUROCOMPUTING, 2024, 570
  • [49] Dynamic Deep Multi-modal Fusion for Image Privacy Prediction
    Tonge, Ashwini
    Caragea, Cornelia
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 1829 - 1840
  • [50] Robust multi-modal pedestrian detection using deep convolutional neural network with ensemble learning model
    Jain, Deepak Kumar
    Zhao, Xudong
    Garcia, Salvador
    Neelakandan, Subramani
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249