Multi-scale Multi-attention Network for Moire Document Image Binarization

被引:5
|
作者
Guo, Yanqing [1 ,2 ]
Ji, Caijuan [1 ]
Zheng, Xin [1 ]
Wang, Qianyu [1 ]
Luo, Xiangyang [3 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Peoples R China
[2] Key Lab Artificial Intelligence Percept & Underst, Shenyang, Liaoning, Peoples R China
[3] State Key Lab Math Engn & Adv Comp, Zhengzhou 450001, Peoples R China
基金
中国国家自然科学基金;
关键词
Moire patterns; Document Image Binarization; Multi-scale Multi-attention Network;
D O I
10.1016/j.image.2020.116046
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a Multi-scale Multi-attention Network (MsMa-Net) to binarize document images contaminated by moire patterns from camera-captured screens. Given a polluted image, MsMa-Net first learns to distinguish clean features from contaminated ones at different spatial scales via a Multi-scale feature extraction submodule (Ms-sub). In this way, detailed text information could be preserved as much as possible. Meanwhile, moire patterns could be purified preliminarily. Then, obtained multi-scale features are adaptively interweaved through a proposed Multi-attention submodule (Ma-sub) at the channel level, the spatial level, and the correlation level, respectively. By modelling such relationships among multi-scale features, Ma-sub can further highlight text contents and suppress moire patterns for yielding clean demoire document images. All the demoire images flow to a proposed Binarization submodule (Bi-sub) to produce final high-quality binarized document images. Besides, considering the scarce data support for the moire document image binarization task, we create a new Moire Document Image (MoDI) dataset for training and evaluating the proposed model. Extensive experiments demonstrate that MsMa-Net achieves state-of-the-art performance over several available datasets and MoDI dataset.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] MsRAN: a multi-scale residual attention network for multi-model image fusion
    Wang, Jing
    Yu, Long
    Tian, Shengwei
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (12) : 3615 - 3634
  • [32] MsRAN: a multi-scale residual attention network for multi-model image fusion
    Jing Wang
    Long Yu
    Shengwei Tian
    [J]. Medical & Biological Engineering & Computing, 2022, 60 : 3615 - 3634
  • [33] Multi-Attention Object Detection Model in Remote Sensing Images Based on Multi-Scale
    Ying, Xiang
    Wang, Qiang
    Li, Xuewei
    Yu, Mei
    Jiang, Han
    Gao, Jie
    Liu, Zhiqiang
    Yu, Ruiguo
    [J]. IEEE ACCESS, 2019, 7 : 94508 - 94519
  • [34] Msap: multi-scale attention probabilistic network for underwater image enhancement network
    Chang, Baocai
    Li, Jinjiang
    Wang, Haiyang
    Li, Mengjun
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 653 - 661
  • [35] MMSMAPlus: a multi-view multi-scale multi-attention embedding model for protein function prediction
    Wang, Zhongyu
    Deng, Zhaohong
    Zhang, Wei
    Lou, Qiongdan
    Choi, Kup-Sze
    Wei, Zhisheng
    Wang, Lei
    Wu, Jing
    [J]. BRIEFINGS IN BIOINFORMATICS, 2023, 24 (04)
  • [36] Hierarchical multi-attention networks for document classification
    Huang, Yingren
    Chen, Jiaojiao
    Zheng, Shaomin
    Xue, Yun
    Hu, Xiaohui
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (06) : 1639 - 1647
  • [37] Multi-Scale Attention Feature Enhancement Network for Single Image Dehazing
    Dong, Weida
    Wang, Chunyan
    Sun, Hao
    Teng, Yunjie
    Xu, Xiping
    [J]. SENSORS, 2023, 23 (19)
  • [38] MGTANet: Multi-Scale Guided Token Attention Network for Image Captioning
    Jia, Wenhao
    Wang, Ronggui
    Yang, Juan
    Xua, Lixia
    [J]. PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 237 - 245
  • [39] HYPERSPECTRAL IMAGE CLASSIFICATION VIA MULTI-SCALE RESIDUAL ATTENTION NETWORK
    Xie, Wen
    Wu, Qinzhe
    Ren, Wen
    Zhang, Yuzhuo
    [J]. IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 7649 - 7652
  • [40] MAFUNet: Multi-Attention Fusion Network for Medical Image Segmentation
    Wang, Lili
    Zhao, Jiayu
    Yang, Hailu
    [J]. IEEE ACCESS, 2023, 11 : 109793 - 109802