Multi-scale Multi-attention Network for Moire Document Image Binarization

被引:5
|
作者
Guo, Yanqing [1 ,2 ]
Ji, Caijuan [1 ]
Zheng, Xin [1 ]
Wang, Qianyu [1 ]
Luo, Xiangyang [3 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Peoples R China
[2] Key Lab Artificial Intelligence Percept & Underst, Shenyang, Liaoning, Peoples R China
[3] State Key Lab Math Engn & Adv Comp, Zhengzhou 450001, Peoples R China
基金
中国国家自然科学基金;
关键词
Moire patterns; Document Image Binarization; Multi-scale Multi-attention Network;
D O I
10.1016/j.image.2020.116046
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a Multi-scale Multi-attention Network (MsMa-Net) to binarize document images contaminated by moire patterns from camera-captured screens. Given a polluted image, MsMa-Net first learns to distinguish clean features from contaminated ones at different spatial scales via a Multi-scale feature extraction submodule (Ms-sub). In this way, detailed text information could be preserved as much as possible. Meanwhile, moire patterns could be purified preliminarily. Then, obtained multi-scale features are adaptively interweaved through a proposed Multi-attention submodule (Ma-sub) at the channel level, the spatial level, and the correlation level, respectively. By modelling such relationships among multi-scale features, Ma-sub can further highlight text contents and suppress moire patterns for yielding clean demoire document images. All the demoire images flow to a proposed Binarization submodule (Bi-sub) to produce final high-quality binarized document images. Besides, considering the scarce data support for the moire document image binarization task, we create a new Moire Document Image (MoDI) dataset for training and evaluating the proposed model. Extensive experiments demonstrate that MsMa-Net achieves state-of-the-art performance over several available datasets and MoDI dataset.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Multi-scale multi-attention network for diabetic retinopathy grading
    Xia, Haiying
    Long, Jie
    Song, Shuxiang
    Tan, Yumei
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (01):
  • [2] A Multi-scale and Multi-attention Network for Skin Lesion Segmentation
    Wu, Cong
    Zhang, Hang
    Chen, Dingsheng
    Gan, Haitao
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2023, PT IV, 2024, 14450 : 537 - 550
  • [3] Document Image Binarization Using "Multi-Scale" Predefined Filters
    Saabni, Raid M.
    [J]. NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [4] A multi-scale multi-attention network for dynamic facial expression recognition
    Xiaohan Xia
    Le Yang
    Xiaoyong Wei
    Hichem Sahli
    Dongmei Jiang
    [J]. Multimedia Systems, 2022, 28 : 479 - 493
  • [5] A multi-scale multi-attention network for dynamic facial expression recognition
    Xia, Xiaohan
    Yang, Le
    Wei, Xiaoyong
    Sahli, Hichem
    Jiang, Dongmei
    [J]. MULTIMEDIA SYSTEMS, 2022, 28 (02) : 479 - 493
  • [6] Remote Sensing Image Change Detection Based on Deep Multi-Scale Multi-Attention Siamese Transformer Network
    Zhang, Mengxuan
    Liu, Zhao
    Feng, Jie
    Liu, Long
    Jiao, Licheng
    [J]. REMOTE SENSING, 2023, 15 (03)
  • [7] Hyperspectral Image Classification Based on Multi-Scale Convolutional Features and Multi-Attention Mechanisms
    Sun, Qian
    Zhao, Guangrui
    Xia, Xinyuan
    Xie, Yu
    Fang, Chenrong
    Sun, Le
    Wu, Zebin
    Pan, Chengsheng
    [J]. REMOTE SENSING, 2024, 16 (12)
  • [8] Multi-Scale Attention Network for Image Cropping
    Lian, Tianpei
    Xian, Ke
    Pan, Zhiyu
    Hong, Chaoyi
    Cao, Zhiguo
    Zhong, Weicai
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2640 - 2645
  • [9] Multi-scale attention network for image inpainting
    Qin, Jia
    Bai, Huihui
    Zhao, Yao
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 204
  • [10] MAXFormer: Enhanced transformer for medical image segmentation with multi-attention and multi-scale features fusion
    Liang, Zhiwei
    Zhao, Kui
    Liang, Gang
    Li, Siyu
    Wu, Yifei
    Zhou, Yiping
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 280