DDocE: Deep Document Enhancement with Multi-scale Feature Aggregation and Pixel-Wise Adjustments

被引:0
|
作者
Bogdan, Karina O. M. [1 ]
Megeto, Guilherme A. S. [1 ]
Leal, Rovilson [1 ]
Souza, Gustavo [1 ]
Valente, Augusto C. [1 ]
Kirsten, Lucas N. [2 ]
机构
[1] Inst Pesquisas Eldorado, Campinas, SP, Brazil
[2] HP Inc, Porto Alegre, RS, Brazil
关键词
Deep learning; Document enhancement; Image enhancement;
D O I
10.1007/978-3-030-86198-8_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Digitizing a document with a smartphone might be a difficult task when there are suboptimal environment lighting conditions. The presence of shadows and insufficient illumination may reduce the quality of the content, such as its readability, colors or other aesthetic aspect. In this work, we propose a lightweight neural network to enhance photographed document images using a feature extraction that aggregates multi-scale features and a pixel-wise adjustment refinement step. We also provide a comparison with different methods, including methods not originally proposed for document enhancement. We focused on aesthetics aspects of the images, for which we used traditional image quality assessment (IQA) metrics and others based on deep learning models of human quality perception of natural images. Our deep document enhancement (DDocE) method was able to lessen the negative effects of different artifacts, such as shadows and insufficient illumination, while also maintaining a good color consistency, resulting in a better final enhanced image than the ones obtained with other methods.
引用
收藏
页码:229 / 244
页数:16
相关论文
共 50 条
  • [1] A Pixel-Wise Foreign Object Debris Detection Method Based on Multi-Scale Feature Inpainting
    Jing, Ying
    Zheng, Hong
    Zheng, Wentao
    Dong, Kaihan
    AEROSPACE, 2022, 9 (09)
  • [2] A new multi-scale CNN with pixel-wise attention for image denoising
    Deng, Jibin
    Hu, Chaohua
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (03) : 2733 - 2741
  • [3] A new multi-scale CNN with pixel-wise attention for image denoising
    Jibin Deng
    Chaohua Hu
    Signal, Image and Video Processing, 2024, 18 : 2733 - 2741
  • [4] Multi-scale convolutional neural network for pixel-wise reconstruction of Van Gogh’s drawings
    Y. Zeng
    J. C. A. van der Lubbe
    M. Loog
    Machine Vision and Applications, 2019, 30 : 1229 - 1241
  • [5] Multi-scale convolutional neural network for pixel-wise reconstruction of Van Gogh's drawings
    Zeng, Y.
    van der Lubbe, J. C. A.
    Loog, M.
    MACHINE VISION AND APPLICATIONS, 2019, 30 (7-8) : 1229 - 1241
  • [6] Evaluation of Deep Feature Reconstruction for Pixel-Wise Hyperspectral Image Classification
    Okada, Junkei
    Ozasa, Yuko
    ITE TRANSACTIONS ON MEDIA TECHNOLOGY AND APPLICATIONS, 2024, 12 (03): : 190 - 196
  • [7] Deep Photometric Stereo Network with Multi-Scale Feature Aggregation
    Yu, Chanki
    Lee, Sang Wook
    SENSORS, 2020, 20 (21) : 1 - 14
  • [8] Real-time pixel-wise grasp affordance prediction based on multi-scale context information fusion
    Wu, Yongxiang
    Fu, Yili
    Wang, Shuguo
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2022, 49 (02): : 368 - 381
  • [9] Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition
    Jiang, Shuqiang
    Min, Weiqing
    Liu, Linhu
    Luo, Zhengdong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 265 - 276
  • [10] Multi-Scale Hierarchy Deep Feature Aggregation for Compact Image Representations
    Zhao, Zhenbing
    Xu, Guozhi
    Qi, Yincheng
    COMPUTER VISION - ACCV 2016 WORKSHOPS, PT III, 2017, 10118 : 557 - 571