Adaptive dewarping of severely warped camera-captured document images based on document map generation

被引:1
|
作者
Nachappa, C. H. [1 ]
Rani, N. Shobha [1 ]
Pati, Peeta Basa [2 ]
Gokulnath, M. [3 ]
机构
[1] Amrita Vishwa Vidyapeetham, Amrita Sch Comp, Dept Comp Sci, Mysuru, India
[2] Amrita Vishwa Vidyapeetham, Amrita Sch Engn, Dept Comp Sci & Engn, Bengaluru, India
[3] Amrita Vishwa Vidyapeetham, Amrita Sch Phys Sci, Dept Sci, Mysuru, India
关键词
Dewarping; Document images; Smart innovations; Computer vision model; Control point generation;
D O I
10.1007/s10032-022-00425-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated dewarping of camera-captured handwritten documents is a challenging research problem in Computer Vision and Pattern Recognition. Most available systems assume the shape of the camera-captured image boundaries to be anywhere between trapezoidal and octahedral, with linear distortion in areas between the boundaries for dewarping. The majority of the state-of-the-art applications successfully dewarp the simple-to-medium range geometrical distortions with partial selection of control points by a user. The proposed work implements a fully automated technique for control point detection from simple-to-complex geometrical distortions in camera-captured document images. The input image is subject to preprocessing, corner point detection, document map generation, and rendering of the de-warped document image. The proposed algorithm has been tested on five different camera-captured document datasets (one internal and four external publicly available) consisting of 958 images. Both quantitative and qualitative evaluations have been performed to test the efficacy of the proposed system. On the quantitative front, an Intersection Over Union (IoU) score of 0.92, 0.88, and 0.80 for document map generation for low-, medium-, and high-complexity datasets, respectively. Additionally, accuracies of the recognized texts, obtained from a market leading OCR engine, are utilized for quantitative comparative analysis on document images before and after the proposed enhancement. Finally, the qualitative analysis visually establishes the system's reliability by demonstrating improved readability even for severely distorted image samples.
引用
收藏
页码:149 / 169
页数:21
相关论文
共 50 条
  • [1] Adaptive dewarping of severely warped camera-captured document images based on document map generation
    C. H. Nachappa
    N. Shobha Rani
    Peeta Basa Pati
    M. Gokulnath
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2023, 26 : 149 - 169
  • [2] Automatic dewarping of camera-captured comic document images
    Arpan Garai
    Arpita Dutta
    Samit Biswas
    [J]. Multimedia Tools and Applications, 2023, 82 : 1537 - 1552
  • [3] Automatic dewarping of camera-captured comic document images
    Garai, Arpan
    Dutta, Arpita
    Biswas, Samit
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (01) : 1537 - 1552
  • [4] Mosaicing of camera-captured document images
    Liang, Jian
    DeMenthon, Daniel
    Doermann, David
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2009, 113 (04) : 572 - 579
  • [5] Restoring camera-captured distorted document images
    Liu, Changsong
    Zhang, Yu
    Wang, Baokang
    Ding, Xiaoqing
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2015, 18 (02) : 111 - 124
  • [6] Geometric rectification of camera-captured document images
    Liang, Jian
    DeMenthon, Daniel
    Doermann, David
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (04) : 591 - 605
  • [7] Restoring camera-captured distorted document images
    Changsong Liu
    Yu Zhang
    Baokang Wang
    Xiaoqing Ding
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2015, 18 : 111 - 124
  • [8] Binarization of Camera-Captured Document using A MAP Approach
    Peng, Xujun
    Setlur, Srirangaraj
    Govindaraju, Venu
    Sitaram, Ramachandrula
    [J]. DOCUMENT RECOGNITION AND RETRIEVAL XVIII, 2011, 7874
  • [9] Robust perspective rectification of camera-captured document images
    Takezawa, Yusuke
    Hasegawa, Makoto
    Tabbone, Salvatore
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 6, 2017, : 27 - 32
  • [10] Appearance Enhancement for Camera-Captured Document Images in the Wild
    Zhang, Jiaxin
    Liang, Lingyu
    Ding, Kai
    Guo, Fengjun
    Jin, Lianwen
    [J]. IEEE Transactions on Artificial Intelligence, 2024, 5 (05): : 2319 - 2330