Robust Document Image Dewarping Method using Text-lines and Line Segments

被引:22
|
作者
Kil, Taeho [1 ,2 ]
Seo, Wonkyo [1 ,2 ]
Koo, Hyung Il [3 ]
Cho, Nam Ik [1 ,2 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul, South Korea
[2] Seoul Natl Univ, INMC, Seoul, South Korea
[3] Ajou Univ, Dept Elect & Comp Engn, Suwon, South Korea
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017年
关键词
ALGORITHM;
D O I
10.1109/ICDAR.2017.146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional text-line based document dewarping methods have problems when handling complex layout and/or very few text-lines. When there are few aligned text-lines in the image, this usually means that photos, graphics and/or tables take large portion of the input instead. Hence, for the robust document dewarping, we propose to use line segments in the image in addition to the aligned text-lines. Based on the assumption and observation that many of the line segments in the image are horizontally or vertically aligned in the well-rectified images, we encode this property into the cost function in addition to the text-line alignment cost. By minimizing the function, we can obtain transformation parameters for camera pose, page curve, etc., which are used for document rectification. Considering that there are many outliers in line segment directions and missed text-lines in some cases, the overall algorithm is designed in an iterative manner. At each step, we remove text components and line segments that are not well aligned, and then minimize the cost function with the updated information. Experimental results show that the proposed method is robust to the variety of page layouts.
引用
收藏
页码:865 / 870
页数:6
相关论文
共 50 条
  • [1] Document image dewarping using robust estimation of curled text lines
    Ulges, A
    Lampert, CH
    Breuel, TM
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1001 - 1005
  • [2] Foreground and Text-lines Aware Document Image Rectification
    Li, Heng
    Wu, Xiangping
    Chen, Qingcai
    Xiang, Qianjin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 19517 - 19526
  • [3] Document Image Dewarping Based on Text Line Detection and Surface Modeling
    Shamgholi, M.
    Khosravi, H.
    Riazi, S. M.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2014, 27 (12): : 1855 - 1862
  • [4] Review on Extraction Techniques for Images, Text-lines and Keywords from Document Image
    Bagadkar, Sneha L.
    Malik, L. G.
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 1091 - 1093
  • [5] An Efficient Algorithm for Segmenting Warped Text-lines in Document Images
    Oliveira, Daniel
    Lins, Rafael
    Torreao, Gabriel
    Fan, Jian
    Thielo, Marcelo
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 250 - 254
  • [6] Document dewarping via text-line based optimization
    Kim, Beom Su
    Koo, Hyung Il
    Cho, Nam Ik
    PATTERN RECOGNITION, 2015, 48 (11) : 3600 - 3614
  • [7] Document image dewarping based on line estimation for visually impaired
    Kakumanu, P.
    Bourbakis, N.
    Black, J.
    Panchanathan, S.
    ICTAI-2006: EIGHTEENTH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, : 625 - +
  • [8] Document Image Dewarping using Deep Learning
    Ramanna, Vijaya
    Bukhari, Saqib
    Dengel, Andreas
    ICPRAM: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2019, : 524 - 531
  • [9] Document Image Dewarping using Kinect Depth Sensor
    Ghods, Amir Reza
    Mozaffari, Saeed
    Ahmadpanahi, Farhad
    2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,
  • [10] Semi-supervised Learning For Detecting Text-lines in Noisy Document Images
    Liu, Zongyi
    Zhou, Hanning
    DOCUMENT RECOGNITION AND RETRIEVAL XVII, 2010, 7534