Robust Document Image Dewarping Method using Text-lines and Line Segments

被引:22
|
作者
Kil, Taeho [1 ,2 ]
Seo, Wonkyo [1 ,2 ]
Koo, Hyung Il [3 ]
Cho, Nam Ik [1 ,2 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul, South Korea
[2] Seoul Natl Univ, INMC, Seoul, South Korea
[3] Ajou Univ, Dept Elect & Comp Engn, Suwon, South Korea
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017年
关键词
ALGORITHM;
D O I
10.1109/ICDAR.2017.146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional text-line based document dewarping methods have problems when handling complex layout and/or very few text-lines. When there are few aligned text-lines in the image, this usually means that photos, graphics and/or tables take large portion of the input instead. Hence, for the robust document dewarping, we propose to use line segments in the image in addition to the aligned text-lines. Based on the assumption and observation that many of the line segments in the image are horizontally or vertically aligned in the well-rectified images, we encode this property into the cost function in addition to the text-line alignment cost. By minimizing the function, we can obtain transformation parameters for camera pose, page curve, etc., which are used for document rectification. Considering that there are many outliers in line segment directions and missed text-lines in some cases, the overall algorithm is designed in an iterative manner. At each step, we remove text components and line segments that are not well aligned, and then minimize the cost function with the updated information. Experimental results show that the proposed method is robust to the variety of page layouts.
引用
收藏
页码:865 / 870
页数:6
相关论文
共 50 条
  • [21] Handwritten document image segmentation into text lines and words
    Papavassiliou, Vassilis
    Stafylakis, Themos
    Katsouros, Vassilis
    Carayannis, George
    PATTERN RECOGNITION, 2010, 43 (01) : 369 - 377
  • [22] Experimental application of a Japanese historical document image synthesis method to text line segmentation
    Inuzuka, Naoto
    Suzuki, Tetsuya
    ICPRAM 2021 - Proceedings of the 10th International Conference on Pattern Recognition Applications and Methods, 2021, : 628 - 634
  • [23] Experimental Application of a Japanese Historical Document Image Synthesis Method to Text Line Segmentation
    Inuzuka, Naoto
    Suzuki, Tetsuya
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 628 - 634
  • [24] Robust scene matching using line segments
    Chen, JY
    Guo, ZM
    Tan, P
    Goh, T
    ELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY III, 2002, 4925 : 46 - 54
  • [25] Robust and efficient map-to-image registration with line segments
    Wolfgang Krüger
    Machine Vision and Applications, 2001, 13 : 38 - 50
  • [26] Robust and efficient map-to-image registration with line segments
    Krüger, W
    MACHINE VISION AND APPLICATIONS, 2001, 13 (01) : 38 - 50
  • [27] Robust Stereo Matching for Document Images Using Parameter Selection of Text-Line Extraction
    Afzal, Muhammad Zeshan
    Bukhari, Syed Saqib
    Kraemer, Martin
    Shafait, Faisal
    Breuel, Thomas M.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 331 - 334
  • [28] A Text Line Extraction Method for Archival Document Transcription
    Mechi, Olfa
    Mehri, Maroua
    Ingold, Rolf
    Ben Amara, Najoua Essoukri
    PROCEEDINGS OF THE 2020 17TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD 2020), 2020, : 479 - 484
  • [29] Correcting document image warping based on regression of curved text lines
    Zhang, Z
    Tan, CL
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 589 - 593
  • [30] TEXT LINE DETECTION IN MULTICOLUMN FOR INDIAN SCRIPTS USING HISTOGRAM: A DOCUMENT IMAGE ANALYSIS APPLICATION
    Kumar, Umesh
    Raheja, Jagdish
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2009), VOLS 1 AND 2, 2009, : 161 - 168