Robust Document Image Dewarping Method using Text-lines and Line Segments

被引:22
|
作者
Kil, Taeho [1 ,2 ]
Seo, Wonkyo [1 ,2 ]
Koo, Hyung Il [3 ]
Cho, Nam Ik [1 ,2 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul, South Korea
[2] Seoul Natl Univ, INMC, Seoul, South Korea
[3] Ajou Univ, Dept Elect & Comp Engn, Suwon, South Korea
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017年
关键词
ALGORITHM;
D O I
10.1109/ICDAR.2017.146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional text-line based document dewarping methods have problems when handling complex layout and/or very few text-lines. When there are few aligned text-lines in the image, this usually means that photos, graphics and/or tables take large portion of the input instead. Hence, for the robust document dewarping, we propose to use line segments in the image in addition to the aligned text-lines. Based on the assumption and observation that many of the line segments in the image are horizontally or vertically aligned in the well-rectified images, we encode this property into the cost function in addition to the text-line alignment cost. By minimizing the function, we can obtain transformation parameters for camera pose, page curve, etc., which are used for document rectification. Considering that there are many outliers in line segment directions and missed text-lines in some cases, the overall algorithm is designed in an iterative manner. At each step, we remove text components and line segments that are not well aligned, and then minimize the cost function with the updated information. Experimental results show that the proposed method is robust to the variety of page layouts.
引用
收藏
页码:865 / 870
页数:6
相关论文
共 50 条
  • [31] A robust Hough transform technique for description of multiple line segments in an image
    Kamat, V
    Ganesan, S
    1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 216 - 220
  • [32] An effective method for text line segmentation in historical document images
    Tien-Nam Nguyen
    Burie, Jean-Christophe
    Thi-Lan Le
    Schweyer, Anne-Valerie
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1593 - 1599
  • [33] A Hybrid Method for Text Line Extraction in Handwritten Document Images
    Kiumarsi, Ehsan
    Alaei, Alireza
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 241 - 246
  • [34] A simple text/graphic separation method for document image segmentation
    Zirari, F.
    Ennaji, A.
    Nicolas, S.
    Mammass, D.
    2013 ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2013,
  • [35] State Estimation in a Document Image and Its Application in Text Block Identification and Text Line Extraction
    Koo, Hyung Il
    Cho, Nam Ik
    COMPUTER VISION-ECCV 2010, PT II, 2010, 6312 : 421 - +
  • [36] Document image de-warping based on detection of distorted text lines
    Mischke, L
    Luther, W
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2005, PROCEEDINGS, 2005, 3617 : 1068 - 1075
  • [37] TEXT LOCALIZATION USING IMAGE CUES AND TEXT LINE INFORMATION
    Toan Nguyen Dinh
    Park, Jonghyun
    Lee, Gueesang
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2261 - 2264
  • [38] A Robust Lane Detection Method Based on Vanishing Point Estimation Using the Relevance of Line Segments
    Yoo, Ju Han
    Lee, Seong-Whan
    Park, Sung-Kee
    Kim, Dong Hwan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (12) : 3254 - 3266
  • [39] Correcting bound document images based on automatic and robust curved text lines estimation
    Ma, Yichao
    Wang, Chunheng
    Dai, Ruwei
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 197 - +
  • [40] Extracting curved text lines using local linearity of the text line
    Hideaki Goto
    Hirotomo Aso
    International Journal on Document Analysis and Recognition, 1999, 2 (2-3) : 111 - 119