Junction-based table detection in camera-captured document images

Authors
Wonkyo Seo
Hyung Il Koo
Nam Ik Cho
Affiliations
[1] Seoul National University, Department of Electrical and Computer Engineering and INMC
[2] Ajou University, Department of Electrical and Computer Engineering
Source
International Journal on Document Analysis and Recognition (IJDAR) | 2015 / Vol. 18
Keywords
Camera-based document analysis and recognition (CBDAR); Table detection; Junction detection; Table understanding
DOI
Not available
Abstract
In this paper, we present a method that locates tables and their cells in camera-captured document images. To handle the geometric and photometric distortions in such images, we develop new junction detection and labeling methods: junction detection finds candidates for the corners of cells, and junction labeling infers their connectivity. We treat junctions as intersections of curves, so we first develop a multiple curve detection algorithm. After junction detection, we encode the connectivity information between junctions (including false detections) into 12 labels and design a cost function that reflects pairwise relationships as well as local observations. The cost function is minimized with the belief propagation algorithm, and tables and their cells are located from the inferred labels. To handle multiple tables on a single page, we also propose a table area detection method. It is based on the well-known recursive X-Y cut, which we modify to handle the curved seams caused by geometric distortions. To evaluate our method, we build a data set containing a variety of camera-captured table images and make it publicly available. Experimental results on this data set show that our method successfully locates tables and their cells in camera-captured images.
Pages: 47-57
Page count: 10