Robust Unsupervised Segmentation of Degraded Document Images with Topic Models

被引:0
|
作者
Burns, Timothy J. [1 ]
Corso, Jason J. [1 ]
机构
[1] SUNY Buffalo, Buffalo, NY 14260 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Segmentation of document images remains a challenging vision problem. Although document images have a structured layout, capturing enough of it for segmentation can be difficult. Most current methods combine text extraction and heuristics for segmentation, but text extraction is prone to failure and measuring accuracy remains a difficult challenge. Furthermore, when presented with significant degradation many common heuristic methods fall apart. In this paper, we propose a Bayesian generative model for document images which seeks to overcome some of these drawbacks. Our model automatically discovers different regions present in a document image in a completely unsupervised fashion. We attempt no text extraction, but rather use discrete patch-based codebook learning to make our probabilistic representation feasible. Each latent region topic is a distribution over these patch indices. We capture rough document layout with an M R F Potts model. We take an analysis-by-synthesis approach to examine the model, and provide quantitative segmentation results on a manually-labeled document image data set. We illustrate our model's robustness by providing results on a highly degraded version of our test set.
引用
收藏
页码:1287 / 1294
页数:8
相关论文
共 50 条
  • [1] An Unsupervised and Robust Line and Word Segmentation Method for Handwritten and Degraded Printed Document
    Mukherjee, Jayati
    Parui, Swapan K.
    Roy, Utpal
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (02)
  • [2] YOLO Assisted A* Algorithm for Robust Line Segmentation of Degraded Document Images
    Kundu, Ahana
    Bhattacharya, Ujjwal
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT IV, 2024, 14807 : 407 - 424
  • [3] Text segmentation in degraded historical document images
    Kavitha, A. S.
    Shivakumara, P.
    Kumar, G. H.
    Lu, Tong
    EGYPTIAN INFORMATICS JOURNAL, 2016, 17 (02) : 189 - 197
  • [4] Robust Document Image Binarization Technique for Degraded Document Images
    Su, Bolan
    Lu, Shijian
    Tan, Chew Lim
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (04) : 1408 - 1417
  • [5] Robust Binarization of Degraded Document Images Using Heuristics
    Parker, Jon
    Frieder, Ophir
    Frieder, Gideon
    DOCUMENT RECOGNITION AND RETRIEVAL XXI, 2014, 9021
  • [6] Robust Math Formula Recognition in Degraded Chinese Document Images
    Liu, Ning
    Zhang, Dongxiang
    Xu, Xing
    Guo, Long
    Chen, Lijiang
    Liu, Wenju
    Ke, Dengfeng
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 113 - 118
  • [7] Adaptive Thresholding to Robust Image Binarization for Degraded Document Images
    Ingle, Prashant Devidas
    Kaur, Parminder
    2017 1ST INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND INFORMATION MANAGEMENT (ICISIM), 2017, : 189 - 193
  • [8] Markov Chains for unsupervised segmentation of degraded NIR iris images for person recognition
    Yahiaoui, Meriem
    Monfrini, Emmanuel
    Dorizzi, Bernadette
    PATTERN RECOGNITION LETTERS, 2016, 82 : 116 - 123
  • [9] Robust unsupervised texture segmentation for motion analysis in ultrasound images
    Brignol, Arnaud
    Cheriet, Farida
    Aubin-Fournier, Jean-Francois
    Fortin, Carole
    Laporte, Catherine
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2025, 20 (01) : 97 - 106
  • [10] Unsupervised Document Classification and Topic Detection
    Novotny, Jaromir
    Ircing, Pavel
    SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 748 - 756