Improved Document Image Segmentation Algorithm using Multiresolution Morphology

Cited by: 14
Authors
Bukhari, Syed Saqib [1]
Shafait, Faisal [2]
Breuel, Thomas M. [1]
Affiliations
[1] Tech Univ Kaiserslautern, Kaiserslautern, Germany
[2] German Res Ctr Artificial Intelligence DFKI, Kaiserslautern, Germany
Source
Keywords
Page Segmentation; Text/Non-Text Segmentation; Multiresolution Morphology
DOI
10.1117/12.873461
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Page segmentation into text and non-text elements is an essential preprocessing step before optical character recognition (OCR). When segmentation is poor, an OCR classification engine produces garbage characters because non-text elements are present. This paper describes modifications to the text/non-text segmentation algorithm presented by Bloomberg (1), which is also available in his open-source Leptonica library (2). The modifications yield significantly better segmentation accuracy than the original algorithm on the UW-III, UNLV, and ICDAR 2009 page segmentation competition test images and on circuit diagram datasets.
Pages: 8
Related papers
50 in total
  • [1] Old document image segmentation using the autocorrelation function and multiresolution analysis
    Mehri, Maroua
    Gomez-Kraemer, Petra
    Heroux, Pierre
    Mullot, Remy
    [J]. DOCUMENT RECOGNITION AND RETRIEVAL XX, 2013, 8658
  • [2] A new multiresolution algorithm for image segmentation
    Saeed, M
    Karl, WC
    Nguyen, TQ
    Rabiee, HR
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 2753 - 2756
  • [3] Multiresolution algorithm for image segmentation using MRMRF with edge information
    Liu, Guoying
    Guotao
    Liu, Guoying
    [J]. 2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010,
  • [4] A region-based image fusion algorithm using multiresolution segmentation
    Li, ZH
    Jing, ZL
    Liu, G
    Sun, SY
    Leung, H
    [J]. 2003 IEEE INTELLIGENT TRANSPORTATION SYSTEMS PROCEEDINGS, VOLS. 1 & 2, 2003, : 96 - 101
  • [5] Document Image Segmentation using Averaging Filtering and Mathematical Morphology
    Polyakova, Marina
    Ishchenko, Alesya
    Huliaieva, Natallia
    [J]. 2018 14TH INTERNATIONAL CONFERENCE ON ADVANCED TRENDS IN RADIOELECTRONICS, TELECOMMUNICATIONS AND COMPUTER ENGINEERING (TCSET), 2018, : 966 - 969
  • [6] Image Segmentation Using an Improved Watershed Algorithm
    郭礼华
    李建华
    杨树堂
    陆松年
    [J]. Journal of Shanghai Jiaotong University(Science), 2004, (02) : 16 - 19
  • [7] Image segmentation using an improved differential algorithm
    Gao, Hao
    Shi, Yujiao
    Wu, Dongmei
    [J]. OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY III, 2014, 9273
  • [8] iDocChip: A Configurable Hardware Architecture for Historical Document Image Processing Multiresolution Morphology-based Text and Image Segmentation
    Tekleyohannes, Menbere Kina
    Rybalkin, Vladimir
    Ghaffar, Muhammad Mohsin
    Varela, Javier Alejandro
    Wehn, Norbert
    Dengel, Andreas
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2021, 49 (02) : 253 - 284
  • [9] Scanned color document image segmentation using the EM algorithm
    Handley, John C.
    [J]. ICIS '06: INTERNATIONAL CONGRESS OF IMAGING SCIENCE, FINAL PROGRAM AND PROCEEDINGS: LINKING THE EXPLOSION OF IMAGING APPLICATIONS WITH THE SCIENCE AND TECHNOLOGY OF IMAGING, 2006, : 675 - 678
  • [10] An Improved Algorithm for Image Segmentation
    Wu, Weiwen
    Wang, Zhiyan
    Lin, Zhengchun
    [J]. 2011 INTERNATIONAL CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND AUTOMATION (CCCA 2011), VOL III, 2010, : 309 - 312