Page segmentation using the description of the background

被引:51
|
作者
Antonacopoulos, A [1 ]
机构
[1] Univ Liverpool, Dept Comp Sci, Liverpool L69 7ZF, Merseyside, England
关键词
D O I
10.1006/cviu.1998.0691
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is an ever increasing number of publications which do not have the "traditional" layout where printed regions are rectangular. Text paragraphs and areas of graphic type may be of any shape, individually rotated and in any arrangement. Previous document analysis techniques are not well suited to such complex layouts. This paper introduces a new method for the segmentation of images of document pages having both traditional and complex layouts. The underlining idea is to efficiently produce a flexible description (by means of tiles) of the background space which surrounds the printed regions in the page image under all the above conditions. Using this description of space, the contours of printed regions are identified with significant accuracy. The new approach is fast as there is no need for skew detection and correction, and only few simple operations are performed on the description of the background (not on the pixel-based data). (C) 1998 Academic Press.
引用
收藏
页码:350 / 369
页数:20
相关论文
共 50 条
  • [31] Page segmentation for content sequence
    Watcharabutsarakham, Sarin
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 925 - 928
  • [32] Arabic newspaper page segmentation
    Hadjar, K
    Ingold, R
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 895 - 899
  • [33] Web Page Segmentation Evaluation
    Sanoja, Andres
    Gancarski, Stephane
    30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 753 - 760
  • [34] A multiresolution approach for page segmentation
    Cinque, L
    Lombardi, L
    Manzini, G
    PATTERN RECOGNITION LETTERS, 1998, 19 (02) : 217 - 225
  • [35] Benchmarking of document page segmentation
    Agne, S
    Rogger, M
    Rohrschneider, J
    DOCUMENT RECOGNITION AND RETRIEVAL VII, 2000, 3967 : 165 - 171
  • [36] Approach to page segmentation and classification
    Wang, Shuhua
    Cao, Yang
    Li, Zuo
    Cai, Shijie
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2002, 14 (01): : 17 - 20
  • [37] STANDARD PAGE DESCRIPTION LANGUAGE
    ROBINSON, PJ
    STRASEN, SM
    COMPUTER COMMUNICATIONS, 1989, 12 (02) : 85 - 92
  • [38] PostScript® - A page description language
    Geschke, Charles M.
    1600, De Gruyter Oldenbourg (28):
  • [39] Automatic video segmentation using a novel background model
    Lu, Y
    Gao, W
    Wu, F
    2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL III, PROCEEDINGS, 2002, : 807 - 810
  • [40] Page Segmentation for Historical Handwritten Documents Using Fully Convolutional Networks
    Xu, Yue
    He, Wenhao
    Yin, Fei
    Liu, Cheng-Lin
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 541 - 546