Document page segmentation and layout analysis using soft ordering

Cited by: 0
Authors
Mitchell, PE [1]
Yan, H [1]
Affiliations
[1] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia
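Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract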
This paper presents a novel algorithm for layout analysis of document images. A major component of this algorithm is an independent segmentation algorithm that identifies text and graphics regions. The segmentation algorithm first locates document patterns and then classifies them using run-length characteristics, spread analysis and adjacency relations. A key feature of the layout analysis algorithm is soft ordering, which provides a means of ordering regions in a more logical way and allows for some overlap between separate regions. This is very useful for processing documents that are slightly skewed or irregular in layout. The algorithm has been tested on many different documents and can successfully recognise single and multicolumn documents, even when the column format varies several times on one page. Furthermore, it can process documents with text tightly wrapped around graphics and documents that are slightly skewed.
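The abstract does not specify how soft ordering is computed, so the following is only a minimal sketch of one possible reading: an overlap-tolerant comparison of region bounding boxes. The `Region` type, the `tol` parameter and the functions `soft_above`, `soft_compare` and `soft_order` are hypothetical names introduced for illustration and are not taken from the paper.

```python
from dataclasses import dataclass
from functools import cmp_to_key


@dataclass(frozen=True)
class Region:
    """Axis-aligned bounding box of a segmented region (hypothetical type)."""
    name: str
    left: float
    top: float
    right: float
    bottom: float


def soft_above(a: Region, b: Region, tol: float) -> bool:
    """True if a ends above the start of b, allowing up to tol units of overlap."""
    return a.bottom <= b.top + tol


def soft_compare(a: Region, b: Region, tol: float) -> int:
    """Order regions top to bottom, tolerating small vertical overlaps.

    Regions that share a vertical band are ordered left to right.
    """
    a_first = soft_above(a, b, tol)
    b_first = soft_above(b, a, tol)
    if a_first and not b_first:
        return -1
    if b_first and not a_first:
        return 1
    return (a.left > b.left) - (a.left < b.left)


def soft_order(regions, tol=5.0):
    """Return the regions sorted with the overlap-tolerant comparison."""
    return sorted(regions, key=cmp_to_key(lambda a, b: soft_compare(a, b, tol)))


# Assumed example: a heading whose box overlaps the left column by 2 units,
# as might happen on a slightly skewed page.
page = [
    Region("right", 60, 22, 110, 100),
    Region("left", 0, 18, 50, 100),
    Region("heading", 10, 0, 100, 20),
]

print([r.name for r in soft_order(page, tol=5.0)])  # ['heading', 'left', 'right']
print([r.name for r in soft_order(page, tol=0.0)])  # ['left', 'heading', 'right']
```

With a tolerance of 5 units the heading is still read before the columns despite the small overlap. With zero tolerance the overlap makes the heading and the left column look like neighbours in the same band, so the left column is placed first.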
Pages: 458-461
Page count: 4