Adaptive layout analysis of document images

Cited by: 0
Authors
Malerba, D [1]
Esposito, F [1]
Altamura, O [1]
Affiliation
[1] Univ Bari, Dipartimento Informat, I-70126 Bari, Italy
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Layout analysis is the process of extracting a hierarchical structure describing the layout of a page. In the document processing system WISDOM++, layout analysis is performed in two steps: first, the global analysis determines possible areas containing paragraphs, sections, columns, figures, and tables; second, the local analysis groups together blocks that possibly fall within the same area. The result of the local analysis depends strongly on the quality of the results of the global analysis. In this paper we investigate how to support the user in correcting the results of the global analysis: the user corrects those results, and rules for layout correction are then learned from the sequence of user actions. Experimental results on a set of multi-page documents are reported.
Pages: 526-534
Number of pages: 9
Related papers
50 in total
  • [1] Layout analysis of Urdu document images
    Shafait, Faisal
    Adnan-ul-Hasan
    Keysers, Daniel
    Breuel, Thomas M.
    [J]. 10TH IEEE INTERNATIONAL MULTITOPIC CONFERENCE 2006, PROCEEDINGS, 2006, : 293 - +
  • [2] Adaptive document layout
    Jacobs, C
    Li, W
    Schrier, E
    Bargeron, D
    Salesin, D
    [J]. COMMUNICATIONS OF THE ACM, 2004, 47 (08) : 60 - 66
  • [3] Open Evaluation Tool for Layout Analysis of Document Images
    Alberti, Michele
    Bouillon, Manuel
    Ingold, Rolf
    Liwicki, Marcus
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 4, 2017, : 43 - 47
  • [4] Logical Labeling of document images using layout graph matching with adaptive learning
    Liang, J
    Doermann, D
    [J]. DOCUMENT ANALYSIS SYSTEM V, PROCEEDINGS, 2002, 2423 : 224 - 235
  • [5] High Performance Layout Analysis of Medieval European Document Images
    Bukhari, Syed Saqib
    Gupta, Ashutosh
    Tiwari, Anil Kumar
    Dengel, Andreas
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM 2018), 2018, : 324 - 331
  • [6] Word spotting in Chinese document images without layout analysis
    Lu, Y
    Tan, CL
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 57 - 60
  • [7] High Performance Layout Analysis of Arabic and Urdu Document Images
    Bukhari, Syed Saqib
    Shafait, Faisal
    Breuel, Thomas M.
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1275 - 1279
  • [8] Chinese document layout analysis using an adaptive regrouping strategy
    Chang, F
    Chu, SY
    Chen, CY
    [J]. PATTERN RECOGNITION, 2005, 38 (02) : 261 - 271
  • [9] Layout Analysis for Arabic Historical Document Images Using Machine Learning
    Bukhari, Syed Saqib
    Breuel, Thomas M.
    Asi, Abedelkadir
    El-Sana, Jihad
    [J]. 13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 639 - 644
  • [10] A Chinese Document Layout Analysis Based on Non-text Images
    Fu Xiaoling
    Li Xiaofeng
    [J]. 2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2009, : 326 - 328