A fast algorithm for bottom-up document layout analysis

被引：85

作者：

Simon, A

Pret, JC

Johnson, AP

机构：

[1] Institute for Computer Applications in Molecular Sciences, School of Chemistry, University of Leeds, Leeds

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 1997年 / 19卷 / 03期

关键词：

document analysis; physical page layout; bottom-up layout analysis; Kruskal's algorithm; spanning tree; chemical documents;

D O I：

10.1109/34.584106

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes a new bottom-up method for document layout analysis. The algorithm was implemented in the GLIDE (Chemical Literature Data Extraction) system (http://chem.leeds.ac.uk/ICAMS/CLiDE.html) but the method described here is suitable for a broader range of documents. It is based on Kruskal's algorithm and uses a special distance-metric between the components to construct the physical page structure. The method has all the major advantages of bottom-up systems: independence from different text spacing and independence from different block alignments. The algorithms computational complexity is reduced to linear by using heuristics and path-compression.

引用

页码：273 / 277

页数：5

共 50 条

[41] BOTTOM-UP THE SYSTEM
STAPLES, L
SOCIAL POLICY, 1989, 19 (04) : 34 - 39
[42] Bottom-up communication
Milani, Myrna
CANADIAN VETERINARY JOURNAL-REVUE VETERINAIRE CANADIENNE, 2010, 51 (10): : 1163 - 1164
[43] Bottom-up nanoelectronics
Hadley, P
34TH EUROPEAN MICROWAVE CONFERENCE, VOLS 1-3, CONFERENCE PROCEEDINGS, 2004, : 141 - 145
[44] Bottom-up economics
不详
HARVARD BUSINESS REVIEW, 2003, 81 (08) : 18 - +
[45] Bottom-up Conservation
Sodhi, Navjot S.
Butler, Rhett
Raven, Peter H.
BIOTROPICA, 2011, 43 (05) : 521 - 523
[46] Bottom-Up Management
Freeman, Ruth
PERSONNEL PSYCHOLOGY, 1950, 3 (02) : 236 - 237
[47] BOTTOM-UP DDP
YASAKI, EK
DATAMATION, 1983, 29 (04): : 131 - 132
[48] ''Bottom-up'' bioremediation
不详
NATURE BIOTECHNOLOGY, 1997, 15 (05) : 393 - 393
[49] “Bottom-up” bioremediation
Nature Biotechnology, 1997, 15 (5) : 393 - 393
[50] Bottom-up innovation
Sharkey, Noel
NATURE, 2014, 516 (7529) : 36 - 36

← 1 2 3 4 5 →