Unified layout analysis and text localization framework

被引:4
|
作者
Vasilopoulos, Nikos [1 ]
Kavallieratou, Ergina [1 ]
机构
[1] Univ Aegean, Dept Informat & Commun Syst Engn, Samos, Greece
关键词
document images; page layout analysis; text localization; PAGE SEGMENTATION; IMAGES; COMPETITION; EXTRACTION; IDENTIFICATION; RECOGNITION; CHARACTERS; ALGORITHM;
D O I
10.1117/1.JEI.26.1.013009
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A technique appropriate for extracting textual information from documents with complex layouts, such as newspapers and journals, is presented. It is a combination of a foreground analysis and a text localization method. The first one is used to segment the page in text and nontext blocks, whereas the second one is used to detect text that may be embedded inside images, charts, diagrams, tables, etc. Detailed experiments on two public databases showed that mixing layout analysis and text localization techniques can lead to improved page segmentation and text extraction results. (C) 2017 SPIE and IS&T
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Ensemble Block Co-clustering: A Unified Framework for Text Data
    Affeldt, Severine
    Labiod, Lazhar
    Nadif, Mohamed
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 5 - 14
  • [42] A unified framework for Enterprise Architecture analysis
    Langermeier, Melanie
    Saad, Christian
    Bauer, Bernhard
    2014 IEEE 18TH INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE WORKSHOPS AND DEMONSTRATIONS (EDOCW), 2014, : 227 - 236
  • [43] A unified framework for web link analysis
    Chen, Z
    Tao, L
    Wang, JD
    Wenyin, L
    Ma, WY
    WISE 2002: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, 2002, : 63 - 70
  • [44] A Framework for Logic-Aware Layout Analysis
    Gibson, Patrick
    Lu, Ziyang
    Pikus, Fedor
    Srinivasan, Sridhar
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2010), 2010, : 171 - 175
  • [45] A unified analysis framework for tensor metasurfaces
    Zhu, Bo O.
    Xiong, Xiaoyan Y. Z.
    Jiang, Li Jun
    JOURNAL OF OPTICS, 2018, 20 (08)
  • [46] A UNIFIED APPROACH TO LAYOUT WIRABILITY
    LIPSKI, W
    PREPARATA, FP
    MATHEMATICAL SYSTEMS THEORY, 1987, 19 (03): : 189 - 203
  • [47] Separation of Text and Non-text in Document Layout Analysis using a Recursive Filter
    Tuan-Anh Tran
    Na, In-Seop
    Kim, Soo-Hyung
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2015, 9 (10): : 4072 - 4091
  • [48] Readers as text designers: Personalizing the layout of text
    Hartley, J
    INNOVATIONS IN EDUCATION AND TRAINING INTERNATIONAL, 1999, 36 (04): : 346 - 350
  • [49] ANALYSIS OF TEXT LAYOUT QUALITY USING WEARABLE EYE TRACKERS
    Chanijani, Seyyed Saleh Mozafari
    Bukhari, Syed Saqib
    Dengel, Andreas
    2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), 2015,
  • [50] Layout Analysis and Text Column Segmentation for Historical Vietnamese Steles
    Scius-Bertrand, Anna
    Voegtlin, Lars
    Alberti, Michele
    Fischer, Andreas
    Bui, Marc
    PROCEEDINGS OF THE 2019 WORKSHOP ON HISTORICAL DOCUMENT IMAGING AND PROCESSING (HIP' 19), 2019, : 84 - 89