Unified layout analysis and text localization framework

被引:4
|
作者
Vasilopoulos, Nikos [1 ]
Kavallieratou, Ergina [1 ]
机构
[1] Univ Aegean, Dept Informat & Commun Syst Engn, Samos, Greece
关键词
document images; page layout analysis; text localization; PAGE SEGMENTATION; IMAGES; COMPETITION; EXTRACTION; IDENTIFICATION; RECOGNITION; CHARACTERS; ALGORITHM;
D O I
10.1117/1.JEI.26.1.013009
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A technique appropriate for extracting textual information from documents with complex layouts, such as newspapers and journals, is presented. It is a combination of a foreground analysis and a text localization method. The first one is used to segment the page in text and nontext blocks, whereas the second one is used to detect text that may be embedded inside images, charts, diagrams, tables, etc. Detailed experiments on two public databases showed that mixing layout analysis and text localization techniques can lead to improved page segmentation and text extraction results. (C) 2017 SPIE and IS&T
引用
收藏
页数:11
相关论文
共 50 条
  • [1] DRFN: A unified framework for complex document layout analysis
    Wu, Xingjiao
    Ma, Tianlong
    Du, Xiangcheng
    Hu, Ziling
    Yang, Jing
    He, Liang
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [2] A unified framework for text analysis in Chinese TTS
    Fu, Guohong
    Zhang, Min
    Zhou, GuoDong
    Luke, Kang-Kwong
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 200 - +
  • [3] Unified HMM-based layout analysis framework and algorithm
    Chen, M
    Ding, XQ
    Wu, YS
    SCIENCE IN CHINA SERIES F-INFORMATION SCIENCES, 2003, 46 (06): : 401 - 408
  • [4] A Unified Framework for Layout Pattern Analysis With Deep Causal Estimation
    Chen, Ran
    Hu, Shoubo
    Chen, Zhitang
    Zhu, Shengyu
    Yu, Bei
    Li, Pengyun
    Chen, Cheng
    Huang, Yu
    Hao, Jianye
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (04) : 1199 - 1211
  • [5] Unified HMM-based layout analysis framework and algorithm
    Ming Chen
    Xiaoqing Ding
    Youshou Wu
    Science in China Series F: Information Sciences, 2003, 46 : 401 - 408
  • [6] A Unified Framework for Layout Pattern Analysis with Deep Causal Estimation
    Chen, Ran
    Hu, Shoubo
    Chen, Zhitang
    Zhu, Shengyu
    Yu, Bei
    Li, Pengyun
    Chen, Cheng
    Huang, Yu
    Hao, Jianye
    2021 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN (ICCAD), 2021,
  • [7] Unified HMM-based layout analysis framework and algorithm
    陈明
    丁晓青
    吴佑寿
    ScienceinChina(SeriesF:InformationSciences), 2003, (06) : 401 - 408
  • [8] Towards End-to-End Unified Scene Text Detection and Layout Analysis
    Long, Shangbang
    Qin, Siyang
    Panteleev, Dmitry
    Bissacco, Alessandro
    Fujii, Yasuhisa
    Raptis, Michalis
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1039 - 1049
  • [9] VSR: A Unified Framework for Document Layout Analysis Combining Vision, Semantics and Relations
    Zhang, Peng
    Li, Can
    Qiao, Liang
    Cheng, Zhanzhan
    Pu, Shiliang
    Niu, Yi
    Wu, Fei
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I, 2021, 12821 : 115 - 130
  • [10] Sequence as a Whole: A Unified Framework for Video Action Localization With Long-Range Text Query
    Su, Yuting
    Wang, Weikang
    Liu, Jing
    Ma, Shuang
    Yang, Xiaokang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1403 - 1418