A Chinese Document Layout Analysis Based on Non-text Images

被引:1
|
作者
Fu Xiaoling [1 ]
Li Xiaofeng [1 ]
机构
[1] N China Univ Informat Engn NCUT, Multimedia Technol Lab, Beijing 100144, Peoples R China
关键词
layout analysis; projection; connective region; threshold;
D O I
10.1109/IFCSTA.2009.85
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the paper as the medium of electronic information, traditional books, magazines, newspapers, etc are scanned into the images,and changed into electronic documents through OCR(optical character recognition) technology,layout analysis as an important part of OCR has played a greater role. This paper presents a Chinese document layout analysis based on non-text images, solve the deformed image of the issue of text extraction, and there is great value in practice.
引用
收藏
页码:326 / 328
页数:3
相关论文
共 50 条
  • [31] A recurrent neural network based deep learning model for text and non-text stroke classification in online handwritten Devanagari document
    Rajib Ghosh
    Multimedia Tools and Applications, 2022, 81 : 24245 - 24263
  • [32] Text detection method in document images based on multiresolution analysis
    Lee, Geum-Boon
    Shin, Dong-Guk
    Cho, Beom-Joon
    WMSCI 2007 : 11TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, POST CONFERENCE ISSUE, PROCEEDINGS, 2007, : 200 - +
  • [33] Text Classification and Document Layout Analysis of Paper Fragments
    Diem, Markus
    Kleber, Florian
    Sablatnig, Robert
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 854 - 858
  • [34] A novel OCR approach based on document layout analysis and text block classification
    Zhu, Weiheng
    Liu, Yuanfeng
    Hao, Liang
    PROCEEDINGS OF 2016 12TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2016, : 91 - 94
  • [35] INDEXING AND RETRIEVAL OF NON-TEXT INFORMATION
    Vermeij, Hermine
    CATALOGING & CLASSIFICATION QUARTERLY, 2013, 51 (08) : 945 - 946
  • [36] Distance Transform-Based Stroke Feature Descriptor for Text Non-text Classification
    Khan, Tauseef
    Mollah, Ayatullah Faruk
    RECENT DEVELOPMENTS IN MACHINE LEARNING AND DATA ANALYTICS, 2019, 740 : 189 - 200
  • [37] Segmentation of Text and Non-text in On-Line Handwritten Patient Record Based on Spatio-Temporal Analysis
    Waranusast, Rattapoom
    Haddawy, Peter
    Dailey, Matthew
    ARTIFICIAL INTELLIGENCE IN MEDICINE, PROCEEDINGS, 2009, 5651 : 345 - 354
  • [38] Text/Image Region Separation for Document Layout Detection of Old Document Images using Non-linear Diffusion and Level Set
    Kumar, Sachin S.
    Rajendran, Parvathy
    Prabaharan, P.
    Soman, K. P.
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATIONS, 2016, 93 : 469 - 477
  • [39] Open Evaluation Tool for Layout Analysis of Document Images
    Alberti, Michele
    Bouillon, Manuel
    Ingold, Rolf
    Liwicki, Marcus
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 4, 2017, : 43 - 47
  • [40] Retrieval of document images based on page layout similarity
    Naveen
    Guru, D. S.
    ADAPTIVE MULTIMEDIA RETRIEVAL: USER, CONTEXT, AND FEEDBACK, 2007, 4398 : 136 - +