A Chinese Document Layout Analysis Based on Non-text Images

被引:1
|
作者
Fu Xiaoling [1 ]
Li Xiaofeng [1 ]
机构
[1] N China Univ Informat Engn NCUT, Multimedia Technol Lab, Beijing 100144, Peoples R China
关键词
layout analysis; projection; connective region; threshold;
D O I
10.1109/IFCSTA.2009.85
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the paper as the medium of electronic information, traditional books, magazines, newspapers, etc are scanned into the images,and changed into electronic documents through OCR(optical character recognition) technology,layout analysis as an important part of OCR has played a greater role. This paper presents a Chinese document layout analysis based on non-text images, solve the deformed image of the issue of text extraction, and there is great value in practice.
引用
收藏
页码:326 / 328
页数:3
相关论文
共 50 条
  • [21] Readability of Non-Text Images on the World Wide Web (WWW)
    Elahi, Ehsan
    Iglesias, Ana
    Morato, Jorge
    IEEE ACCESS, 2022, 10 : 116627 - 116634
  • [22] Adaptive layout analysis of document images
    Malerba, D
    Esposito, F
    Altamura, O
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2002, 2366 : 526 - 534
  • [23] Layout analysis of urdu document images
    Shafait, Faisal
    Adnan-ul-Hasan
    Keysers, Daniel
    Breuel, Thomas M.
    10TH IEEE INTERNATIONAL MULTITOPIC CONFERENCE 2006, PROCEEDINGS, 2006, : 293 - +
  • [24] Text and Non-text Segmentation based on Connected Component Features
    Viet Phuong Le
    Nayef, Nibal
    Visani, Muriel
    Ogier, Jean-Marc
    Cao De Tran
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 1096 - 1100
  • [25] Multi-script text versus non-text classification of regions in scene images
    Sriman, Bowornrat
    Schomaker, Lambert
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 : 23 - 42
  • [26] Text non-text classification based on area occupancy of equidistant pixels
    Khan, Tauseef
    Mollah, Ayatullah Faruk
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1889 - 1900
  • [27] Segmentation-Less Extraction of Text and Non-Text Regions From JPEG 2000 Compressed Document Images Through Partial and Intelligent Decompression
    Bisen, Tejasvee
    Javed, Mohammed
    Nagabhushan, P.
    Watanabe, Osamu
    IEEE ACCESS, 2023, 11 : 20673 - 20687
  • [28] A recurrent neural network based deep learning model for text and non-text stroke classification in online handwritten Devanagari document
    Ghosh, Rajib
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24245 - 24263
  • [29] User interface for text and non-text classification
    Thanh Thi Xuan Lam
    Anh Duc Le
    Nakagawa, Masaki
    2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDAR 2019 WORKSHOP) AND 2ND INTERNATIONAL WORKSHOP ON HUMAN-DOCUMENT INTERACTION, VOL 3, 2019, : 1 - 5
  • [30] Distinguishing Text/Non-Text Natural Images with Multi-Dimensional Recurrent Neural Networks
    Lyu, Pengyuan
    Shi, Baoguang
    Zhang, Chengquan
    Bai, Xiang
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3981 - 3986