A Chinese Document Layout Analysis Based on Non-text Images

被引:1
|
作者
Fu Xiaoling [1 ]
Li Xiaofeng [1 ]
机构
[1] N China Univ Informat Engn NCUT, Multimedia Technol Lab, Beijing 100144, Peoples R China
关键词
layout analysis; projection; connective region; threshold;
D O I
10.1109/IFCSTA.2009.85
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the paper as the medium of electronic information, traditional books, magazines, newspapers, etc are scanned into the images,and changed into electronic documents through OCR(optical character recognition) technology,layout analysis as an important part of OCR has played a greater role. This paper presents a Chinese document layout analysis based on non-text images, solve the deformed image of the issue of text extraction, and there is great value in practice.
引用
收藏
页码:326 / 328
页数:3
相关论文
共 50 条
  • [41] Text and Non-text Separation in Scanned Color-Official Documents
    Nandedkar, Amit Vijay
    Mukherjee, Jayanta
    Sural, Shamik
    COMPUTER VISION, GRAPHICS, AND IMAGE PROCESSING, ICVGIP 2016, 2017, 10481 : 231 - 242
  • [42] A chinese document layout analysis method based on minimal spanning tree clustering
    Tian, XD
    Zhang, C
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 3183 - 3187
  • [43] Text retrieval from document images based on word shape analysis
    Tan, CL
    Huang, WH
    Sung, SY
    Yu, ZH
    Xu, Y
    APPLIED INTELLIGENCE, 2003, 18 (03) : 257 - 270
  • [44] Text Retrieval from Document Images Based on Word Shape Analysis
    Chew Lim Tan
    Weihua Huang
    Sam Yuan Sung
    Zhaohui Yu
    Yi Xu
    Applied Intelligence, 2003, 18 : 257 - 270
  • [45] Text segmentation by integrating hybrid strategy and non-text filtering
    Minhua Li
    Meng Bai
    Yingjun Lv
    Multimedia Tools and Applications, 2022, 81 : 44505 - 44522
  • [46] Text and Non-text Recognition using modified HOG descriptor
    Sah, Ankit Kumar
    Bhowmik, Showmik
    Malakar, Samir
    Sarkar, Ram
    Kavallieratou, Ergina
    Vasilopoulos, Nikos
    2017 IEEE CALCUTTA CONFERENCE (CALCON), 2017, : 64 - 68
  • [47] Classification of regions extracted from scene images by morphological filters in text or non-text using decision tree
    Luz Alves, Wonder Alexandre
    Hashimoto, Ronaldo Fumio
    WSCG 2010: FULL PAPERS PROCEEDINGS, 2010, : 165 - 172
  • [48] Text segmentation by integrating hybrid strategy and non-text filtering
    Li, Minhua
    Bai, Meng
    Lv, Yingjun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (30) : 44505 - 44522
  • [49] Text and non-text image classification algorithm of computer design scene based on deep learning
    Lai, Shouliang
    Luo, Zihui
    Wang, Meiyan
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 63 - 63
  • [50] Text Content Based Layout Analysis
    Ramon Prieto, Jose
    Bosch, Vicente
    Vidal, Enrique
    Stutzmann, Dominique
    Hamel, Sebastien
    2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 258 - 263