A Chinese Document Layout Analysis Based on Non-text Images

Cited by: 1
Authors
Fu Xiaoling [1]
Li Xiaofeng [1]
Affiliations
[1] N China Univ Informat Engn NCUT, Multimedia Technol Lab, Beijing 100144, Peoples R China
Keywords
layout analysis; projection; connective region; threshold
DOI
10.1109/IFCSTA.2009.85
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
With paper as the medium of information, traditional books, magazines, newspapers, and similar materials are scanned into images and converted into electronic documents through optical character recognition (OCR) technology. Layout analysis, an important part of OCR, plays an increasingly significant role in this process. This paper presents a Chinese document layout analysis method based on non-text images, which addresses the problem of extracting text from deformed images and has considerable practical value.
Pages: 326-328
Page count: 3