NESP: Nonlinear enhancement and selection of plane for optimal segmentation and recognition of scene word images

被引:4
|
作者
Kumar, Deepak [1 ]
Prasad, M. N. Anil [1 ]
Ramakrishnan, A. G. [1 ]
机构
[1] Indian Inst Sci, Dept Elect Engn, Med Intelligence & Language Engn Lab, Bangalore 560012, Karnataka, India
来源
DOCUMENT RECOGNITION AND RETRIEVAL XX | 2013年 / 8658卷
关键词
nonlinear enhancement; power-law transform; text polarity inversion; binarization; evaluation; threshold; recognition; normality test;
D O I
10.1117/12.2008519
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
In this paper, we report a breakthrough result on the difficult task of segmentation and recognition of coloured text from the word image dataset of ICDAR robust reading competition challenge 2: reading text in scene images. We split the word image into individual colour, gray and lightness planes and enhance the contrast of each of these planes independently by a power-law transform. The discrimination factor of each plane is computed as the maximum between-class variance used in Otsu thresholding. The plane that has maximum discrimination factor is selected for segmentation. The trial version of Omnipage OCR is then used on the binarized words for recognition. Our recognition results on ICDAR 2011 and ICDAR 2003 word datasets are compared with those reported in the literature. As baseline, the images binarized by simple global and local thresholding techniques were also recognized. The word recognition rate obtained by our non-linear enhancement and selection of plance method is 72.8% and 66.2% for ICDAR 2011 and 2003 word datasets, respectively. We have created ground-truth for each image at the pixel level to benchmark these datasets using a toolkit developed by us. The recognition rate of benchmarked images is 86.7% and 83.9% for ICDAR 2011 and 2003 datasets, respectively.
引用
收藏
页数:10
相关论文
共 32 条
  • [21] Selection and evaluation of optimal segmentation scale for high-resolution remote sensing images based on prior thematic maps and image features
    Wang, Fang
    Yang, Wunian
    Ren, Jintong
    JOURNAL OF APPLIED REMOTE SENSING, 2019, 13 (01)
  • [22] Optimal Segmentation Scale Selection for Object-Based Change Detection in Remote Sensing Images Using Kullback-Leibler Divergence
    Wu, Junzheng
    Li, Biao
    Ni, Weiping
    Yan, Weidong
    Zhang, Han
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (07) : 1124 - 1128
  • [23] Review for Optimal Human-gesture Design Methodology and Motion Representation of Medical Images using Segmentation from Depth Data and Gesture Recognition
    Gupta, Anju
    Kumar, Sanjeev
    CURRENT MEDICAL IMAGING, 2024, 20
  • [24] Optimal segmentation scale selection and evaluation of cultivated land objects based on high-resolution remote sensing images with spectral and texture features
    Heng Lu
    Chao Liu
    Naiwen Li
    Xiao Fu
    Longguo Li
    Environmental Science and Pollution Research, 2021, 28 : 27067 - 27083
  • [25] Optimal segmentation scale selection and evaluation of cultivated land objects based on high-resolution remote sensing images with spectral and texture features
    Lu, Heng
    Liu, Chao
    Li, Naiwen
    Fu, Xiao
    Li, Longguo
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2021, 28 (21) : 27067 - 27083
  • [26] Selection of Optimal Thresholds in Multi-Level Thresholding Using Multi-Objective Emperor Penguin Optimization for Precise Segmentation of Mammogram Images
    Subasree, S.
    Sakthivel, N. K.
    Balasaraswathi, V. R.
    Tyagi, Amit Kumar
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (07)
  • [27] Information theoretic optimal vocal tract region selection from real time magnetic resonance images for broad phonetic class recognition
    Prasad, Abhay
    Ghosh, Prasanta Kumar
    COMPUTER SPEECH AND LANGUAGE, 2016, 39 : 108 - 128
  • [28] Automatic Recognition and Validation of the Common Carotid Artery Wall Segmentation in 100 Longitudinal Ultrasound Images: An Integrated Approach using Feature Selection, Fitting & Classification
    Molinari, Filippo
    Zeng, Guang
    Suri, Jasjit S.
    MEDICAL IMAGING 2010: IMAGE PROCESSING, 2010, 7623
  • [29] Optimal threshold selection for segmentation of Chest X-Ray images using opposition-based swarm-inspired algorithm for diagnosis of pneumonia
    Tejna Khosla
    Om Prakash Verma
    Multimedia Tools and Applications, 2024, 83 : 27089 - 27119
  • [30] Optimal threshold selection for segmentation of Chest X-Ray images using opposition-based swarm-inspired algorithm for diagnosis of pneumonia
    Khosla, Tejna
    Verma, Om Prakash
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 27089 - 27119