NESP: Nonlinear enhancement and selection of plane for optimal segmentation and recognition of scene word images

Cited by: 4
Authors
Kumar, Deepak [1]
Prasad, M. N. Anil [1]
Ramakrishnan, A. G. [1]
Affiliations
[1] Indian Inst Sci, Dept Elect Engn, Med Intelligence & Language Engn Lab, Bangalore 560012, Karnataka, India
Keywords
nonlinear enhancement; power-law transform; text polarity inversion; binarization; evaluation; threshold; recognition; normality test
DOI
10.1117/12.2008519
Chinese Library Classification number
O43 [Optics]
Discipline classification codes
070207; 0803
Abstract
In this paper, we report a breakthrough result on the difficult task of segmentation and recognition of coloured text from the word image dataset of the ICDAR Robust Reading Competition, Challenge 2: Reading Text in Scene Images. We split the word image into individual colour, gray and lightness planes and enhance the contrast of each plane independently by a power-law transform. The discrimination factor of each plane is computed as the maximum between-class variance used in Otsu thresholding. The plane with the maximum discrimination factor is selected for segmentation. The trial version of Omnipage OCR is then used on the binarized words for recognition. Our recognition results on the ICDAR 2011 and ICDAR 2003 word datasets are compared with those reported in the literature. As a baseline, images binarized by simple global and local thresholding techniques were also recognized. The word recognition rate obtained by our nonlinear enhancement and selection of plane (NESP) method is 72.8% and 66.2% for the ICDAR 2011 and 2003 word datasets, respectively. We have created ground truth for each image at the pixel level to benchmark these datasets, using a toolkit developed by us. The recognition rate of the benchmarked images is 86.7% and 83.9% for the ICDAR 2011 and 2003 datasets, respectively.
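The plane-selection idea described in the abstract can be sketched in a few lines of Python. This is an illustrative sketch under stated assumptions, not the authors' implementation: the function names (`power_law`, `otsu`, `split_planes`, `select_plane`), the candidate gamma values, the gray-plane weights, the definition of lightness as the mean of the largest and smallest channel, and the 256-bin histogram are all assumptions that the abstract does not specify. Text polarity inversion, which appears in the keywords, is not handled here.

```python
def power_law(values, gamma):
    """Power-law (gamma) transform of intensities clipped to [0, 1]."""
    return [min(max(v, 0.0), 1.0) ** gamma for v in values]


def otsu(values, bins=256):
    """Return (maximum between-class variance, threshold) for values in [0, 1]."""
    hist = [0] * bins
    for v in values:
        hist[min(int(v * bins), bins - 1)] += 1
    n = len(values)
    total_mean = sum((i + 0.5) / bins * h for i, h in enumerate(hist)) / n
    best_var, best_thr = 0.0, 0.0
    w0 = 0.0  # fraction of pixels in bins 0..i
    m0 = 0.0  # cumulative sum of (bin centre * bin probability)
    for i, h in enumerate(hist[:-1]):
        w0 += h / n
        m0 += (i + 0.5) / bins * h / n
        w1 = 1.0 - w0
        if w0 < 1e-12 or w1 < 1e-12:
            continue  # one class is empty, the variance is undefined
        var = (total_mean * w0 - m0) ** 2 / (w0 * w1)
        if var > best_var:
            best_var, best_thr = var, (i + 1) / bins
    return best_var, best_thr


def split_planes(pixels):
    """Split a flat list of (r, g, b) floats in [0, 1] into candidate planes.

    The gray weights and the lightness formula are assumptions.
    """
    return {
        "red": [p[0] for p in pixels],
        "green": [p[1] for p in pixels],
        "blue": [p[2] for p in pixels],
        "gray": [0.299 * p[0] + 0.587 * p[1] + 0.114 * p[2] for p in pixels],
        "lightness": [(max(p) + min(p)) / 2 for p in pixels],
    }


def select_plane(pixels, gammas=(0.5, 1.0, 2.0)):
    """Pick the (plane, gamma) pair with the largest discrimination factor.

    The gamma candidates are hypothetical; the abstract gives no values.
    Returns (plane name, gamma, binary mask as a list of bools).
    """
    best = None
    for name, plane in split_planes(pixels).items():
        for gamma in gammas:
            enhanced = power_law(plane, gamma)
            var, thr = otsu(enhanced)
            if best is None or var > best[0]:
                best = (var, name, gamma, thr, enhanced)
    _, name, gamma, thr, enhanced = best
    return name, gamma, [v > thr for v in enhanced]
```

Calling `select_plane` on a flat list of `(r, g, b)` tuples returns the name of the chosen plane, the gamma that maximised its between-class variance, and a binary mask that could then be passed to an OCR engine. Whether foreground pixels end up `True` or `False` depends on the text polarity, so a real pipeline would need an additional inversion step.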
Pages: 10