Devanagari Text Recognition: A Transcription Based Formulation

被引:4
|
作者
Sankaran, Naveen [1 ]
Neelappa, Aman [1 ]
Jawahar, C. V. [1 ]
机构
[1] Int Inst Informat Technol, Hyderabad, Andhra Pradesh, India
关键词
D O I
10.1109/ICDAR.2013.139
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optical Character Recognition (OCR) problems are often formulated as isolated character (symbol) classification task followed by a post-classification stage (which contains modules like Unicode generation, error correction etc.) to generate the textual representation, for most of the Indian scripts. Such approaches are prone to failures due to (i) difficulties in designing reliable word-to-symbol segmentation module that can robustly work in presence of degraded (cut/fused) images and (ii) converting the outputs of the classifiers to a valid sequence of Unicodes. In this paper, we propose a formulation, where the expectations on these two modules is minimized, and the harder recognition task is modelled as learning of an appropriate sequence to sequence translation scheme. We thus formulate the recognition as a direct transcription problem. Given many examples of feature sequences and their corresponding Unicode representations, our objective is to learn a mapping which can convert a word directly into a Unicode sequence. This formulation has multiple practical advantages: (i) This reduces the number of classes significantly for the Indian scripts. (ii) It removes the need for a reliable word-to-symbol segmentation. (ii) It does not require strong annotation of symbols to design the classifiers, and (iii) It directly generates a valid sequence of Unicodes. We test our method on more than 6000 pages of printed Devanagari documents from multiple sources. Our method consistently outperforms other state of the art implementations.
引用
下载
收藏
页码:678 / 682
页数:5
相关论文
共 50 条
  • [1] Efficient Recognition of Devanagari Handwritten Text
    Kundaikar, Teja C.
    Laxminarayana, J. A.
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 2, 2015, 32 : 81 - 88
  • [2] RULE BASED CONTEXTUAL POST-PROCESSING FOR DEVANAGARI TEXT RECOGNITION
    SINHA, RMK
    PATTERN RECOGNITION, 1987, 20 (05) : 475 - 485
  • [3] AN ADAPTIVE APPROACH FOR OPTICAL RECOGNITION OF INTER-POINT HINDI DEVANAGARI BRAILLE AND ITS TRANSCRIPTION TO TEXT
    Shreekanth, T.
    Udayashankara, V
    PROCEEDINGS OF THE INTERNATIONAL SEMINAR ON COMPUTATION, COMMUNICATION AND CONTROL, 2015, 4 : 32 - 36
  • [4] Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam
    Mathew, Minesh
    Jain, Mohit
    Jawahar, C. V.
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 7, 2017, : 42 - 46
  • [5] Integrating knowledge sources in Devanagari text recognition system
    Bansal, V
    Sinha, RMK
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (04): : 500 - 505
  • [6] Recognition of Printed Devanagari Text Using BLSTM Neural Network
    Sankaran, Naveen
    Jawahar, C. V.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 322 - 325
  • [7] Challenges in recognition of Devanagari Scripts due to Segmentation of Handwritten text
    Bathla, Ashok Kumar
    Gupta, Sunil Kumar
    Jindal, Manish Kumar
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 2711 - 2715
  • [8] Feature extraction and classification techniques for handwritten Devanagari text recognition: a survey
    Singh, Sukhjinder
    Garg, Naresh Kumar
    Kumar, Munish
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (01) : 747 - 775
  • [9] Feature extraction and classification techniques for handwritten Devanagari text recognition: a survey
    Sukhjinder Singh
    Naresh Kumar Garg
    Munish Kumar
    Multimedia Tools and Applications, 2023, 82 : 747 - 775
  • [10] Multi-font Devanagari Text Recognition Using LSTM Neural Networks
    Kundaikar, Teja
    Pawar, Jyoti D.
    FIRST INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR COMPUTATIONAL INTELLIGENCE, 2020, 1045 : 495 - 506