Devanagari Text Recognition: A Transcription Based Formulation

Cited by: 4
Authors
Sankaran, Naveen [1]
Neelappa, Aman [1]
Jawahar, C. V. [1]
Affiliations
[1] Int Inst Informat Technol, Hyderabad, Andhra Pradesh, India
DOI
10.1109/ICDAR.2013.139
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Optical Character Recognition (OCR) problems are often formulated, for most Indian scripts, as an isolated character (symbol) classification task followed by a post-classification stage (with modules such as Unicode generation and error correction) that generates the textual representation. Such approaches are prone to failure because of (i) the difficulty of designing a reliable word-to-symbol segmentation module that works robustly on degraded (cut or fused) images, and (ii) the difficulty of converting the classifier outputs into a valid sequence of Unicode code points. In this paper, we propose a formulation in which the expectations on these two modules are minimized, and the harder recognition task is modelled as learning an appropriate sequence-to-sequence translation scheme. We thus formulate recognition as a direct transcription problem. Given many examples of feature sequences and their corresponding Unicode representations, our objective is to learn a mapping that converts a word directly into a Unicode sequence. This formulation has multiple practical advantages: (i) it significantly reduces the number of classes for Indian scripts; (ii) it removes the need for reliable word-to-symbol segmentation; (iii) it does not require strong annotation of symbols to design the classifiers; and (iv) it directly generates a valid sequence of Unicode code points. We test our method on more than 6000 pages of printed Devanagari documents from multiple sources. Our method consistently outperforms other state-of-the-art implementations.
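The claim that transcribing directly to Unicode reduces the number of classes can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not describe the model or the label set, so the helper names (to_label_sequence, describe) and the choice to index labels by offset within the Unicode Devanagari block (U+0900 to U+097F) are assumptions made for illustration only. The sketch shows that a conjunct printed as a single symbol is a short sequence of code points, so the output alphabet is bounded by the 128 positions of the block rather than by the much larger inventory of conjunct symbols.

```python
import unicodedata

DEVANAGARI_START = 0x0900  # first code point of the Unicode Devanagari block
DEVANAGARI_END = 0x097F    # last code point of the block (128 positions in total)


def to_label_sequence(word: str) -> list[int]:
    """Map each code point of a word to a class index in [0, 127].

    Hypothetical labelling scheme: index = offset within the Devanagari block.
    """
    labels = []
    for ch in word:
        cp = ord(ch)
        if not DEVANAGARI_START <= cp <= DEVANAGARI_END:
            raise ValueError(f"U+{cp:04X} is outside the Devanagari block")
        labels.append(cp - DEVANAGARI_START)
    return labels


def describe(word: str) -> list[str]:
    """Return a readable 'U+XXXX NAME' string for each code point of a word."""
    return [f"U+{ord(ch):04X} {unicodedata.name(ch)}" for ch in word]


# The conjunct "क्ष" is one printed symbol but three code points:
# KA + VIRAMA + SSA.
conjunct = "\u0915\u094D\u0937"
labels = to_label_sequence(conjunct)
names = describe(conjunct)

for name, label in zip(names, labels):
    print(f"{name} -> class {label}")
```

Under this assumed scheme, a transcription model would emit a sequence of such class indices for each word image, and the predicted indices map back to text by adding DEVANAGARI_START and calling chr on each value.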
Pages: 678-682
Page count: 5