RULE BASED CONTEXTUAL POST-PROCESSING FOR DEVANAGARI TEXT RECOGNITION

Cited by: 28
Authors
SINHA, RMK
Affiliation
[1] INRS-Telecom., University of Quebec, 3, Place du Commerce, Nuns' Island, Verdun, Quebec H3E 1H6, Canada
Keywords
AUTOMATA THEORY;
DOI
10.1016/0031-3203(87)90075-6
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The spatial relationships among the constituent symbols of Devanagari script play an important role in the interpretation of Devanagari words. There are a number of constraints on these spatial relationships which characterise Devanagari script composition syntax. When the word composition is not found to be syntactically correct, the symbols are substituted with their resembling counterparts. The symbol substitution rules are mostly heuristic in nature. Human interpretation normally involves application of script composition syntax rules and the symbol substitution rules in an interleaved fashion. This paper presents a design of a post-processor which corrects the Devanagari symbol string based on this observation. The composition syntax checker is represented in the form of a finite state machine. The substitution rules are in the form of condition-action pairs, giving flexibility to the system for easy alteration. Each substitution rule has a penalty associated with it, and the accumulated penalty value for a word gives a measure of its confidence level.
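The interleaving described in the abstract can be illustrated with a minimal sketch. It is not the paper's actual design: the symbol classes (C, V, M, A), the transition table, the three rules and their penalty values are all hypothetical, chosen only to show a finite state syntax checker alternating with condition-action substitution rules while a penalty accumulates.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

# Hypothetical symbol classes (not the paper's inventory):
# C = consonant, V = independent vowel, M = vowel sign, A = modifier.
TRANSITIONS = {
    ("start", "C"): "after_base",
    ("start", "V"): "after_base",
    ("after_base", "C"): "after_base",
    ("after_base", "V"): "after_base",
    ("after_base", "M"): "after_matra",
    ("after_base", "A"): "after_modifier",
    ("after_matra", "C"): "after_base",
    ("after_matra", "V"): "after_base",
    ("after_matra", "A"): "after_modifier",
    ("after_modifier", "C"): "after_base",
    ("after_modifier", "V"): "after_base",
}
ACCEPTING = {"after_base", "after_matra", "after_modifier"}


def first_error(symbols: List[str]) -> Optional[int]:
    """Run the FSM; return None if the word is accepted, else the failing index."""
    state = "start"
    for i, sym in enumerate(symbols):
        nxt = TRANSITIONS.get((state, sym))
        if nxt is None:
            return i
        state = nxt
    return None if state in ACCEPTING else len(symbols)


@dataclass(frozen=True)
class Rule:
    name: str
    condition: Callable[[List[str], int], bool]
    action: Callable[[List[str], int], List[str]]
    penalty: int


def replace_at(new_symbol: str) -> Callable[[List[str], int], List[str]]:
    """Build an action that substitutes the symbol at the failing index."""
    return lambda s, i: s[:i] + [new_symbol] + s[i + 1:]


# Hypothetical condition-action pairs; penalties are arbitrary illustration values.
RULES = [
    Rule("vowel sign at word start -> independent vowel",
         lambda s, i: i == 0 and s[i] == "M", replace_at("V"), 2),
    Rule("vowel sign after vowel sign -> consonant",
         lambda s, i: i > 0 and s[i] == "M" and s[i - 1] == "M", replace_at("C"), 3),
    Rule("misplaced modifier -> consonant",
         lambda s, i: s[i] == "A", replace_at("C"), 4),
]


def correct(symbols: List[str], max_steps: int = 10) -> Tuple[Optional[List[str]], int]:
    """Alternate syntax checking and substitution; return (word or None, penalty)."""
    word, total = list(symbols), 0
    for _ in range(max_steps):
        pos = first_error(word)
        if pos is None:
            return word, total
        applicable = [r for r in RULES if pos < len(word) and r.condition(word, pos)]
        if not applicable:
            return None, total  # no rule can repair this word
        rule = min(applicable, key=lambda r: r.penalty)  # cheapest repair first
        word = rule.action(word, pos)
        total += rule.penalty
    return None, total
```

Under these assumptions, correct(["M", "C", "M", "M"]) repairs the word in two substitutions and returns (["V", "C", "M", "C"], 5), while a word that already satisfies the toy syntax comes back unchanged with penalty 0. A lower accumulated penalty corresponds to higher confidence in the corrected word; keeping the rules in a plain list is one way to reflect the abstract's point that condition-action pairs are easy to alter.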
Pages: 475-485
Number of pages: 11
Related papers
50 records in total
  • [31] Regularization of LDA for face recognition: A post-processing approach
    Zuo, WM
    Wang, KQ
    Zhang, D
    Yang, J
    ANALYSIS AND MODELLING OF FACES AND GESTURES, PROCEEDINGS, 2005, 3723 : 377 - 391
  • [32] Improving LiDAR classification accuracy by contextual label smoothing in post-processing
    Li, Nan
    Liu, Chun
    Pfeifer, Norbert
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 148 : 13 - 31
  • [33] Confidence modeling for verification post-processing for handwriting recognition
    Pitrelli, JF
    Perrone, MP
    EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002, : 30 - 35
  • [34] A post-processing approach to improve emotion recognition rates
    Pittermann, Johannes
    Pittermann, Angela
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 708 - +
  • [36] Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam
    Mathew, Minesh
    Jain, Mohit
    Jawahar, C. V.
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 7, 2017, : 42 - 46
  • [37] A novel Arabic OCR post-processing using rule-based and word context techniques
    Abu Doush, Iyad
    Alkhateeb, Faisal
    Gharaibeh, Anwaar Hamdi
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (1-2) : 77 - 89
  • [38] Customizable Cloud-Healthcare Dialogue System Based on LVCSR with Prosodic-Contextual Post-Processing
    Chen, Bo-Wei
    Shih, Po-Yi
    Bharanitharan, Karunanithi
    Lin, Po-Chuan
    Wang, Jhing-Fa
    Chen, Chia-Ming
    1ST INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT 2013), 2013, : 246 - 249
  • [39] A novel Arabic OCR post-processing using rule-based and word context techniques
    Iyad Abu Doush
    Faisal Alkhateeb
    Anwaar Hamdi Gharaibeh
    International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 77 - 89
  • [40] Integrating knowledge sources in Devanagari text recognition system
    Bansal, V
    Sinha, RMK
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (04) : 500 - 505