RULE BASED CONTEXTUAL POST-PROCESSING FOR DEVANAGARI TEXT RECOGNITION

被引:28
|
作者
SINHA, RMK
机构
[1] INRS-Telecom., University of Quebec, 3, Place du Commerce, Nuns' Island, Verdun, Quebec H3E IH6, Canada
关键词
AUTOMATA THEORY;
D O I
10.1016/0031-3203(87)90075-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The spatial relationships among the constituent symbols of Devanagari script play an important role in the interpretation of Devanagari words. There are a number of constraints on these spatial relationships which characterise Devanagari script composition syntax. When the word composition is not found to be syntactically correct, the symbols are substituted with their resembling counterparts. The symbol substitution rules are mostly heuristic in nature. Human interpretation normally involves application of script composition syntax rules and the symbol substitution rules in an interleaved fashion. This paper presents a design of a post-processor which corrects the Devangagari symbol string based on this observation. The composition syntax checker is represented in the form of a finite state machine. The substitution rules are in the form of condition action pairs giving flexibility to the system for easy alteration. Each substitution rule has a penalty associated with it and the accumulated penalty value for a word gives a measure of its confidence level.
引用
下载
收藏
页码:475 / 485
页数:11
相关论文
共 50 条
  • [1] Contextual post-processing based on the confusion matrix in offline handwritten Chinese script recognition
    Li, YX
    Tan, CL
    Ding, XQ
    Liu, CS
    PATTERN RECOGNITION, 2004, 37 (09) : 1901 - 1912
  • [2] Devanagari Text Recognition: A Transcription Based Formulation
    Sankaran, Naveen
    Neelappa, Aman
    Jawahar, C. V.
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 678 - 682
  • [3] Improvement of Speech Recognition Accuracy Using Post-processing of Recognized Text
    Rudzionis, Vytautas
    Malukas, Ugnius
    Danieliene, Renata
    INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2022, 2022, 1665 : 265 - 270
  • [4] Benchmarking Post-processing Techniques for Offline Arabic Text Recognition System
    Jemni, Sana Khamekhem
    Kesentini, Yousri
    Kanoun, Slim
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS 2016), 2017, 552 : 267 - 277
  • [5] On virtual partitioning of large dictionaries for contextual post-processing to improve character recognition
    Hoch, R
    Kieninger, T
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1996, 10 (04) : 273 - 289
  • [6] A Post-Processing Method for Text Detection Based on Geometric Features
    Qiu, Xiaogang
    Chen, Shanxiong
    Li, Rankang
    Wang, Dingwang
    Lin, Xiaoyu
    IEEE ACCESS, 2021, 9 : 36620 - 36633
  • [7] An empirical study of statistical language models for contextual post-processing of Chinese script recognition
    Li, YX
    Tan, CL
    NINTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION, PROCEEDINGS, 2004, : 257 - 262
  • [8] KEDIT - TEXT EDITOR FOR POST-PROCESSING SEARCHES
    WOLFF, TE
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1992, 203 : 14 - CINF
  • [9] A POST-PROCESSING METHOD OF CHARACTER RECOGNITION BASED ON SEMANTIC ANALYSIS
    Li, Li
    Wang, Peifeng
    INTELLIGENT DECISION MAKING SYSTEMS, VOL. 2, 2010, : 498 - +
  • [10] VISUAL TEXT RECOGNITION THROUGH CONTEXTUAL PROCESSING
    SINHA, RMK
    PRASADA, B
    PATTERN RECOGNITION, 1988, 21 (05) : 463 - 479