On developing high accuracy OCR systems for Telugu and other Indian scripts

被引:5
|
作者
Bhagvati, C [1 ]
Ravi, T [1 ]
Kumar, SM [1 ]
Negi, A [1 ]
机构
[1] Univ Hyderabad, Dept Comp & Informat Sci, Hyderabad 500046, Andhra Pradesh, India
关键词
D O I
10.1109/LEC.2002.1182286
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we list a number of factors that are important in achieving high recognition accuracy in OCR systems for Telugu and other Indian scripts. While it is relatively easy to obtain 85% - 93% accuracy, it becomes increasingly difficult to improve the performance further We discuss how the factors presented in this paper helped achieve an accuracy of nearly 97% with our OCR system for Telugu script. It is expected that these factors are specific not only to Telugu but also work for other Indian scripts in general and south Indian scripts in particular.
引用
收藏
页码:18 / 23
页数:6
相关论文
共 50 条