Automatic recognition of printed Oriya script

被引:0
|
作者
B. B. Chaudhuri
U. Pal
M. Mitra
机构
[1] Indian statistical Institute,Computer Vision and Pattern Recognition Unit
来源
Sadhana | 2002年 / 27卷
关键词
Indian script; Oriya text; character segmentation; skew detection; optical character recognition (OCR);
D O I
暂无
中图分类号
学科分类号
摘要
This paper deals with an Optical Character Recognition (OCR) system for printedOriya script. The development of OCR for this script is difficult because a large number of character shapes in the script have to be recognized. In the proposed system, the document image is first captured using a flat-bed scanner and then passed through different preprocessing modules like skew correction, line segmentation, zone detection, word and character segmentation etc. These modules have been developed by combining some conventional techniques with some newly proposed ones. Next, individual characters are recognized using a combination of stroke and run-number based features, along with features obtained from the concept of water overflow from a reservoir. The feature detection methods are simple and robust, and do not require preprocessing steps like thinning and pruning. A prototype of the system has been tested on a variety of printed Oriya material, and currently achieves 96.3% character level accuracy on average.
引用
收藏
页码:23 / 34
页数:11
相关论文
共 50 条
  • [1] Automatic recognition of printed Oriya script
    Chaudhuri, BB
    Pal, U
    Mitra, M
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2002, 27 (1): : 23 - 34
  • [2] Automatic recognition of printed Oriya script
    Chaudhuri, B.B.
    Pal, U.
    Mitra, M.
    [J]. Sadhana - Academy Proceedings in Engineering Sciences, 2002, 27 (01) : 23 - 34
  • [3] Automatic recognition of printed Oriya script
    Chaudhuri, BB
    Pal, U
    Mitra, M
    [J]. SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 795 - 799
  • [4] An Optical Character Recognition of Machine Printed Oriya Script
    Raj, Aditya
    [J]. 2015 THIRD INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2015, : 543 - 547
  • [5] Recognition of Printed Oriya Script using Gradient based Features
    Chaudhary, Sneha
    Sharma, Sandeepika
    Kumar, Bhupendra
    [J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [6] Recognition of printed Urdu script
    Pal, U
    Sarkar, A
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 1183 - 1187
  • [7] BENGALI MANUSCRIPTS IN ORIYA SCRIPT
    PANDA, B
    [J]. QUARTERLY REVIEW OF HISTORICAL STUDIES, 1982, 21 (01): : 22 - 27
  • [8] AUTOMATIC RECOGNITION OF PRINT AND SCRIPT
    HARMON, LD
    [J]. PROCEEDINGS OF THE IEEE, 1972, 60 (10) : 1165 - 1176
  • [9] Printed Arabic Script Recognition: A Survey
    Alghamdi, Mansoor
    Teahan, William
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (09) : 415 - 428
  • [10] Recognition of newspaper printed in Gurumukhi script
    Kaur, Rupinder Pal
    Jindal, Manish Kumar
    Kumar, Munish
    [J]. JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2019, 26 (09) : 2495 - 2503