An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition

被引:2
|
作者
Dhanikonda, Srinivasa Rao [1 ]
Sowjanya, Ponnuru [1 ]
Ramanaiah, M. Laxmidevi [2 ]
Joshi, Rahul [3 ]
Mohan, B. H. Krishna [4 ]
Dhabliya, Dharmesh [5 ]
Raja, N. Kannaiya [6 ]
机构
[1] GITAM Deemed Be Univ, Dept Comp Sci Engn, Hyderabad, India
[2] Inst Aeronaut Engn, Dept Elect & Elect Engn, Hyderabad, India
[3] Symbiosis Int, Symbiosis Inst Technol, Dept Comp Sci Engn, Pune, Maharashtra, India
[4] RVR & JC Coll Engn, Dept Informat Technol, Guntur, Andhra Pradesh, India
[5] Vishwakarma Inst Informat Technol, Dept Comp Engn, Pune, Maharashtra, India
[6] IOT HH Campus Ambo Univ, Dept Comp Sci, Ambo, Ethiopia
关键词
D O I
10.1155/2022/1059004
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many symbols joined together. OCR is the process of turning a document image into a text-editable one that may be used in other applications. It saves a great deal of time and effort by not having to start from scratch each time. There are hundreds of thousands of different combinations of modifiers and consonants when writing compound letters. Symbols joined to one another form a compound character. Since there are so many output classes in Telugu, there's a lot of interclass variation. Additionally, there are not any Telugu OCR systems that take use of recent breakthroughs in deep learning, which prompted us to create our own. When used in conjunction with a word processor, an OCR system has a significant impact on real-world applications. In a Telugu OCR system, we offer two ways to improve symbol or glyph segmentation. When it comes to Telugu OCR, the ability to recognise that Telugu text is crucial. In a picture, connected components are collections of identical pixels that are connected to one another by either 4- or 8-pixel connectivity. These connected components are known as glyphs in Telugu. In the proposed research, an efficient deep learning model with Interrelated Tagging Prototype with Segmentation for Telugu Text Recognition (ITP-STTR) is introduced. The proposed model is compared with the existing model and the results exhibit that the proposed model's performance in text recognition is high.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Optical Character Recognition System for Czech Language Using Hierarchical Deep Learning Networks
    Chaudhuri, Arindam
    Ghosh, Soumya K.
    APPLIED COMPUTATIONAL INTELLIGENCE AND MATHEMATICAL METHODS: COMPUTATIONAL METHODS IN SYSTEMS AND SOFTWARE 2017, VOL. 2, 2018, 662 : 114 - 125
  • [42] Unsupervised Feature Learning for Optical Character Recognition
    Sahu, Devendra K.
    Awahar, C. V. J.
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 1041 - 1045
  • [43] OMRNet: A lightweight deep learning model for optical mark recognition
    Sayan Mondal
    Pratyay De
    Samir Malakar
    Ram Sarkar
    Multimedia Tools and Applications, 2024, 83 : 14011 - 14045
  • [44] OMRNet: A lightweight deep learning model for optical mark recognition
    Mondal, Sayan
    De, Pratyay
    Malakar, Samir
    Sarkar, Ram
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) : 14011 - 14045
  • [45] Manuscripts Character Recognition Using Machine Learning and Deep Learning
    Islam, Mohammad Anwarul
    Iacob, Ionut E.
    MODELLING, 2023, 4 (02): : 168 - 188
  • [46] Character Recognition using Machine Learning and Deep Learning - A Survey
    Sharma, Reya
    Kaushik, Baijnath
    Gondhi, Naveen
    2020 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2020, : 341 - 345
  • [47] An Arabic optical character recognition system using recognition-based segmentation
    Cheung, A
    Bennamoun, M
    Bergmann, NW
    PATTERN RECOGNITION, 2001, 34 (02) : 215 - 233
  • [48] Deep optical character recognition: a case of Pashto language
    Zahoor, Shizza
    Naz, Saeeda
    Khan, Naila H.
    Razzak, Muhammad, I
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (02)
  • [49] Improved Optical Character Recognition with Deep Neural Network
    Wei, Tan Chiang
    Sheikh, U. U.
    Ab Rahman, Ab Al-Hadi
    2018 IEEE 14TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2018), 2018, : 245 - 249
  • [50] Deep Learning in Object Recognition, Detection, and Segmentation
    Wang, Xiaogang
    FOUNDATIONS AND TRENDS IN SIGNAL PROCESSING, 2014, 8 (04): : I - +