An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition

被引:2
|
作者
Dhanikonda, Srinivasa Rao [1 ]
Sowjanya, Ponnuru [1 ]
Ramanaiah, M. Laxmidevi [2 ]
Joshi, Rahul [3 ]
Mohan, B. H. Krishna [4 ]
Dhabliya, Dharmesh [5 ]
Raja, N. Kannaiya [6 ]
机构
[1] GITAM Deemed Be Univ, Dept Comp Sci Engn, Hyderabad, India
[2] Inst Aeronaut Engn, Dept Elect & Elect Engn, Hyderabad, India
[3] Symbiosis Int, Symbiosis Inst Technol, Dept Comp Sci Engn, Pune, Maharashtra, India
[4] RVR & JC Coll Engn, Dept Informat Technol, Guntur, Andhra Pradesh, India
[5] Vishwakarma Inst Informat Technol, Dept Comp Engn, Pune, Maharashtra, India
[6] IOT HH Campus Ambo Univ, Dept Comp Sci, Ambo, Ethiopia
关键词
D O I
10.1155/2022/1059004
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many symbols joined together. OCR is the process of turning a document image into a text-editable one that may be used in other applications. It saves a great deal of time and effort by not having to start from scratch each time. There are hundreds of thousands of different combinations of modifiers and consonants when writing compound letters. Symbols joined to one another form a compound character. Since there are so many output classes in Telugu, there's a lot of interclass variation. Additionally, there are not any Telugu OCR systems that take use of recent breakthroughs in deep learning, which prompted us to create our own. When used in conjunction with a word processor, an OCR system has a significant impact on real-world applications. In a Telugu OCR system, we offer two ways to improve symbol or glyph segmentation. When it comes to Telugu OCR, the ability to recognise that Telugu text is crucial. In a picture, connected components are collections of identical pixels that are connected to one another by either 4- or 8-pixel connectivity. These connected components are known as glyphs in Telugu. In the proposed research, an efficient deep learning model with Interrelated Tagging Prototype with Segmentation for Telugu Text Recognition (ITP-STTR) is introduced. The proposed model is compared with the existing model and the results exhibit that the proposed model's performance in text recognition is high.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Video text detection and segmentation for optical character recognition
    Chong-Wah Ngo
    Chi-Kwong Chan
    Multimedia Systems, 2005, 10 : 261 - 272
  • [32] License Plate Recognition Using Three-Dimensional Rotated Character Recognition and Instance Segmentation by Deep Learning
    Sasaki, Tetsuro
    Morita, Kento
    Wakabayashi, Tetsushi
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (05) : 1178 - 1185
  • [33] FLEXIBLE SEGMENTATION AND MATCHING FOR OPTICAL CHARACTER-RECOGNITION
    SUN, SW
    KUNG, SY
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING IV, PTS 1-3, 1989, 1199 : 1314 - 1323
  • [34] A Deep Learning-based Pre-Trained VGG19 Model for Optical Character Recognition
    Singh, Gurpreet
    Guleria, Kalpna
    Sharma, Shagun
    2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024, 2024, : 801 - 805
  • [35] An Efficient Multiclassifier System Based on Convolutional Neural Network for Offline Handwritten Telugu Character Recognition
    Soman, Soumya T.
    Nandigam, Ashakranthi
    Chakravarthy, V. Srinivasa
    2013 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2013,
  • [36] Deep Learning Based Tangut Character Recognition
    Zhang, Guangwei
    Han, Xiaomang
    2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 437 - 441
  • [37] Deep Learning Strategy for Braille Character Recognition
    Kausar, Tasleem
    Manzoor, Sajjad
    Kausar, Adeeba
    Lu, Yun
    Wasif, Muhammad
    Ashraf, M. Adnan
    IEEE ACCESS, 2021, 9 : 169357 - 169371
  • [38] Telugu Dialect Speech Dataset Creation and Recognition using Deep Learning Techniques
    Podila, Rama Sai Abhishek
    Kommula, Ganga Sai Sudeep
    Ruthvik, K.
    Vekkot, Susmitha
    Gupta, Deepa
    2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [39] Character Recognition by Deep Learning: An Enterprise solution
    Bouaziz, Khaled
    Ramakrishnan, Thiagarajan
    Raghavan, Srinivasan
    Grove, Kyle
    Al-Omari, Awny
    Lakshminarayan, Choudur
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1719 - 1727
  • [40] Improving Deep Learning based Optical Character Recognition via Neural Architecture Search
    Zhao, Zhenyao
    Jiang, Min
    Guo, Shihui
    Wang, Zhenzhong
    Chao, Fei
    Tan, Kay Chen
    2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,