An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition

被引:2
|
作者
Dhanikonda, Srinivasa Rao [1 ]
Sowjanya, Ponnuru [1 ]
Ramanaiah, M. Laxmidevi [2 ]
Joshi, Rahul [3 ]
Mohan, B. H. Krishna [4 ]
Dhabliya, Dharmesh [5 ]
Raja, N. Kannaiya [6 ]
机构
[1] GITAM Deemed Be Univ, Dept Comp Sci Engn, Hyderabad, India
[2] Inst Aeronaut Engn, Dept Elect & Elect Engn, Hyderabad, India
[3] Symbiosis Int, Symbiosis Inst Technol, Dept Comp Sci Engn, Pune, Maharashtra, India
[4] RVR & JC Coll Engn, Dept Informat Technol, Guntur, Andhra Pradesh, India
[5] Vishwakarma Inst Informat Technol, Dept Comp Engn, Pune, Maharashtra, India
[6] IOT HH Campus Ambo Univ, Dept Comp Sci, Ambo, Ethiopia
关键词
D O I
10.1155/2022/1059004
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many symbols joined together. OCR is the process of turning a document image into a text-editable one that may be used in other applications. It saves a great deal of time and effort by not having to start from scratch each time. There are hundreds of thousands of different combinations of modifiers and consonants when writing compound letters. Symbols joined to one another form a compound character. Since there are so many output classes in Telugu, there's a lot of interclass variation. Additionally, there are not any Telugu OCR systems that take use of recent breakthroughs in deep learning, which prompted us to create our own. When used in conjunction with a word processor, an OCR system has a significant impact on real-world applications. In a Telugu OCR system, we offer two ways to improve symbol or glyph segmentation. When it comes to Telugu OCR, the ability to recognise that Telugu text is crucial. In a picture, connected components are collections of identical pixels that are connected to one another by either 4- or 8-pixel connectivity. These connected components are known as glyphs in Telugu. In the proposed research, an efficient deep learning model with Interrelated Tagging Prototype with Segmentation for Telugu Text Recognition (ITP-STTR) is introduced. The proposed model is compared with the existing model and the results exhibit that the proposed model's performance in text recognition is high.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Ensemble deep learning model for optical character recognition
    Ashish Shetty
    Sanjeev Sharma
    Multimedia Tools and Applications, 2024, 83 : 11411 - 11431
  • [2] Ensemble deep learning model for optical character recognition
    Shetty, Ashish
    Sharma, Sanjeev
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 11411 - 11431
  • [3] Deep Learning Based Residual Network Features for Telugu Printed Character Recognition
    Sonthi, Vijaya Krishna
    Nagarajan, S.
    Krishnaraj, N.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 34 (03): : 1725 - 1736
  • [4] An optical character recognition system for printed Telugu text
    Lakshmi, CV
    Patvardhan, C
    PATTERN ANALYSIS AND APPLICATIONS, 2004, 7 (02) : 190 - 204
  • [5] An optical character recognition system for printed Telugu text
    C. Vasantha Lakshmi
    C. Patvardhan
    Pattern Analysis and Applications, 2004, 7 : 190 - 204
  • [6] Automated Telugu Printed and Handwritten Character Recognition in Single Image using Aquila Optimizer based Deep Learning Model
    Sonthi, Vijaya Krishna
    Nagarajan, S.
    Krishnaraj, N.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (12) : 597 - 604
  • [7] OPTICAL CHARACTER RECOGNITION (OCR) FOR TELUGU: DATABASE, ALGORITHM AND APPLICATION
    Prakash, Konkimalla Chandra
    Srikar, Y. M.
    Trishal, Gayam
    Mandal, Souraj
    Channappayya, Sumohana S.
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 3963 - 3967
  • [8] Off-line Telugu Handwritten Characters Recognition using optical character recognition
    Prameela, N.
    Anjusha, P.
    Karthik, R.
    2017 INTERNATIONAL CONFERENCE OF ELECTRONICS, COMMUNICATION AND AEROSPACE TECHNOLOGY (ICECA), VOL 2, 2017, : 223 - 226
  • [9] Optical character recognition without segmentation
    Ozdil, MA
    Vural, FTY
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 483 - 486
  • [10] Optical Character Recognition for Medical Records Digitization with Deep Learning
    Zaryab, Muhammad Ateeque
    Ng, Chuen Rue
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3260 - 3263