An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition

Cited by: 2
Authors
Dhanikonda, Srinivasa Rao [1]
Sowjanya, Ponnuru [1]
Ramanaiah, M. Laxmidevi [2]
Joshi, Rahul [3]
Mohan, B. H. Krishna [4]
Dhabliya, Dharmesh [5]
Raja, N. Kannaiya [6]
Affiliations
[1] GITAM Deemed Be Univ, Dept Comp Sci Engn, Hyderabad, India
[2] Inst Aeronaut Engn, Dept Elect & Elect Engn, Hyderabad, India
[3] Symbiosis Int, Symbiosis Inst Technol, Dept Comp Sci Engn, Pune, Maharashtra, India
[4] RVR & JC Coll Engn, Dept Informat Technol, Guntur, Andhra Pradesh, India
[5] Vishwakarma Inst Informat Technol, Dept Comp Engn, Pune, Maharashtra, India
[6] IOT HH Campus Ambo Univ, Dept Comp Sci, Ambo, Ethiopia
DOI
10.1155/2022/1059004
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. Little progress has been reported on Optical Character Recognition (OCR) systems for Telugu text. OCR is the process of turning a document image into editable text that can be used in other applications, which saves a great deal of time and effort because the text does not have to be re-entered from scratch each time. When used in conjunction with a word processor, an OCR system has a significant impact on real-world applications. Telugu characters can be composed of many symbols joined together: symbols joined to one another form a compound character, and there are hundreds of thousands of different combinations of modifiers and consonants when writing compound letters. Because Telugu has so many output classes, there is a lot of interclass variation. Additionally, there are no Telugu OCR systems that make use of recent breakthroughs in deep learning, which prompted us to create our own. For a Telugu OCR system, we offer two ways to improve symbol or glyph segmentation. In an image, connected components are collections of identical pixels that are connected to one another by either 4- or 8-pixel connectivity. In Telugu OCR, these connected components are known as glyphs. In the proposed research, an efficient deep learning model with Interrelated Tagging Prototype with Segmentation for Telugu Text Recognition (ITP-STTR) is introduced. The proposed model is compared with the existing model, and the results show that the proposed model achieves higher text recognition performance.
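The abstract describes glyphs as connected components under 4- or 8-pixel connectivity but does not say how they are extracted. The sketch below is a generic breadth-first labeling of a binary image, not the ITP-STTR method. The function names `label_components` and `bounding_boxes` and the toy image are hypothetical, and it assumes foreground (ink) pixels are encoded as 1.

```python
from collections import deque

# Neighbour offsets (row, column) for the two standard connectivity definitions.
NEIGHBOURS_4 = [(-1, 0), (1, 0), (0, -1), (0, 1)]
NEIGHBOURS_8 = NEIGHBOURS_4 + [(-1, -1), (-1, 1), (1, -1), (1, 1)]


def label_components(image, connectivity=8):
    """Label foreground pixels (value 1) of a binary image.

    Hypothetical helper: returns (labels, count), where labels has the same
    shape as image and holds 0 for background or a component id starting at 1.
    """
    if connectivity not in (4, 8):
        raise ValueError("connectivity must be 4 or 8")
    offsets = NEIGHBOURS_4 if connectivity == 4 else NEIGHBOURS_8
    rows = len(image)
    cols = len(image[0]) if rows else 0
    labels = [[0] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or labels[r][c]:
                continue  # background, or already part of a component
            count += 1
            labels[r][c] = count
            queue = deque([(r, c)])
            while queue:  # breadth-first flood fill from the seed pixel
                y, x = queue.popleft()
                for dy, dx in offsets:
                    ny, nx = y + dy, x + dx
                    inside = 0 <= ny < rows and 0 <= nx < cols
                    if inside and image[ny][nx] == 1 and not labels[ny][nx]:
                        labels[ny][nx] = count
                        queue.append((ny, nx))
    return labels, count


def bounding_boxes(labels, count):
    """Return {component id: (top, left, bottom, right)}, inclusive bounds."""
    boxes = {}
    for r, row in enumerate(labels):
        for c, lab in enumerate(row):
            if lab == 0:
                continue
            if lab not in boxes:
                boxes[lab] = (r, c, r, c)
            else:
                top, left, bottom, right = boxes[lab]
                boxes[lab] = (min(top, r), min(left, c),
                              max(bottom, r), max(right, c))
    assert len(boxes) == count
    return boxes


# Toy binary image: the pixels at (2, 2) and (3, 1) touch only diagonally.
toy = [
    [1, 1, 0, 0, 0],
    [1, 0, 0, 0, 1],
    [0, 0, 1, 0, 1],
    [0, 1, 0, 0, 0],
]

labels8, n8 = label_components(toy, connectivity=8)
labels4, n4 = label_components(toy, connectivity=4)
print(n8, n4)
print(bounding_boxes(labels8, n8))
```

On the toy image, the diagonally touching pixels merge into one component under 8-connectivity and stay separate under 4-connectivity, which is why the choice of connectivity changes how many glyph candidates a segmenter produces.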
Pages: 10