A deep learning model for Ottoman OCR

被引:5
|
作者
Dolek, Ishak [1 ]
Kurt, Atakan [1 ]
机构
[1] Istanbul Univ Cerrahpasa, Engn Sch, Comp Engn Dept, Istanbul, Turkey
来源
关键词
CNN; CTC; deep neural networks; LSTM; OCR; Ottoman; printed naksh font; RNN; NEURAL-NETWORK; RECOGNITION; SEGMENTATION; RETRIEVAL;
D O I
10.1002/cpe.6937
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The Ottoman OCR is an open problem because the OCR models for Arabic do not perform well on Ottoman. The models specifically trained with Ottoman documents have not produced satisfactory results either. We present a deep learning model and an OCR tool using that model for the OCR of printed Ottoman documents in the naksh font. We propose an end-to-end trainable CRNN architecture consisting of CNN, RNN (LSTM), and CTC layers for the Ottoman OCR problem. An experimental comparison of this model, called , with the Tesseract Arabic, the Tesseract Persian, Abby Finereader, Miletos, and Google Docs OCR tools or models was performed using a test data set of 21 pages of original documents. With 88.86% raw text, 96.12% normalized text, and 97.37% joined text character recognition accuracy, the Hybrid model outperforms the others with a marked difference. Our model outperforms the next best model by a clear margin of 4% which is a significant improvement considering the difficulty of the Ottoman OCR problem, and the huge size of the Ottoman archives to be processed. The hybrid model also achieves 58% word recognition accuracy on normalized text which is the only rate above 50%.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] A Hybrid Deep Learning Model for Trash Classification Based on Deep Trasnsfer Learning
    Yuan, Zhen
    Liu, Jinfeng
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2022, 2022
  • [42] A Hybrid Deep Learning Model for Trash Classification Based on Deep Trasnsfer Learning
    Yuan, Zhen
    Liu, Jinfeng
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2022, 2022
  • [43] Islamic learning in Arabic-Afrikaans between Malay model and Ottoman reform
    Versteegh, Kees
    WACANA-JURNAL ILMU PENGETAHUAN BUDAYA-JOURNAL OF THE HUMANITIES OF INDONESIA, 2015, 16 (02): : 284 - 303
  • [44] Simplifying OCR neural networks with oracle learning
    Menke, J
    Martinez, T
    SCIMA 2003: IEEE INTERNATIONAL WORKSHOP ON SOFT COMPUTING TECHNIQUES IN INSTRUMENTATION, MEASUREMENT AND RELATED APPLICATIONS, 2003, : 6 - 13
  • [45] Learning to navigate a crystallization model with Deep Reinforcement Learning
    Manee, Vidhyadhar
    Baratti, Roberto
    Romagnoli, Jose A.
    CHEMICAL ENGINEERING RESEARCH & DESIGN, 2022, 178 : 111 - 123
  • [46] Using Transfer Learning for a Deep Learning Model Observer
    Murphy, W.
    Elangovan, P.
    Halling-Brown, M.
    Lewis, E.
    Young, K. C.
    Dance, D. R.
    Wells, K.
    MEDICAL IMAGING 2019: IMAGE PERCEPTION, OBSERVER PERFORMANCE, AND TECHNOLOGY ASSESSMENT, 2019, 10952
  • [47] Deep Learning application to model learning in cognitive robotics
    Rodriguez-Jimenez, Ariel
    Arias-Mendez, Esteban
    Bellas-Bouza, Francisco
    Becerra-Permuy, Jose
    TECNOLOGIA EN MARCHA, 2020, 33 : 92 - 104
  • [48] An evolutive OCR system based on continuous learning
    Lebourgeois, F
    Henry, JL
    THIRD IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION - WACV '96, PROCEEDINGS, 1996, : 272 - 277
  • [49] Comparative Analysis of Machine learning algorithms in OCR
    Jain, Vanita
    Dubey, Arun
    Gupta, Amit
    Sharma, Sanchit
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 1089 - 1092
  • [50] MMU-OCR-21: Towards End-to-End Urdu Text Recognition Using Deep Learning
    Nasir, Tayyab
    Malik, Muhammad Kamran
    Shahzad, Khurram
    IEEE ACCESS, 2021, 9 : 124945 - 124962