Recognition of Greek Polytonic on Historical Degraded Texts using HMMs

被引:3
|
作者
Katsouros, Vassilis [1 ]
Papavassiliou, Vassilis [1 ]
Simistira, Fotini [1 ,3 ]
Gatos, Basilis [2 ]
机构
[1] Athena Res & Innovat Ctr, Inst Language & Speech Proc, Athens, Greece
[2] Natl Ctr Sci Res Demokritos, Computat Intelligence Lab, Inst Informat & Telecommun, Athens, Greece
[3] Univ Fribourg, DIVA Res Grp, CH-1700 Fribourg, Switzerland
关键词
Hidden Markov Models; Optical Character Recognition; Greek polytonic;
D O I
10.1109/DAS.2016.60
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optical Character Recognition (OCR) of ancient Greek polytonic scripts is a challenging task due to the large number of character classes, resulting from variations of diacritical marks on the vowel letters. Classical OCR systems require a character segmentation phase, which in the case of Greek polytonic scripts is the main source of errors that finally affects the overall OCR performance. This paper suggests a character segmentation free HMM-based recognition system and compares its performance with other commercial, open source, and state-of-the art OCR systems. The evaluation has been carried out on a challenging novel dataset of Greek polytonic degraded texts and has shown that HMM-based OCR yields character and word level error rates of 8.61% and 25.30% respectively, which outperforms most of the available OCR systems and it is comparable with the performance of the state-of-the-art system based on LSTM Networks proposed recently.
引用
收藏
页码:346 / 351
页数:6
相关论文
共 50 条
  • [41] New Testament Greek in texts.: An instructional method for koine Greek using texts drawn from the New Testament
    Bussières, MP
    LAVAL THEOLOGIQUE ET PHILOSOPHIQUE, 2001, 57 (02): : 339 - 340
  • [42] Efficient speech recognition using subvector quantization and discrete-mixture HMMS
    Digalakis, V
    Tsakalidis, S
    Harizakis, C
    Neumeyer, L
    COMPUTER SPEECH AND LANGUAGE, 2000, 14 (01): : 33 - 46
  • [43] An improved Arabic Handwritten Recognition System using Embedded Training based on HMMs
    Amrouch, Mustapha
    Rabi, Mouhcine
    Mammass, Driss
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [44] Human Activity Recognition Based on Acceleration Data From Smartphones Using HMMs
    Iloga, Sylvain
    Bordat, Alexandre
    Le Kernec, Julien
    Romain, Olivier
    IEEE ACCESS, 2021, 9 : 139336 - 139351
  • [45] Using Alliteration in Authorship Attribution of Historical Texts
    Ivanov, Lubomir
    TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 239 - 248
  • [46] Pattern recognition methods for advanced stochastic protein sequence analysis using HMMs
    Ploetz, Thomas
    Fink, Gernot A.
    PATTERN RECOGNITION, 2006, 39 (12) : 2267 - 2280
  • [47] Using HMMs and Depth Information for Signer-Independent Sign Language Recognition
    Wu, Yeh-Kuang
    Wang, Hui-Chun
    Chang, Liung-Chun
    Li, Ke-Chun
    MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, 2013, 8271 : 79 - 86
  • [48] Automatic facial expression recognition using facial animation parameters and multistream HMMs
    Aleksic, Petar S.
    Katsaggelos, Aggelos K.
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2006, 1 (01) : 3 - 11
  • [49] CONTINUOUS SPEECH RECOGNITION USING A DEPENDENCY GRAMMAR AND PHONEME-BASED HMMS
    MATSUNAGA, S
    HOMMA, S
    SAGAYAMA, S
    FURUI, S
    IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07): : 1826 - 1833
  • [50] STARS: Sign tracking and recognition system using input-output HMMs
    Keskin, C.
    Akarun, L.
    PATTERN RECOGNITION LETTERS, 2009, 30 (12) : 1086 - 1095