Recognition of Greek Polytonic on Historical Degraded Texts using HMMs

被引:3
|
作者
Katsouros, Vassilis [1 ]
Papavassiliou, Vassilis [1 ]
Simistira, Fotini [1 ,3 ]
Gatos, Basilis [2 ]
机构
[1] Athena Res & Innovat Ctr, Inst Language & Speech Proc, Athens, Greece
[2] Natl Ctr Sci Res Demokritos, Computat Intelligence Lab, Inst Informat & Telecommun, Athens, Greece
[3] Univ Fribourg, DIVA Res Grp, CH-1700 Fribourg, Switzerland
关键词
Hidden Markov Models; Optical Character Recognition; Greek polytonic;
D O I
10.1109/DAS.2016.60
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optical Character Recognition (OCR) of ancient Greek polytonic scripts is a challenging task due to the large number of character classes, resulting from variations of diacritical marks on the vowel letters. Classical OCR systems require a character segmentation phase, which in the case of Greek polytonic scripts is the main source of errors that finally affects the overall OCR performance. This paper suggests a character segmentation free HMM-based recognition system and compares its performance with other commercial, open source, and state-of-the art OCR systems. The evaluation has been carried out on a challenging novel dataset of Greek polytonic degraded texts and has shown that HMM-based OCR yields character and word level error rates of 8.61% and 25.30% respectively, which outperforms most of the available OCR systems and it is comparable with the performance of the state-of-the-art system based on LSTM Networks proposed recently.
引用
收藏
页码:346 / 351
页数:6
相关论文
共 50 条
  • [31] In-ear microphone speech data recognition using HMMs
    Kurcan, R. S.
    Fargues, M. P.
    Vaidyanathan, R.
    2006 IEEE 12TH DIGITAL SIGNAL PROCESSING WORKSHOP & 4TH IEEE SIGNAL PROCESSING EDUCATION WORKSHOP, VOLS 1 AND 2, 2006, : 268 - 272
  • [32] Noisy Speech Recognition by using Output Combination of Discrete-Mixture HMMs and Continuous-Mixture HMMs
    Kosaka, Tetsuo
    Saito, You
    Kato, Masaharu
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2355 - 2358
  • [33] Named entity recognition in greek texts with an ensemble of SVMS and active learning
    Lucarelli, Giorgio
    Vasilakos, Xenofon
    Androutsopoulos, Ion
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2007, 16 (06) : 1015 - 1045
  • [34] Adaptive shape prior for recognition and variational segmentation of degraded historical characters
    Bar-Yosef, Itay
    Mokeichev, Alik
    Kedem, Klara
    Dinstein, Itshak
    Ehrlich, Uri
    PATTERN RECOGNITION, 2009, 42 (12) : 3348 - 3354
  • [35] Arabic handwritten word recognition using HMMs with explicit state duration
    Benouareth, A.
    Ennaji, A.
    Sellami, M.
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2008, 2008 (1)
  • [36] A new face recognition system - Using HMMs along with SVD coefficients
    Davari, Pooya
    Naimi, Hossein Miar
    VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2008, : 200 - 205
  • [37] Robust recognition and segmentation of human actions using HMMs with missing observations
    Peursum, P
    Bui, HH
    Venkatesh, S
    West, G
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (13) : 2110 - 2126
  • [38] Robust Recognition and Segmentation of Human Actions Using HMMs with Missing Observations
    Patrick Peursum
    Hung H. Bui
    Svetha Venkatesh
    Geoff West
    EURASIP Journal on Advances in Signal Processing, 2005
  • [39] ONLINE ARABIC HANDWRITING RECOGNITION USING CONTINUOUS GAUSSIAN MIXTURE HMMS
    Al-Habian, Ghaleb
    Assaleh, Khaled
    ICIAS 2007: INTERNATIONAL CONFERENCE ON INTELLIGENT & ADVANCED SYSTEMS, VOLS 1-3, PROCEEDINGS, 2007, : 1183 - 1186
  • [40] Arabic Handwritten Word Recognition Using HMMs with Explicit State Duration
    A. Benouareth
    A. Ennaji
    M. Sellami
    EURASIP Journal on Advances in Signal Processing, 2008