Recognition of Greek Polytonic on Historical Degraded Texts using HMMs

Cited by: 3
Authors
Katsouros, Vassilis [1]
Papavassiliou, Vassilis [1]
Simistira, Fotini [1,3]
Gatos, Basilis [2]
Affiliations
[1] Athena Res & Innovat Ctr, Inst Language & Speech Proc, Athens, Greece
[2] Natl Ctr Sci Res Demokritos, Computat Intelligence Lab, Inst Informat & Telecommun, Athens, Greece
[3] Univ Fribourg, DIVA Res Grp, CH-1700 Fribourg, Switzerland
Keywords
Hidden Markov Models; Optical Character Recognition; Greek polytonic;
DOI
10.1109/DAS.2016.60
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Optical Character Recognition (OCR) of ancient Greek polytonic scripts is a challenging task due to the large number of character classes that result from variations of diacritical marks on the vowel letters. Classical OCR systems require a character segmentation phase, which for Greek polytonic scripts is the main source of errors and ultimately degrades overall OCR performance. This paper proposes a character-segmentation-free HMM-based recognition system and compares its performance with other commercial, open-source, and state-of-the-art OCR systems. The evaluation was carried out on a challenging novel dataset of degraded Greek polytonic texts and showed that HMM-based OCR yields character- and word-level error rates of 8.61% and 25.30% respectively, outperforming most of the available OCR systems and performing comparably to the recently proposed state-of-the-art system based on LSTM networks.
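The record does not say how the character- and word-level error rates were computed. A common convention, assumed here, is the Levenshtein edit distance between the reference transcription and the OCR output, divided by the reference length. The sketch below follows that convention; the function names (`edit_distance`, `cer`, `wer`) and the choice of NFC normalization are illustrative assumptions, not details taken from the paper.

```python
import unicodedata


def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or token lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def _normalize(text):
    # Assumption: compare precomposed (NFC) forms, so that a vowel plus its
    # diacritics counts as one character class wherever Unicode provides one.
    return unicodedata.normalize("NFC", text)


def cer(reference, hypothesis):
    """Character error rate: edit distance over reference length."""
    ref, hyp = _normalize(reference), _normalize(hypothesis)
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


def wer(reference, hypothesis):
    """Word error rate on whitespace-separated tokens."""
    ref = _normalize(reference).split()
    hyp = _normalize(hypothesis).split()
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)
```

Under this definition a single wrong diacritic costs one character error but invalidates the whole word, which is one reason a word-level rate is typically much higher than the character-level rate on the same output.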
Pages: 346-351
Page count: 6
Related papers
50 in total
  • [1] Recognition of Historical Greek Polytonic Scripts Using LSTM Networks
    Simistira, Fotini
    Ul-Hassan, Adnan
    Papavassiliou, Vassilis
    Gatos, Basilis
    Katsouros, Vassilis
    Liwicki, Marcus
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 766 - 770
  • [2] Using Attributes for Word Spotting and Recognition in Polytonic Greek Documents
    Sfikas, Giorgos
    Giotis, Angelos P.
    Louloudis, Georgios
    Gatos, Basilis
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 686 - 690
  • [3] Offline recognition of unconstrained handwritten texts using HMMs and statistical language models
    Vinciarelli, A
    Bengio, S
    Bunke, H
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (06) : 709 - 720
  • [4] NUMERAL CORRUPTION IN GREEK HISTORICAL TEXTS
    DEVELIN, R
    PHOENIX-THE JOURNAL OF THE CLASSICAL ASSOCIATION OF CANADA, 1990, 44 (01) : 31 - 45
  • [5] Head gesture recognition using HMMs
    Choi, HI
    Rhee, PK
    EXPERT SYSTEMS WITH APPLICATIONS, 1999, 17 (03) : 213 - 221
  • [6] Head gesture recognition using HMMs
    Kim, SH
    Choi, HI
    Rhee, PK
    CISST'98: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON IMAGING SCIENCE, SYSTEMS AND TECHNOLOGY, 1998, : 9 - 16
  • [7] Soccer highlights detection and recognition using HMMs
    Assfalg, J
    Bertini, M
    Del Bimbo, A
    Nunziati, W
    Pala, P
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : 825 - 828
  • [8] Handwritten Digit Recognition using DCT and HMMs
    Ali, Syed Salman
    Ghani, Muhammad Usman
    PROCEEDINGS OF 2014 12TH INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY, 2014, : 303 - 306
  • [9] Sports event recognition using layered HMMs
    Barnard, M
    Odobez, JM
    2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, 2005, : 1151 - 1154
  • [10] Named Entity Recognition for Digitised Historical Texts
    Grover, Claire
    Givon, Sharon
    Tobin, Richard
    Ball, Julian
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1343 - 1346