Recognition of Greek Polytonic on Historical Degraded Texts using HMMs

Cited by: 3

Authors:

Katsouros, Vassilis ^{[1]}

Papavassiliou, Vassilis ^{[1]}

Simistira, Fotini ^{[1,3]}

Gatos, Basilis ^{[2]}

Affiliations:

[1] Athena Res & Innovat Ctr, Inst Language & Speech Proc, Athens, Greece

[2] Natl Ctr Sci Res Demokritos, Computat Intelligence Lab, Inst Informat & Telecommun, Athens, Greece

[3] Univ Fribourg, DIVA Res Grp, CH-1700 Fribourg, Switzerland

Source:

PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2016) | 2016

Keywords:

Hidden Markov Models; Optical Character Recognition; Greek polytonic

DOI:

10.1109/DAS.2016.60

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Optical Character Recognition (OCR) of ancient Greek polytonic scripts is a challenging task due to the large number of character classes that result from variations of diacritical marks on the vowel letters. Classical OCR systems require a character segmentation phase, which in the case of Greek polytonic scripts is the main source of errors and ultimately degrades overall OCR performance. This paper proposes a character-segmentation-free HMM-based recognition system and compares its performance with commercial, open-source, and state-of-the-art OCR systems. The evaluation was carried out on a challenging novel dataset of degraded Greek polytonic texts and shows that HMM-based OCR yields character- and word-level error rates of 8.61% and 25.30%, respectively. This outperforms most of the available OCR systems and is comparable to the recently proposed state-of-the-art system based on LSTM networks.
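The record does not say how the character- and word-level error rates were computed. A common convention, assumed here, is the Levenshtein edit distance between the recognized text and the ground truth, divided by the ground-truth length. The minimal sketch below follows that convention; the function names and the sample strings are hypothetical and are not taken from the paper. Both strings are normalized to Unicode NFC first, so that a vowel with its diacritics is counted as one character where a precomposed code point exists.

```python
import unicodedata


def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (characters or words)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(ref_units, hyp_units):
    """Edit distance divided by the reference length (assumed convention)."""
    if not ref_units:
        raise ValueError("reference must not be empty")
    return edit_distance(ref_units, hyp_units) / len(ref_units)


def cer(reference, hypothesis):
    """Character error rate after NFC normalization."""
    ref = unicodedata.normalize("NFC", reference)
    hyp = unicodedata.normalize("NFC", hypothesis)
    return error_rate(list(ref), list(hyp))


def wer(reference, hypothesis):
    """Word error rate on whitespace-separated tokens after NFC normalization."""
    ref = unicodedata.normalize("NFC", reference).split()
    hyp = unicodedata.normalize("NFC", hypothesis).split()
    return error_rate(ref, hyp)


if __name__ == "__main__":
    # Hypothetical ground truth and OCR output differing in one diacritic.
    truth = "ἐν ἀρχῇ ἦν ὁ λόγος"
    ocr = "ἐν ἀρχῆ ἦν ὁ λόγος"
    print(f"CER = {cer(truth, ocr):.4f}, WER = {wer(truth, ocr):.4f}")
```

Under this convention a single wrong diacritic costs one character but a whole word, which is one reason the word-level rate is usually well above the character-level rate.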


Pages: 346 - 351

Page count: 6
