A Revised Comparison of Polish Taggers in the Application for Automatic Speech Recognition

Cited by: 0

Authors:

Smywinski-Pohl, Aleksander [1,2,3]

Ziolko, Bartosz [2,3]

Affiliations:

[1] Jagiellonian Univ, Fac Management & Social Commun, Krakow, Poland

[2] AGH Univ Sci & Technol, Fac Comp Sci Elect & Telecommun, Krakow, Poland

[3] Techmo, Krakow, Poland

Source:

HUMAN LANGUAGE TECHNOLOGY: CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS | 2016, Vol. 9561

Keywords:

Morphosyntactic tagger; Polish; Automatic speech recognition; Language model; MODELS;

DOI:

10.1007/978-3-319-43808-5_6

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper we investigate the performance of Polish taggers in the context of automatic speech recognition (ASR). (This is a revised and extended version of the article "A Comparison of Polish Taggers in the Application for Automatic Speech Recognition", which appeared in the Proceedings of the Language and Tools Conference, Poznan, 2013.) We use a morphosyntactic language model to improve speech recognition in an ASR system and seek the best Polish tagger for our needs. Polish is an inflectional language, so an n-gram model using morphosyntactic features, which reduces data sparsity, seems to be a good choice. We investigate the differences between morphosyntactic taggers in that context, comparing the tagging results with respect to the reduction of word error rate as well as tagging speed. As it turns out, taggers using conditional random field (CRF) models currently perform best in the context of ASR. A broader audience might also be interested in the other features of the taggers discussed here, such as ease of installation and use, which are usually not covered in the papers describing such systems.


Pages: 68-81

Page count: 14

50 records in total

[41] Studies on inter-speaker variability in speech and its application in automatic speech recognition
S UMESH
Sadhana, 2011, 36 : 853 - 883
[42] Studies on inter-speaker variability in speech and its application in automatic speech recognition
Umesh, S.
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2011, 36 (05): : 853 - 883
[43] Using speech synthesis to explain automatic speaker recognition: a new application of synthetic speech
Brown, Georgina
Kirchhubel, Christin
Cuthbert, Ramiz
INTERSPEECH 2023, 2023, : 4723 - 4727
[44] ALGERIAN ARABIC SPEECH DATABASE (ALGASD): CORPUS DESIGN AND AUTOMATIC SPEECH RECOGNITION APPLICATION
Droua-Hamdani, Ghania
Selouani, Sid Ahmed
Boudraa, Malika
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2010, 35 (2C): : 157 - 166
[45] Automatic Speech Recognition System for Malay Speaking Children Automatic Speech Recognition system
Rahman, Feisal Dani
Mohamed, Noraini
Mustafa, Mumtaz Begum
Salim, Siti Salwah
2014 THIRD ICT INTERNATIONAL STUDENT PROJECT CONFERENCE (ICT-ISPC), 2014, : 79 - 82
[46] AN APPROACH TO THE AUTOMATIC RECOGNITION OF SPEECH
PAY, BE
EVANS, CR
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1981, 14 (01): : 13 - 27
[47] PROSPECTS FOR AUTOMATIC RECOGNITION OF SPEECH
HOUDE, R
AMERICAN ANNALS OF THE DEAF, 1979, 124 (05) : 568 - 572
[48] Automatic speech recognition systems
Catariov, A
Information Technologies 2004, 2004, 5822 : 83 - 93
[49] Automatic speech recognition: A review
Haton, JP
ENTERPRISE INFORMATION SYSTEMS V, 2004, : 6 - 11
[50] FORMANTS IN AUTOMATIC SPEECH RECOGNITION
BROAD, DJ
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1972, 4 (04): : 411 - 424
