A Revised Comparison of Polish Taggers in the Application for Automatic Speech Recognition

Authors
Smywinski-Pohl, Aleksander [1, 2, 3]
Ziolko, Bartosz [2, 3]
Affiliations
[1] Jagiellonian Univ, Fac Management & Social Commun, Krakow, Poland
[2] AGH Univ Sci & Technol, Fac Comp Sci Elect & Telecommun, Krakow, Poland
[3] Techmo, Krakow, Poland
Keywords
Morphosyntactic tagger; Polish; Automatic speech recognition; Language model; Models
DOI
10.1007/978-3-319-43808-5_6
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we investigate the performance of Polish taggers in the context of automatic speech recognition (ASR). We use a morphosyntactic language model to improve speech recognition in an ASR system and seek the best Polish tagger for our needs. Polish is an inflectional language, so an n-gram model built on morphosyntactic features, which reduces data sparsity, seems to be a good choice. We investigate how the morphosyntactic taggers differ in that context, comparing the tagging results with respect to the reduction of word error rate as well as tagging speed. As it turns out, taggers using conditional random field (CRF) models currently perform best in the context of ASR. A broader audience might also be interested in the other features of the taggers discussed here, such as ease of installation and use, which are usually not covered in the papers describing such systems. (This is a revised and extended version of the article "A Comparison of Polish Taggers in the Application for Automatic Speech Recognition", which appeared in the Proceedings of Language and Tools Conference, Poznan, 2013.)
Pages: 68-81
Page count: 14