A Revised Comparison of Polish Taggers in the Application for Automatic Speech Recognition

被引:0
|
作者
Smywinski-Pohl, Aleksander [1 ,2 ,3 ]
Ziolko, Bartosz [2 ,3 ]
机构
[1] Jagiellonian Univ, Fac Management & Social Commun, Krakow, Poland
[2] AGH Univ Sci & Technol, Fac Comp Sci Elect & Telecommun, Krakow, Poland
[3] Techmo, Krakow, Poland
关键词
Morphosyntactic tagger; Polish; Automatic speech recognition; Language model; MODELS;
D O I
10.1007/978-3-319-43808-5-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper (This is a revised and extended version of the article A Comparison of Polish Taggers in the Application for Automatic Speech Recognition that appeared in the Proceedings of Language and Tools Conference, Poznan, 2013.) we investigate the performance of Polish taggers in the context of automatic speech recognition (ASR). We use a morphosyntactic language model to improve speech recognition in an ASR system and seek the best Polish tagger for our needs. Polish is an inflectional language and an n-gram model using morphosyntactic features, which reduces data sparsity seems to be a good choice. We investigate the difference between the morphosyntactic taggers in that context. We compare the results of tagging with respect to the reduction of word error rate as well as speed of tagging. As it turns out at present the taggers using conditional random fields (CRF) models perform the best in the context of ASR. A broader audience might be also interested in the other discussed features of the taggers such as easiness of installation and usage, which are usually not covered in the papers describing such systems.
引用
收藏
页码:68 / 81
页数:14
相关论文
共 50 条
  • [1] Automatic Recognition of Emotional State in Polish Speech
    Staroniewicz, Piotr
    [J]. TOWARD AUTONOMOUS, ADAPTIVE, AND CONTEXT-AWARE MULTIMODAL INTERFACES: THEORETICAL AND PRACTICAL ISSUES, 2011, 6456 : 347 - 353
  • [2] Automatic Speech Recognition System Dedicated for Polish
    Ziolko, Mariusz
    Galka, Jakub
    Ziolko, Bartosz
    Jadczyk, Tomasz
    Skurzok, Dawid
    Masior, Mariusz
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3322 - 3323
  • [3] Recognition of Emotional State in Polish Speech - Comparison between Human and Automatic Efficiency
    Staroniewicz, Piotr
    [J]. BIOMETRIC ID MANAGEMENT AND MULTIMODAL COMMUNICATION, PROCEEDINGS, 2009, 5707 : 33 - 40
  • [4] Recognition quality improvement in Automatic Speech Recognition system for Polish
    Wydra, Sebastian
    [J]. EUROCON 2007: THE INTERNATIONAL CONFERENCE ON COMPUTER AS A TOOL, VOLS 1-6, 2007, : 1693 - 1698
  • [5] AN APPLICATION OF AUTOMATIC SPEECH RECOGNITION
    HENTHORN, KS
    MACCORMACK, PJ
    [J]. JOURNAL OF MICROCOMPUTER APPLICATIONS, 1982, 5 (03): : 239 - 245
  • [6] Skrybot - A System for Automatic Speech Recognition of Polish Language
    Pawlaczyk, Leslaw
    Bosky, Pawel
    [J]. MAN-MACHINE INTERACTIONS, 2009, 59 : 381 - +
  • [7] Application of Morphosyntactic and Class-Based Language Models in Automatic Speech Recognition of Polish
    Smywinski-Pohl, Alexsander
    Ziolko, Bartosz
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2016, 25 (02)
  • [8] Polish Language Modelling for Speech Recognition Application
    Klosowski, Piotr
    [J]. 2017 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA 2017), 2017, : 313 - 318
  • [9] AUTOMATIC SPEECH RECOGNITION AND ITS APPLICATION
    BRUNDAGE, WJ
    [J]. CONTROL ENGINEERING, 1983, 30 (04) : 117 - 117
  • [10] DUAL APPLICATION OF SPEECH ENHANCEMENT FOR AUTOMATIC SPEECH RECOGNITION
    Pandey, Ashutosh
    Liu, Chunxi
    Wang, Yun
    Saraf, Yatharth
    [J]. 2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 223 - 228