Parliament Archives Used for Automatic Training of Multi-lingual Automatic Speech Recognition Systems

Cited by: 1
Authors
Nouza, Jan [1]
Safarik, Radek [1]
Affiliations
[1] Tech Univ Liberec, Inst Informat Technol & Elect, Studentska 2, Liberec 46117, Czech Republic
Source
Keywords
Speech recognition; Cross-lingual bootstrapping; Parliament speech; Transcription
DOI
10.1007/978-3-319-64206-2_20
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we present a fully automated process for creating the speech databases needed to train acoustic models for speech recognition systems. We show that archives of national parliaments are perfect sources of speech and text data suited for a lightly supervised training scheme that requires no human intervention. We describe the process and its procedures in detail and demonstrate its use on three Slavic languages (Polish, Russian, and Bulgarian). A practical evaluation on a broadcast news task yields better results than those obtained with some established speech databases.
Pages: 174-182
Number of pages: 9
Related papers
50 in total
  • [1] Multi-lingual Transformer Training for Khmer Automatic Speech Recognition
    Soky, Kak
    Li, Sheng
    Kawahara, Tatsuya
    Seng, Sopheap
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1893 - 1896
  • [2] Automatic Multi-lingual Script Recognition Application
    Abu-Ain, Waleed Abdel Karim
    Abdullah, Siti Norul Huda Sheikh
    Omar, Khairuddin
    Abd Rahman, Siti Zaharah
    [J]. GEMA ONLINE JOURNAL OF LANGUAGE STUDIES, 2018, 18 (03): : 203 - 221
  • [3] Automatic Language Identification Using Speech Rhythm Features for Multi-Lingual Speech Recognition
    Kim, Hwamin
    Park, Jeong-Sik
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (07):
  • [4] Dataset and Evaluation of Automatic Speech Recognition for Multi-lingual Intent Recognition on Social Robots
    Andriella, Antonio
    Ros, Raquel
    Ellinson, Yoav
    Gannot, Sharon
    Lemaignan, Severin
    [J]. PROCEEDINGS OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024, 2024, : 865 - 869
  • [5] Automatic segmentation and labelling of multi-lingual speech data
    Vorstermans, A
    Martens, JP
    VanCoile, B
    [J]. SPEECH COMMUNICATION, 1996, 19 (04) : 271 - 293
  • [6] Automatic learning of numeral grammars for multi-lingual speech synthesizers
    Flach, G
    Holzapfel, M
    Just, C
    Wachtler, A
    Wolff, M
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1291 - 1294
  • [7] An automatic machine translation system for multi-lingual speech to Indian sign language
    Dhanjal, Amandeep Singh
    Singh, Williamjeet
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (03) : 4283 - 4321
  • [8] An automatic machine translation system for multi-lingual speech to Indian sign language
    Amandeep Singh Dhanjal
    Williamjeet Singh
    [J]. Multimedia Tools and Applications, 2022, 81 : 4283 - 4321
  • [9] Mobile multi-lingual automatic interpretation system Mobilingual
    Anon
    [J]. 2002, Hitachi Ltd.
  • [10] MAKED: Multi-lingual Automatic Keyword Extraction Dataset
    Verma, Yash
    Jangra, Anubhav
    Saha, Sriparna
    Jatowt, Adam
    Roy, Dwaipayan
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6170 - 6179