Parliament Archives Used for Automatic Training of Multi-lingual Automatic Speech Recognition Systems

Cited by: 1

Authors:

Nouza, Jan [1]

Safarik, Radek [1]

Affiliation:

[1] Tech Univ Liberec, Inst Informat Technol & Elect, Studentska 2, Liberec 46117, Czech Republic

Source:

TEXT, SPEECH, AND DIALOGUE, TSD 2017 | 2017 / Vol. 10415

Keywords:

Speech recognition; Cross-lingual bootstrapping; Parliament speech; Transcription

DOI:

10.1007/978-3-319-64206-2_20

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In this paper, we present a fully automated process capable of creating the speech databases needed to train acoustic models for speech recognition systems. We show that archives of national parliaments are perfect sources of speech and text data suited for a lightly supervised training scheme, which does not require human intervention. We describe the process and its procedures in detail and demonstrate its use on three Slavic languages (Polish, Russian and Bulgarian). Practical evaluation is done on a broadcast news task and yields better results than those obtained on some established speech databases.

Pages: 174-182

Page count: 9