Using prompts to produce quality corpus for training automatic speech recognition systems

被引：0

作者：

Lecouteux, Benjamin ^{[1
]}

Linares, Georges ^{[1
]}

机构：

[1] Univ Avignon, LIA, Avignon, France

来源：

2008 IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, VOLS 1 AND 2 | 2008年

关键词：

speech recognition; closed captioning; corpus building; automatic segmentation;

D O I：

暂无

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

In this paper we present an integrated unsupervised method to produce a quality corpus for training automatic speech recognition system (ASR) using prompts or closed captions. Closed captions and prompts do not always have timestamps and do not necessarily correspond to the exact speech. We propose a method allowing to extract quality corpus from imperfect transcript. The proposed approach works in two steps. During the search, the ASR system finds matching segments in a large prompt database. Matching segments are then used inside a Driven Decoding Algorithm (DDA) to produce a high quality corpus. Results show a F-measure of 96% in term of spotting while the DDA corrects the output according to the prompts: a high quality corpus is easily extracted. (1)

引用

页码：820 / 825

页数：6

共 50 条

[1] Corpus for automatic speech recognition
Adda-Decker, Martine
REVUE FRANCAISE DE LINGUISTIQUE APPLIQUEE, 2007, 12 (01): : 71 - 84
[2] Using Automatic Speech Recognition in Spoken Corpus Curation
Gorisch, Jan
Gref, Michael
Schmidt, Thomas
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6423 - 6428
[3] Validation of Speech Data for Training Automatic Speech Recognition Systems
Krizaj, Janes
Gros, Jerneja Zganec
Dobrisek, Simon
2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1165 - 1169
[4] Creation of Marathi Speech Corpus for Automatic Speech Recognition
Gaikwad, Santosh
Gawali, Bharti
Mehrotra, Suresh
2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
[5] The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic Speech Recognition
Mukiibi, Jonathan
Katumba, Andrew
Nakatumba-Nabende, Joyce
Hussein, Ali
Meyer, Josh
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1945 - 1954
[6] Multimodal English corpus for automatic speech recognition
Kunka, Bartosz
Kupryjanow, Adam
Dalka, Piotr
Bratoszewski, Piotr
Szczodrak, Maciej
Spaleniak, Pawel
Szykulski, Marcin
Czyzewski, Andrzej
2013 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2013, : 106 - 111
[7] CEASR: A Corpus for Evaluating Automatic Speech Recognition
Ulasik, Malgorzata Anna
Huerlimann, Manuela
Germann, Fabian
Gedik, Esin
Benites, Fernando
Cieliebak, Mark
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6477 - 6485
[8] Modern standard Arabic speech corpus for implementing and evaluating automatic continuous speech recognition systems
Abushariah, Mohammad Abd-Alrahman Mahmoud
Ainon, Raja Noor
Zainuddin, Roziati
Alqudah, Assal Ali Mustafa
Ahmed, Moustafa Elshafei
Khalifa, Othman Omran
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2012, 349 (07): : 2215 - 2242
[9] Chhattisgarhi speech corpus for research and development in automatic speech recognition
Londhe, Narendra D.
Kshirsagar, Ghanahshyam B.
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (02) : 193 - 210
[10] Bangladeshi Bangla speech corpus for automatic speech recognition research
Kibria, Shafkat
Samin, Ahnaf Mozib
Kobir, M. Humayon
Rahman, M. Shahidur
Selim, M. Reza
Iqbal, M. Zafar
SPEECH COMMUNICATION, 2022, 136 : 84 - 97

← 1 2 3 4 5 →