IMPROVING DATA SELECTION FOR LOW-RESOURCE STT AND KWS

Cited by: 0

Authors:

Fraga-Silva, Thiago [1]

Laurent, Antoine [1]

Gauvain, Jean-Luc [2]

Lamel, Lori [2]

Le, Viet-Bac [1]

Messaoudi, Abdel [1]

Affiliations:

[1] Vocapia Res, 28 Rue Jean Rostand, F-91400 Orsay, France

[2] CNRS LIMSI, Spoken Language Proc Grp, F-91405 Orsay, France

Source:

2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU) | 2015

Keywords:

data selection; low-resource languages; speech recognition; keyword spotting

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper extends recent research on training data selection for speech transcription and keyword spotting system development. Selection techniques were explored in the context of the IARPA-Babel Active Learning (AL) task for 6 languages. Different selection criteria were considered with the goal of improving over a system built using a pre-defined 3-hour training data set. Four variants of the entropy-based criterion were explored: words, triphones, phones as well as the use of HMM-states previously introduced in [4]. The influence of the number of HMM-states was assessed as well as whether automatic or manual reference transcripts were used. The combination of selection criteria was investigated, and a novel multi-stage selection method proposed. This method was also assessed using larger data sets than were permitted in the Babel AL task. Results are reported for the 6 languages. The multi-stage selection was also applied to the surprise language (Swahili) in the NIST OpenKWS 2015 evaluation.

Citation

Pages: 153-159

Page count: 7

50 records in total

[21] Effectiveness of Data Augmentation and Pretraining for Improving Neural Headline Generation in Low-Resource Settings
Martinc, Matej
Montariol, Syrielle
Pivovarova, Lidia
Zosa, Elaine
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3561 - 3570
[22] Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation
Bartelds, Martijn
San, Nay
McDonnell, Bradley
Jurafsky, Dan
Wieling, Martijn
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 715 - 729
[23] Efficient Low-Resource Compression of HIFU Data
Kleparnik, Petr
Barina, David
Zemcik, Pavel
Jaros, Jiri
INFORMATION, 2018, 9 (07)
[24] Generalized Data Augmentation for Low-Resource Translation
Xia, Mengzhou
Kong, Xiang
Anastasopoulos, Antonios
Neubig, Graham
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5786 - 5796
[25] Data Augmentation for Low-Resource Keyphrase Generation
Garg, Krishna
Chowdhury, Jishnu Ray
Caragea, Cornelia
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 8442 - 8455
[26] Improving Data Augmentation for Low-Resource NMT Guided by POS-Tagging and Paraphrase Embedding
Maimaiti, Mieradilijiang
Liu, Yang
Luan, Huanbo
Pan, Zegao
Sun, Maosong
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (06)
[27] Improving Low-resource Question Answering by Augmenting Question Information
Chen, Andong
Sun, Yuan
Zhao, Xiaobing
Esparza, Rosella P. Galindo
Chen, Kehai
Xiang, Yang
Zhao, Tiejun
Zhang, Min
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 10413 - 10420
[28] ACTIVE LEARNING FOR LOW-RESOURCE SPEECH RECOGNITION: IMPACT OF SELECTION SIZE AND LANGUAGE MODELING DATA
Syed, Ali Raza
Rosenberg, Andrew
Mandel, Michael
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5315 - 5319
[29] COMBINATION OF DATA BORROWING STRATEGIES FOR LOW-RESOURCE LVCSR
Qian, Yanmin
Yu, Kai
Liu, Jia
2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 404 - 409
[30] Data Augmentation for Low-Resource Quechua ASR Improvement
Zevallos, Rodolfo
Bel, Nuria
Cambara, Guillermo
Farrus, Mireia
Luque, Jordi
INTERSPEECH 2022, 2022, : 3518 - 3522
