IMPROVING DATA SELECTION FOR LOW-RESOURCE STT AND KWS

被引:0
|
作者
Fraga-Silva, Thiago [1 ]
Laurent, Antoine [1 ]
Gauvain, Jean-Luc [2 ]
Lamel, Lori [2 ]
Le, Viet-Bac [1 ]
Messaoudi, Abdel [1 ]
机构
[1] Vocapia Res, 28 Rue Jean Rostand, F-91400 Orsay, France
[2] CNRS LIMSI, Spoken Language Proc Grp, F-91405 Orsay, France
来源
2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU) | 2015年
关键词
data selection; low-resource languages; speech recognition; keyword spotting; SPEECH RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper extends recent research on training data selection for speech transcription and keyword spotting system development. Selection techniques were explored in the context of the IARPA-Babel Active Learning (AL) task for 6 languages. Different selection criteria were considered with the goal of improving over a system built using a pre-defined 3-hour training data set. Four variants of the entropy-based criterion were explored: words, triphones, phones as well as the use of HMM-states previously introduced in [4]. The influence of the number of HMM-states was assessed as well as whether automatic or manual reference transcripts were used. The combination of selection criteria was investigated, and a novel multi-stage selection method proposed. This method was also assessed using larger data sets than were permitted in the Babel AL task. Results are reported for the 6 languages. The multi-stage selection was also applied to the surprise language (Swahili) in the NIST OpenKWS 2015 evaluation.
引用
收藏
页码:153 / 159
页数:7
相关论文
共 50 条
  • [21] Effectiveness of Data Augmentation and Pretraining for Improving Neural Headline Generation in Low-Resource Settings
    Martinc, Matej
    Montariol, Syrielle
    Pivovarova, Lidia
    Zosa, Elaine
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3561 - 3570
  • [22] Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation
    Bartelds, Martijn
    San, Nay
    McDonnell, Bradley
    Jurafsky, Dan
    Wieling, Martijn
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 715 - 729
  • [23] Efficient Low-Resource Compression of HIFU Data
    Kleparnik, Petr
    Barina, David
    Zemcik, Pavel
    Jaros, Jiri
    INFORMATION, 2018, 9 (07)
  • [24] Generalized Data Augmentation for Low-Resource Translation
    Xia, Mengzhou
    Kong, Xiang
    Anastasopoulos, Antonios
    Neubig, Graham
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5786 - 5796
  • [25] Data Augmentation for Low-Resource Keyphrase Generation
    Garg, Krishna
    Chowdhury, Jishnu Ray
    Caragea, Cornelia
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 8442 - 8455
  • [26] Improving Data Augmentation for Low-Resource NMT Guided by POS-Tagging and Paraphrase Embedding
    Maimaiti, Mieradilijiang
    Liu, Yang
    Luan, Huanbo
    Pan, Zegao
    Sun, Maosong
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (06)
  • [27] Improving Low-resource Question Answering by Augmenting Question Information
    Chen, Andong
    Sun, Yuan
    Zhao, Xiaobing
    Esparza, Rosella P. Galindo
    Chen, Kehai
    Xiang, Yang
    Zhao, Tiejun
    Zhang, Min
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 10413 - 10420
  • [28] ACTIVE LEARNING FOR LOW-RESOURCE SPEECH RECOGNITION: IMPACT OF SELECTION SIZE AND LANGUAGE MODELING DATA
    Syed, Ali Raza
    Rosenberg, Andrew
    Mandel, Michael
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5315 - 5319
  • [29] COMBINATION OF DATA BORROWING STRATEGIES FOR LOW-RESOURCE LVCSR
    Qian, Yanmin
    Yu, Kai
    Liu, Jia
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 404 - 409
  • [30] Data Augmentation for Low-Resource Quechua ASR Improvement
    Zevallos, Rodolfo
    Bel, Nuria
    Cambara, Guillermo
    Farrus, Mireia
    Luque, Jordi
    INTERSPEECH 2022, 2022, : 3518 - 3522