IMPROVING DATA SELECTION FOR LOW-RESOURCE STT AND KWS

被引:0
|
作者
Fraga-Silva, Thiago [1 ]
Laurent, Antoine [1 ]
Gauvain, Jean-Luc [2 ]
Lamel, Lori [2 ]
Le, Viet-Bac [1 ]
Messaoudi, Abdel [1 ]
机构
[1] Vocapia Res, 28 Rue Jean Rostand, F-91400 Orsay, France
[2] CNRS LIMSI, Spoken Language Proc Grp, F-91405 Orsay, France
关键词
data selection; low-resource languages; speech recognition; keyword spotting; SPEECH RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper extends recent research on training data selection for speech transcription and keyword spotting system development. Selection techniques were explored in the context of the IARPA-Babel Active Learning (AL) task for 6 languages. Different selection criteria were considered with the goal of improving over a system built using a pre-defined 3-hour training data set. Four variants of the entropy-based criterion were explored: words, triphones, phones as well as the use of HMM-states previously introduced in [4]. The influence of the number of HMM-states was assessed as well as whether automatic or manual reference transcripts were used. The combination of selection criteria was investigated, and a novel multi-stage selection method proposed. This method was also assessed using larger data sets than were permitted in the Babel AL task. Results are reported for the 6 languages. The multi-stage selection was also applied to the surprise language (Swahili) in the NIST OpenKWS 2015 evaluation.
引用
收藏
页码:153 / 159
页数:7
相关论文
共 50 条
  • [1] Active Learning based data selection for limited resource STT and KWS
    Fraga-Silva, Thiago
    Gauvain, Jean-Luc
    Lamel, Lori
    Laurent, Antoine
    Le, Viet-Bac
    Messaoudi, Abdel
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3159 - 3163
  • [2] SEMI-SUPERVISED TRAINING IN LOW-RESOURCE ASR AND KWS
    Metze, Florian
    Gandhe, Ankur
    Miao, Yajie
    Sheikh, Zaid
    Wang, Yun
    Xu, Di
    Zhang, Hao
    Kim, Jungsuk
    Lane, Ian
    Lee, Won Kyum
    Stueker, Sebastian
    Mueller, Markus
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4699 - 4703
  • [3] SYNTHETIC DATA AUGMENTATION FOR IMPROVING LOW-RESOURCE ASR
    Thai, Bao
    Jimerson, Robert
    Arcoraci, Dominic
    Prud'hommeaux, Emily
    Ptucha, Raymond
    2019 IEEE WESTERN NEW YORK IMAGE AND SIGNAL PROCESSING WORKSHOP (WNYISPW), 2019,
  • [4] AN INVESTIGATION INTO LANGUAGE MODEL DATA AUGMENTATION FOR LOW-RESOURCED STT AND KWS
    Huang, Guangpu
    da Silva, Thiago Fraga
    Lamel, Lori
    Gauvain, Jean-Luc
    Gorin, Arseniy
    Laurent, Antoine
    Lileikyte, Rasa
    Messouadi, Abdel
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5790 - 5794
  • [5] Reliable Data Generation and Selection for Low-Resource Relation Extraction
    Yu, Junjie
    Wang, Xing
    Chen, Wenliang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19440 - 19448
  • [6] Data Augmentation, Feature Combination, and Multilingual Neural Networks to Improve ASR and KWS Performance for Low-resource Languages
    Tueske, Zoltan
    Golik, Pavel
    Nolden, David
    Schlueter, Ralf
    Ney, Hermann
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1420 - 1424
  • [7] Improving obstetric care in low-resource settings
    Hofmeyr, G. Justus
    CASE REPORTS IN WOMENS HEALTH, 2022, 36
  • [8] Improving Anesthesia Safety in Low-Resource Settings
    Khan, Fauzia A.
    Merry, Alan F.
    ANESTHESIA AND ANALGESIA, 2018, 126 (04): : 1312 - 1320
  • [9] Improving Access to Laparoscopy in Low-Resource Settings
    Rosenbaum, Alan J.
    Maine, Rebecca G.
    ANNALS OF GLOBAL HEALTH, 2019, 85 (01):
  • [10] Combining Simple but Novel Data Augmentation Methods for Improving Low-Resource ASR
    Damania, Ronit
    Homan, Christopher
    Prud'hommeaux, Emily
    INTERSPEECH 2022, 2022, : 4890 - 4894