MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION

被引:3
|
作者
Li, Xinjian [1 ]
Mortensen, David R. [1 ]
Metze, Florian [1 ]
Black, Alan W. [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
Multilingual Phonetic Dataset; Multilingual Speech Alignment; Low-Resource Speech recognition;
D O I
10.1109/ICASSP39728.2021.9413720
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Phone Recognition is one of the most important tasks in the field of multilingual speech recognition, especially for low-resource languages whose orthographies are not available. However, most speech recognition datasets so far only focus on high-resource languages, there are very few datasets available for low-resource languages, especially datasets with detailed phone annotation. In this work, we present a large multilingual phonetic dataset, which is preprocessed and aligned from the UCLA phonetic dataset. The dataset contains around 100 low-resource languages and 7000 utterances in total. This dataset would provide an ideal training/evaluation set for universal phone recognition.
引用
收藏
页码:6958 / 6962
页数:5
相关论文
共 50 条
  • [1] Multilingual Data Selection For Low Resource Speech Recognition
    Thomas, Samuel
    Audhkhasi, Kartik
    Cui, Jia
    Kingsbury, Brian
    Ramabhadran, Bhuvana
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3853 - 3857
  • [2] ADVERSARIAL MULTILINGUAL TRAINING FOR LOW-RESOURCE SPEECH RECOGNITION
    Yi, Jiangyan
    Tao, Jianhua
    Wen, Zhengqi
    Bai, Ye
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4899 - 4903
  • [3] Adaptive Activation Network for Low Resource Multilingual Speech Recognition
    Luo, Jian
    Wang, Jianzong
    Cheng, Ning
    Zheng, Zhenpeng
    Xiao, Jing
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [4] MULTILINGUAL REPRESENTATIONS FOR LOW RESOURCE SPEECH RECOGNITION AND KEYWORD SEARCH
    Cui, Jia
    Kingsbury, Brian
    Ramabhadran, Bhuvana
    Sethy, Abhinav
    Audhkhasi, Kartik
    Cui, Xiaodong
    Kislal, Ellen
    Mangu, Lidia
    Nussbaum-Thom, Markus
    Picheny, Michael
    Tueske, Zoltan
    Golik, Pavel
    Schlueter, Ralf
    Ney, Hermann
    Gales, Mark J. F.
    Knill, Kate M.
    Ragni, Anton
    Wang, Haipeng
    Woodland, Phil
    [J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 259 - 266
  • [5] Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition
    Xiao, Yubei
    Gong, Ke
    Zhou, Pan
    Zheng, Guolin
    Liang, Xiaodan
    Lin, Liang
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14112 - 14120
  • [6] Multilingual acoustic models for speech recognition in low-resource devices
    Garcia, Enrique Gil
    Mengusoglu, Erhan
    Janke, Eric
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 981 - +
  • [7] Multilingual Meta-Transfer Learning for Low-Resource Speech Recognition
    Zhou, Rui
    Koshikawa, Takaki
    Ito, Akinori
    Nose, Takashi
    Chen, Chia-Ping
    [J]. IEEE Access, 2024, 12 : 158493 - 158504
  • [8] Articulatory Feature based Multilingual MLPs for Low-Resource Speech Recognition
    Qian, Yanmin
    Liu, Jia
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2601 - 2604
  • [9] Acoustic Phonetic Decoding Oriented to Multilingual Speech Recognition in the Basque Context
    Barroso, N.
    Lopez de Ipina, K.
    Ezeiza, A.
    [J]. TRENDS IN PRACTICAL APPLICATIONS OF AGENTS AND MULTIAGENT SYSTEMS, 2010, 71 : 697 - +
  • [10] Multilingual Recurrent Neural Networks with Residual Learning for Low-Resource Speech Recognition
    Zhou, Shiyu
    Zhao, Yuanyuan
    Xu, Shuang
    Xu, Bo
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 704 - 708