Opportunities and Challenges of Automatic Speech Recognition Systems for Low-Resource Language Speakers

被引:10
|
作者
Reitmaier, Thomas [1 ]
Wallington, Electra [2 ]
Raju, Dani Kalarikalayil [3 ]
Klejch, Ondrej [2 ]
Pearson, Jennifer [1 ]
Jones, Matt [1 ]
Bell, Peter [2 ]
Robinson, Simon [1 ]
机构
[1] Swansea Univ, Swansea, W Glam, Wales
[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[3] Studio Hasi, Mumbai, Maharashtra, India
基金
英国工程与自然科学研究理事会;
关键词
Speech/language; automatic speech recognition; mobile devices;
D O I
10.1145/3491102.3517639
中图分类号
学科分类号
摘要
Automatic Speech Recognition (ASR) researchers are turning their attention towards supporting low-resource languages, such as isiXhosa or Marathi, with only limited training resources. We report and reflect on collaborative research across ASR & HCI to situate ASR-enabled technologies to suit the needs and functions of two communities of low-resource language speakers, on the outskirts of Cape Town, South Africa and in Mumbai, India. We build on long-standing community partnerships and draw on linguistics, media studies and HCI scholarship to guide our research. We demonstrate diverse design methods to: remotely engage participants; collect speech data to test ASR models; and ultimately field-test models with users. Reflecting on the research, we identify opportunities, challenges, and use-cases of ASR, in particular to support pervasive use of WhatsApp voice messaging. Finally, we uncover implications for collaborations across ASR & HCI that advance important discussions at CHI surrounding data, ethics, and AI.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Systems for Low-Resource Speech Recognition Tasks in Open Automatic Speech Recognition and Formosa Speech Recognition Challenges
    Lin, Hung-Pang
    Zhang, Yu-Jia
    Chen, Chia-Ping
    [J]. INTERSPEECH 2021, 2021, : 4339 - 4343
  • [2] Investigate Automatic Speech Recognition and Keyword Search for Very Low-Resource Language
    Ni, Chongjia
    Ma, Bin
    [J]. 2017 IEEE 2ND INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2017, : 336 - 340
  • [3] Enrollment in low-resource speech recognition systems
    Deligne, S
    Dharanipragada, S
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 341 - 344
  • [4] MIXSPEECH: DATA AUGMENTATION FOR LOW-RESOURCE AUTOMATIC SPEECH RECOGNITION
    Meng, Linghui
    Xu, Jin
    Tan, Xu
    Wang, Jindong
    Qin, Tao
    Xu, Bo
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7008 - 7012
  • [5] Language fusion via adapters for low-resource speech recognition
    Hu, Qing
    Zhang, Yan
    Zhang, Xianlei
    Han, Zongyu
    Liang, Xiuxia
    [J]. SPEECH COMMUNICATION, 2024, 158
  • [6] Automatic Transcription Challenges for Inuktitut, a Low-Resource Polysynthetic Language
    Gupta, Vishwa
    Boulianne, Gilles
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2521 - 2527
  • [7] Low-resource automatic speech recognition and error analyses of oral cancer speech
    Halpern, Bence Mark
    Feng, Siyuan
    van Son, Rob
    van den Brekel, Michiel
    Scharenborg, Odette
    [J]. SPEECH COMMUNICATION, 2022, 141 : 14 - 27
  • [8] Cross-Lingual Language Modeling for Low-Resource Speech Recognition
    Xu, Ping
    Fung, Pascale
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (06): : 1134 - 1144
  • [9] A General Procedure for Improving Language Models in Low-Resource Speech Recognition
    Liu, Qian
    Zhang, Wei-Qiang
    Liu, Jia
    Liu, Yao
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 428 - 433
  • [10] Language-Adversarial Transfer Learning for Low-Resource Speech Recognition
    Yi, Jiangyan
    Tao, Jianhua
    Wen, Zhengqi
    Bai, Ye
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (03) : 621 - 630