Automatic Speech Recognition (ASR) Systems Applied to Pronunciation Assessment of L2 Spanish for Japanese Speakers

被引:10
|
作者
Tejedor-Garcia, Cristian [1 ,2 ]
Cardenoso-Payo, Valentin [2 ]
Escudero-Mancebo, David [2 ]
机构
[1] Radboud Univ Nijmegen, Ctr Language & Speech Technol CLST, POB 9103, NL-6500 Nijmegen, Netherlands
[2] Univ Valladolid, Dept Comp Sci, ECA SIMM Res Grp, Valladolid 47002, Spain
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 15期
关键词
automatic speech recognition (ASR); automatic assessment tools; foreign language pronunciation; pronunciation training; computer-assisted pronunciation training (CAPT); automatic pronunciation assessment; learning environments; minimal pairs; ENGLISH; ERRORS;
D O I
10.3390/app11156695
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Featured Application The CAPT tool, ASR technology and procedure described in this work can be successfully applied to support typical learning paces for Spanish as a foreign language for Japanese people. With small changes, the application can be tailored to a different target L2, if the set of minimal pairs used for the discrimination, pronunciation and mixed-mode activities is adapted to the specific L1-L2 pair. General-purpose automatic speech recognition (ASR) systems have improved in quality and are being used for pronunciation assessment. However, the assessment of isolated short utterances, such as words in minimal pairs for segmental approaches, remains an important challenge, even more so for non-native speakers. In this work, we compare the performance of our own tailored ASR system (kASR) with the one of Google ASR (gASR) for the assessment of Spanish minimal pair words produced by 33 native Japanese speakers in a computer-assisted pronunciation training (CAPT) scenario. Participants in a pre/post-test training experiment spanning four weeks were split into three groups: experimental, in-classroom, and placebo. The experimental group used the CAPT tool described in the paper, which we specially designed for autonomous pronunciation training. A statistically significant improvement for the experimental and in-classroom groups was revealed, and moderate correlation values between gASR and kASR results were obtained, in addition to strong correlations between the post-test scores of both ASR systems and the CAPT application scores found at the final stages of application use. These results suggest that both ASR alternatives are valid for assessing minimal pairs in CAPT tools, in the current configuration. Discussion on possible ways to improve our system and possibilities for future research are included.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] THE SYLLABLE STRUCTURE: UNDERSTANDING SPANISH SPEAKERS PRONUNCIATION OF ENGLISH AS A L2
    Gomez Martinez, Susana
    RAEL-REVISTA ELECTRONICA DE LINGUISTICA APLICADA, 2011, 10 (01): : 1 - 7
  • [2] Automatic speech recognition and text-to-speech technologies for L2 pronunciation improvement: reflections on their affordances
    Gottardi, William
    de Almeida, Janaina Fernanda
    Soufen Tumolo, Celso Henrique
    TEXTO LIVRE-LINGUAGEM E TECNOLOGIA, 2022, 15
  • [3] ARABIC SPEECH PRONUNCIATION RECOGNITION AND CORRECTION USING AUTOMATIC SPEECH RECOGNIZER (ASR)
    Dahan, H. B.
    Mannan, A.
    INTED2012: INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE, 2012, : 4009 - 4016
  • [4] Automatic speech recognition (ASR) for the diagnosis of pronunciation of speech sound disorders in Korean children
    Ahn, Taekyung
    Hong, Yeonjung
    Im, Younggon
    Kim, Do Hyung
    Kang, Dayoung
    Jeong, Joo Won
    Kim, Jae Won
    Kim, Min Jung
    Cho, Ah-Ra
    Nam, Hosung
    Jang, Dae-Hyun
    CLINICAL LINGUISTICS & PHONETICS, 2024,
  • [5] Investigating the role of L1 in automatic pronunciation evaluation of L2 speech
    Tu, Ming
    Grabek, Anna
    Liss, Julie
    Berisha, Visar
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1636 - 1640
  • [6] Assessment of L2 intelligibility: Comparing L1 listeners and automatic speech recognition
    Inceoglu, Solene
    Chen, Wen-Hsin
    Lim, Hyojung
    RECALL, 2023, 35 (01) : 89 - 104
  • [7] Automatic Detection of Rhythmic Patterns in Native and L2 Speech: Chinese, Japanese, and Japanese L2 Chinese
    Shi Shuju
    Xie Yanlu
    Feng Xiaoli
    Zhang Jinsong
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [8] Automatic pronunciation assessment vs. automatic speech recognition: A study of conflicting conditions for L2-English
    Camara-Arenas, Enrique
    Tejedor-Garcia, Cristian
    Tomas-Vazquez, Cecilia Judith
    Escudero-Mancebo, David
    LANGUAGE LEARNING & TECHNOLOGY, 2023, 27 (01): : 28 - 28
  • [9] A CORPUS-BASED STUDY OF SPANISH L2 MISPRONUNCIATIONS BY JAPANESE SPEAKERS
    Carranza, M.
    Cucchiarini, C.
    Llisterri, J.
    Machuca, M. J.
    Rios, A.
    EDULEARN14: 6TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2014, : 3696 - 3705
  • [10] The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech
    Qiao, Yu
    Zhou, Wei
    Kerz, Elma
    Schlueter, Ralf
    INTERSPEECH 2021, 2021, : 4453 - 4457