Cascade Speech Translation for the Kazakh Language

被引:2
|
作者
Kozhirbayev, Zhanibek [1 ]
Islamgozhayev, Talgat [1 ]
机构
[1] Nazarbayev Univ, Natl Lab Astana, Astana 010000, Kazakhstan
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 15期
关键词
cascade speech translation; Kazakh language; Russian language; automatic speech recognition; machine translation; cross-lingual communication; RECOGNITION;
D O I
10.3390/app13158900
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Speech translation systems have become indispensable in facilitating seamless communication across language barriers. This paper presents a cascade speech translation system tailored specifically for translating speech from the Kazakh language to Russian. The system aims to enable effective cross-lingual communication between Kazakh and Russian speakers, addressing the unique challenges posed by these languages. To develop the cascade speech translation system, we first created a dedicated speech translation dataset ST-kk-ru based on the ISSAI Corpus. The ST-kk-ru dataset comprises a large collection of Kazakh speech recordings along with their corresponding Russian translations. The automatic speech recognition (ASR) module of the system utilizes deep learning techniques to convert spoken Kazakh input into text. The machine translation (MT) module employs state-of-the-art neural machine translation methods, leveraging the parallel Kazakh-Russian translations available in the dataset to generate accurate translations. By conducting extensive experiments and evaluations, we have thoroughly assessed the performance of the cascade speech translation system on the ST-kk-ru dataset. The outcomes of our evaluation highlight the effectiveness of incorporating additional datasets for both the ASR and MT modules. This augmentation leads to a significant improvement in the performance of the cascade speech translation system, increasing the BLEU score by approximately 2 points when translating from Kazakh to Russian. These findings underscore the importance of leveraging supplementary data to enhance the capabilities of speech translation systems.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Speech into Sign Language Statistical Translation System for Deaf People
    Gallo, B.
    San-Segundo, R.
    Lucas, J. M.
    Barra, R.
    D'Haro, L. F.
    Fernandez, F.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2009, 7 (03) : 400 - 404
  • [42] Adapting a Speech into Sign Language Translation System to a new domain
    Lopez-Ludena, V.
    San-Segundo, R.
    Gonzalez-Morcillo, C.
    Lopez, J. C.
    Ferreiro, E.
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1163 - 1167
  • [43] INTEGRATION OF SPEECH RECOGNITION AND LANGUAGE PROCESSING IN A JAPANESE TO ENGLISH SPOKEN LANGUAGE TRANSLATION SYSTEM
    MORIMOTO, T
    SHIKANO, K
    KOGURE, K
    IIDA, H
    KUREMATSU, A
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07): : 1889 - 1896
  • [44] Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference?
    Bentivogli, Luisa
    Cettolo, Mauro
    Gaido, Marco
    Karakanta, Alina
    Martinelli, Alberto
    Negri, Matteo
    Turchi, Marco
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 2873 - 2887
  • [45] Streaming cascade-based speech translation leveraged by a direct segmentation model
    Iranzo-Sánchez, Javier
    Jorge, Javier
    Baquero-Arnal, Pau
    Silvestre-Cerdà, Joan Albert
    Giménez, Adrià
    Civera, Jorge
    Sanchis, Albert
    Juan, Alfons
    [J]. Neural Networks, 2021, 142 : 303 - 315
  • [46] Impact of Statistical Language Model on Example Based Machine Translation System between Kazakh and Turkish Languages
    Kessikbayeva, Gulshat
    Cicekli, Ilyas
    [J]. 2020 4TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2020, 2020, : 112 - 118
  • [47] Kazakh Names for Relatives and Their Peculiarities in Oral Speech
    Shadkam, Zubaida
    [J]. MILLI FOLKLOR, 2019, (123): : 40 - 53
  • [48] THE MOTIVATION OF THE DERIVED WORDS IN THE KAZAKH LANGUAGE
    Salkynbay, A.
    [J]. ART-SANAT, 2016, : 244 - 249
  • [49] Language and Identity in Kazakh Horse Culture
    Sarbassova, Guldana
    [J]. BILIG, 2015, (75) : 227 - 247
  • [50] National values in the translation of Kazakh literary works
    Kozhakhmetova, Gulsara
    Abeshova, Nurgul
    Tazhibayeva, Saule
    Ibragimova, Karlygash
    [J]. CADERNOS DE TRADUCAO, 2024, 44 (01):