Cascade Speech Translation for the Kazakh Language

Cited by: 2
Authors
Kozhirbayev, Zhanibek [1]
Islamgozhayev, Talgat [1]
Affiliations
[1] Nazarbayev Univ, Natl Lab Astana, Astana 010000, Kazakhstan
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 15
Keywords
cascade speech translation; Kazakh language; Russian language; automatic speech recognition; machine translation; cross-lingual communication; RECOGNITION;
DOI
10.3390/app13158900
Chinese Library Classification
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Speech translation systems have become indispensable in facilitating seamless communication across language barriers. This paper presents a cascade speech translation system tailored specifically for translating speech from the Kazakh language to Russian. The system aims to enable effective cross-lingual communication between Kazakh and Russian speakers, addressing the unique challenges posed by these languages. To develop the cascade speech translation system, we first created a dedicated speech translation dataset ST-kk-ru based on the ISSAI Corpus. The ST-kk-ru dataset comprises a large collection of Kazakh speech recordings along with their corresponding Russian translations. The automatic speech recognition (ASR) module of the system utilizes deep learning techniques to convert spoken Kazakh input into text. The machine translation (MT) module employs state-of-the-art neural machine translation methods, leveraging the parallel Kazakh-Russian translations available in the dataset to generate accurate translations. By conducting extensive experiments and evaluations, we have thoroughly assessed the performance of the cascade speech translation system on the ST-kk-ru dataset. The outcomes of our evaluation highlight the effectiveness of incorporating additional datasets for both the ASR and MT modules. This augmentation leads to a significant improvement in the performance of the cascade speech translation system, increasing the BLEU score by approximately 2 points when translating from Kazakh to Russian. These findings underscore the importance of leveraging supplementary data to enhance the capabilities of speech translation systems.
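The cascade structure described in the abstract (an ASR module producing Kazakh text, followed by an MT module producing Russian text) can be illustrated with a minimal sketch. Everything named here is hypothetical: `CascadeST`, `toy_asr`, `toy_mt`, and `TOY_LEXICON` are stand-ins invented for illustration, not the authors' models or code. The toy ASR simply decodes bytes as UTF-8 and the toy MT is a word-by-word dictionary lookup; the sketch only shows how the two stages compose.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class CascadeST:
    """Hypothetical cascade: audio -> source-language text -> target-language text."""
    asr: Callable[[bytes], str]  # speech recognition stage (Kazakh audio to Kazakh text)
    mt: Callable[[str], str]     # machine translation stage (Kazakh text to Russian text)

    def translate(self, audio: bytes) -> Tuple[str, str]:
        # Stage 1: recognize the spoken input as text.
        transcript = self.asr(audio)
        # Stage 2: translate the recognized text; ASR errors propagate into this stage.
        translation = self.mt(transcript)
        return transcript, translation


def toy_asr(audio: bytes) -> str:
    # Assumption for illustration only: the "audio" is UTF-8 text in disguise.
    return audio.decode("utf-8")


# Assumption for illustration only: a two-word Kazakh-to-Russian lexicon.
TOY_LEXICON = {"сәлем": "привет", "әлем": "мир"}


def toy_mt(text: str) -> str:
    # Word-by-word lookup; unknown words are passed through unchanged.
    return " ".join(TOY_LEXICON.get(word, word) for word in text.split())


system = CascadeST(asr=toy_asr, mt=toy_mt)
transcript, translation = system.translate("сәлем әлем".encode("utf-8"))
print(transcript, "->", translation)
```

Because the stages are passed in as plain callables, either one can be swapped for a stronger model (for example, one trained with additional data) without changing the other, which is the kind of per-module improvement the abstract evaluates.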
Pages: 17