The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021

被引:0
|
作者
Liu, Dan [1 ,2 ]
Du, Mengge [2 ]
Li, Xiaoxi [2 ]
Hu, Yuchen [1 ]
Dai, Lirong [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] iFlytek Res, Hefei, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes USTC-NELSLIP's submissions to the IWSLT2021 Simultaneous Speech Translation task. We proposed a novel simultaneous translation model, Cross Attention Augmented Transducer (CAAT), which extends conventional RNN-T to sequence-to-sequence tasks without monotonic constraints, e.g., simultaneous translation. Experiments on speech-to-text (S2T) and text-to-text (T2T) simultaneous translation tasks shows CAAT achieves better quality-latency trade-offs compared to wait-k, one of the previous state-of-the-art approaches. Based on CAAT architecture and data augmentation, we build S2T and T2T simultaneous translation systems in this evaluation campaign. Compared to last year's optimal systems, our S2T simultaneous translation system improves by an average of 11.3 BLEU for all latency regimes, and our T2T simultaneous translation system improves by an average of 4.6 BLEU.
引用
收藏
页码:30 / 38
页数:9
相关论文
共 40 条
  • [31] Amazon Alexa AI's System for IWSLT 2022 Offline Speech Translation Shared Task
    Shanbhogue, Akshaya Vishnu Kudlu
    Xue, Ran
    Chang, Ching-Yun
    Campbell, Sarah
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE TRANSLATION (IWSLT 2022), 2022, : 169 - 176
  • [32] DiDi Labs' End-to-End System for the IWSLT 2020 Offline Speech Translation Task
    Arkhangorodsky, Arkady
    Huang, Yiqi
    Axelrod, Amittai
    17TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE TRANSLATION (IWSLT 2020), 2020, : 69 - 72
  • [33] The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task
    Zhang, Ziqiang
    Ao, Junyi
    IWSLT 2022 - 19th International Conference on Spoken Language Translation, Proceedings of the Conference, 2022, : 158 - 168
  • [34] The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task
    Zhang, Ziqiang
    Ao, Junyi
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE TRANSLATION (IWSLT 2022), 2022, : 158 - 168
  • [35] Multilingual Speech Translation with Unified Transformer: Huawei Noah's Ark Lab at IWSLT 2021
    Zeng, Xingshan
    Li, Liangyou
    Liu, Qun
    IWSLT 2021: THE 18TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE TRANSLATION, 2021, : 149 - 153
  • [36] End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021
    Gallego, Gerard, I
    Tsiamas, Ioannis
    Escolano, Carlos
    Fonollosa, Jose A. R.
    Costa-jussa, Marta R.
    IWSLT 2021: THE 18TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE TRANSLATION, 2021, : 110 - 119
  • [37] The NiuTrans's Submission to the IWSLT22 English-to-Chinese Offline Speech Translation Task
    Zhang, Yuhao
    Huang, Canan
    Xu, Chen
    Liu, Xiaoqian
    Li, Bei
    Ma, Anxiang
    Xiao, Tong
    Zhu, Jingbo
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE TRANSLATION (IWSLT 2022), 2022, : 232 - 238
  • [38] ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020
    Elbayad, Maha
    Ha Nguyen
    Bougares, Fethi
    Tomashenko, Natalia
    Caubriere, Antoine
    Lecouteux, Benjamin
    Esteve, Yannick
    Besacier, Laurent
    17TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE TRANSLATION (IWSLT 2020), 2020, : 35 - 43
  • [39] ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks
    Boito, Marcely Zanon
    Ortega, John
    Riguidel, Hugo
    Laurent, Antoine
    Barrault, Loic
    Bougares, Fethi
    Chaabani, Firas
    Nguyen, Ha
    Barbier, Florentin
    Gahbiche, Souhir
    Esteve, Yannick
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE TRANSLATION (IWSLT 2022), 2022, : 308 - 318
  • [40] Without Further Ado: Direct and Simultaneous Speech Translation by AppTek in 2021
    Bahar, Parnia
    Wilken, Patrick
    di Gangi, Mattia
    Matusov, Evgeny
    IWSLT 2021: THE 18TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE TRANSLATION, 2021, : 52 - 63