Low-Latency Neural Speech Translation

被引:23
|
作者
Niehues, Jan [1 ]
Ngoc-Quan Pham [1 ]
Thanh-Le Ha [1 ]
Sperber, Matthias [1 ]
Waibel, Alex [1 ]
机构
[1] KIT, Inst Anthropomat & Robot, Karlsruhe, Germany
关键词
speech translation; low-latency;
D O I
10.21437/Interspeech.2018-1055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Through the development of neural machine translation, the quality of machine translation systems has been improved significantly. By exploiting advancements in deep learning, systems are now able to better approximate the complex mapping from source sentences to target sentences. But with this ability, new challenges also arise. An example is the translation of partial sentences in low-latency speech translation. Since the model has only seen complete sentences in training, it will always try to generate a complete sentence, though the input may only be a partial sentence. We show that NMT systems can be adapted to scenarios where no task-specific training data is available. Furthermore, this is possible without losing performance on the original training data. We achieve this by creating artificial data and by using multi-task learning. After adaptation, we are able to reduce the number of corrections displayed during incremental output construction by 45%, without a decrease in translation quality.
引用
收藏
页码:1293 / 1297
页数:5
相关论文
共 50 条
  • [21] Low-Latency Neural Speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks
    Ai, Yang
    Ling, Zhen-Hua
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2283 - 2296
  • [22] Research on Construction of Low-Latency S-Boxes and Bidirectional Low-Latency Properties
    Wu, Rui-Chen
    Zhang, Lei
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (11): : 3769 - 3779
  • [23] Orthros: A Low-Latency PRF
    Banik, Subhadeep
    Isobe, Takanori
    Liu, Fukang
    Minematsu, Kazuhiko
    Sakamoto, Kosei
    [J]. IACR TRANSACTIONS ON SYMMETRIC CRYPTOLOGY, 2021, 2021 (01) : 37 - 77
  • [24] Low-latency monaural speech enhancement with deep filter-bank equalizer
    Zheng, Chengshi
    Liu, Wenzhe
    Li, Andong
    Ke, Yuxuan
    Li, Xiaodong
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (05): : 3291 - 3304
  • [25] Low-latency query compilation
    Henning Funke
    Jan Mühlig
    Jens Teubner
    [J]. The VLDB Journal, 2022, 31 : 1171 - 1184
  • [26] Low-latency query compilation
    Funke, Henning
    Muehlig, Jan
    Teubner, Jens
    [J]. VLDB JOURNAL, 2022, 31 (06): : 1171 - 1184
  • [27] LOW-LATENCY SOUND SOURCE SEPARATION USING DEEP NEURAL NETWORKS
    Naithani, Gaurav
    Parascandolo, Giambattista
    Barker, Tom
    Pontoppidan, Niels Henrik
    Virtanen, Tuomas
    [J]. 2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 272 - 276
  • [28] TIME-FREQUENCY MASKING STRATEGIES FOR SINGLE-CHANNEL LOW-LATENCY SPEECH ENHANCEMENT USING NEURAL NETWORKS
    Parviainen, Mikko
    Pertila, Pasi
    Virtanen, Tuomas
    Grosche, Peter
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 51 - 55
  • [29] Super-Human Performance in Online Low-latency Recognition of Conversational Speech
    Thai-Son Nguyen
    Stueker, Sebastian
    Waibel, Alex
    [J]. INTERSPEECH 2021, 2021, : 1762 - 1766
  • [30] Training Low-Latency Spiking Neural Network with Orthogonal Spiking Neurons
    Yao, Yunpeng
    Wu, Man
    Zhang, Renyuan
    [J]. 2023 21ST IEEE INTERREGIONAL NEWCAS CONFERENCE, NEWCAS, 2023,