Low-Latency Neural Speech Translation

被引:23
|
作者
Niehues, Jan [1 ]
Ngoc-Quan Pham [1 ]
Thanh-Le Ha [1 ]
Sperber, Matthias [1 ]
Waibel, Alex [1 ]
机构
[1] KIT, Inst Anthropomat & Robot, Karlsruhe, Germany
关键词
speech translation; low-latency;
D O I
10.21437/Interspeech.2018-1055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Through the development of neural machine translation, the quality of machine translation systems has been improved significantly. By exploiting advancements in deep learning, systems are now able to better approximate the complex mapping from source sentences to target sentences. But with this ability, new challenges also arise. An example is the translation of partial sentences in low-latency speech translation. Since the model has only seen complete sentences in training, it will always try to generate a complete sentence, though the input may only be a partial sentence. We show that NMT systems can be adapted to scenarios where no task-specific training data is available. Furthermore, this is possible without losing performance on the original training data. We achieve this by creating artificial data and by using multi-task learning. After adaptation, we are able to reduce the number of corrections displayed during incremental output construction by 45%, without a decrease in translation quality.
引用
收藏
页码:1293 / 1297
页数:5
相关论文
共 50 条
  • [41] UNIDIRECTIONAL LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORK WITH RECURRENT OUTPUT LAYER FOR LOW-LATENCY SPEECH SYNTHESIS
    Zen, Heiga
    Sak, Hasim
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4470 - 4474
  • [42] Implementation of low-latency electrolaryngeal speech enhancement based on multi-task CLDNN
    Kobayashi, Kazuhiro
    Toda, Tomoki
    [J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 396 - 400
  • [43] LOW-LATENCY INCREMENTAL TEXT-TO-SPEECH SYNTHESIS WITH DISTILLED CONTEXT PREDICTION NETWORK
    Saeki, Takaaki
    Takamichi, Shinnosuke
    Saruwatari, Hiroshi
    [J]. 2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 749 - 756
  • [44] Low-latency VLSI Architecture for Neural Cross-frequency Coupling Analysis
    O'Leary, Gerard
    Valiante, Taufik A.
    Genov, Roman
    [J]. 2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017, : 2247 - 2250
  • [45] LEARN Codes: Inventing low-latency codes via recurrent neural networks
    Jiang, Yihan
    Kim, Hyeji
    Asnani, Himanshu
    Kannan, Sreeram
    Oh, Sewoong
    Viswanath, Pramod
    [J]. ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [46] Listening with Googlears: Low-Latency Neural Multiframe Beamforming and Equalization for Hearing Aids
    Yang, Samuel
    Wisdom, Scott
    Gnegy, Chet
    Lyon, Richard F.
    Savla, Sagar
    [J]. INTERSPEECH 2022, 2022, : 3939 - 3943
  • [47] Low-Latency Privacy-Preserving Outsourcing of Deep Neural Network Inference
    Tian, Yifan
    Njilla, Laurent
    Yuan, Jiawei
    Yu, Shucheng
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) : 3300 - 3309
  • [48] Low-latency Convolutional Neural Network for Classification of Previously Unseen Drone Types
    Ahmad, Bashar, I
    Grey, Jonathan
    Newman, Mike
    Harman, Stephen
    [J]. 2022 19TH EUROPEAN RADAR CONFERENCE (EURAD), 2022, : 189 - 192
  • [49] An FPGA-Based Low-Latency Accelerator for Randomly Wired Neural Networks
    Kuramochi, Ryosuke
    Nakahara, Hiroki
    [J]. 2020 30TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2020, : 298 - 303
  • [50] Highway Connection for Low-Latency and High-Accuracy Spiking Neural Networks
    Zhang, Anguo
    Wu, Junyi
    Li, Xiumin
    Li, Hung Chun
    Gao, Yueming
    Pun, Sio Hang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (12) : 4579 - 4583