Dynamic Transcription for Low-latency Speech Translation

被引:20
|
作者
Niehues, Jan [1 ]
Nguyen, Thai Son [1 ]
Cho, Eunah [1 ]
Ha, Thanh-Le [1 ]
Kilgour, Kevin [1 ]
Mueller, Markus [1 ]
Sperber, Matthias [1 ]
Stueker, Sebastian [1 ]
Waibel, Alex [1 ]
机构
[1] Karlsruhe Inst Technol, Karlsruhe, Germany
基金
欧盟地平线“2020”;
关键词
speech translation; low-latency;
D O I
10.21437/Interspeech.2016-154
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Latency is one of the main challenges in the task of simultaneous spoken language translation. While significant improvements in recent years have led to high quality automatic translations, their usefulness in real-time settings is still severely limited due to the large delay between the input speech and the delivered translation. In this paper, we present a novel scheme which reduces the latency of a large scale speech translation system drastically. Within this scheme, the transcribed text and its translation can be updated when more context is available, even after they are presented to the user. Thereby, this scheme allows us to display an initial transcript and its translation to the user with a very low latency. If necessary, both transcript and translation can later be updated to better, more accurate versions until eventually the final versions are displayed. Using this framework, we are able to reduce the latency of the source language transcript into half. For the translation, an average delay of 3.3s was achieved, which is more than twice as fast as our initial system.
引用
收藏
页码:2513 / 2517
页数:5
相关论文
共 50 条
  • [11] A Survey on Low-Latency DNN-Based Speech Enhancement
    Drgas, Szymon
    [J]. SENSORS, 2023, 23 (03)
  • [12] LOW-LATENCY SPEAKER-INDEPENDENT CONTINUOUS SPEECH SEPARATION
    Yoshioka, Takuya
    Chen, Zhuo
    Liu, Changliang
    Xiao, Xiong
    Erdogan, Hakan
    Dimitriadis, Dimitrios
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6980 - 6984
  • [13] Low-latency trading
    Hasbrouck, Joel
    Saar, Gideon
    [J]. JOURNAL OF FINANCIAL MARKETS, 2013, 16 (04) : 646 - 679
  • [14] Efficient Low-Latency Speech Enhancement with Mobile Audio Streaming Networks
    Romaniuk, Michal
    Masztalski, Piotr
    Piaskowski, Karol
    Matuszewski, Mateusz
    [J]. INTERSPEECH 2020, 2020, : 3296 - 3300
  • [15] A Buffer Dynamic Stabilizer for Low-Latency Adaptive Video Streaming
    Shuai, Yongtao
    Herfet, Thorsten
    [J]. 2016 IEEE 6TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - BERLIN (ICCE-BERLIN), 2016,
  • [16] A Low-Latency BF Decoding of LDPC Codes With Dynamic Thresholds
    Jiang, Ming
    Fan, Dongli
    [J]. IEEE COMMUNICATIONS LETTERS, 2021, 25 (09) : 2781 - 2785
  • [17] Low-latency readout electronics for dynamic superconducting quantum computing
    Guo, Cheng
    Lin, Jin
    Han, Lian-Chen
    Li, Na
    Sun, Li-Hua
    Liang, Fu-Tian
    Li, Dong-Dong
    Li, Yu-Huai
    Gong, Ming
    Xu, Yu
    Liao, Sheng-Kai
    Peng, Cheng-Zhi
    [J]. AIP ADVANCES, 2022, 12 (04)
  • [18] Dynamic Polling Sequence Arrangement for Low-Latency Wireless LAN
    Lv, Yunxin
    Ruan, Lihua
    Dias, Maluge Pubuduni Imali
    Wong, Elaine
    Feng, Ye
    Jiang, Ning
    Qiu, Kun
    [J]. 2018 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP), 2018,
  • [19] Research on Construction of Low-Latency S-Boxes and Bidirectional Low-Latency Properties
    Wu, Rui-Chen
    Zhang, Lei
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (11): : 3769 - 3779
  • [20] Orthros: A Low-Latency PRF
    Banik, Subhadeep
    Isobe, Takanori
    Liu, Fukang
    Minematsu, Kazuhiko
    Sakamoto, Kosei
    [J]. IACR TRANSACTIONS ON SYMMETRIC CRYPTOLOGY, 2021, 2021 (01) : 37 - 77