TEXTUAL ECHO CANCELLATION

被引:0
|
作者
Ding, Shaojin [1 ]
Jia, Ye [1 ]
Hu, Ke [1 ]
Wang, Quan [1 ]
机构
[1] Google LLC, Mountain View, CA 94043 USA
关键词
echo cancellation; multi-source attention; sequence-to-sequence model; TIME;
D O I
10.1109/ASRU51503.2021.9688214
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose Textual Echo Cancellation (TEC) - a framework for cancelling the text-to-speech (TTS) playback echo(1) from overlapping speech recordings. Such a system can largely improve speech recognition performance and user experience for intelligent devices such as smart speakers, as the user can talk to the device while the device is still playing the TI'S signal responding to the previous query. We implement this system by using a novel sequence-to-sequence model with multi-source attention that takes both the microphone mixture signal and source text of the TTS playback as inputs, and predicts the enhanced audio. Experiments show that the textual information of the TTS playback is critical to enhancement performance. Besides, the text sequence is much smaller in size compared with the raw acoustic signal of the TTS playback, and can be immediately transmitted to the device or ASR server even before the playback is synthesized. Therefore, our proposed approach effectively reduces Internet communication and latency compared with alternative approaches such as acoustic echo cancellation (AEC).
引用
收藏
页码:548 / 555
页数:8
相关论文
共 50 条
  • [1] ECHO CANCELLATION AND APPLICATIONS
    MURANO, K
    UNAGAMI, S
    AMANO, F
    IEEE COMMUNICATIONS MAGAZINE, 1990, 28 (01) : 49 - 55
  • [2] ADAPTIVE REFERENCE ECHO CANCELLATION
    FALCONER, DD
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1982, 30 (09) : 2083 - 2094
  • [3] Echo cancellation with delay estimation
    Magotra, N
    Lenihan, G
    Sirivara, S
    Natarajan, TR
    40TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1 AND 2, 1998, : 732 - 735
  • [4] ECHO CANCELLATION FOR THE LOCAL LOOP
    AGAZZI, O
    HODGES, DA
    MESSERSCHMITT, DG
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1982, 6 (03): : 234 - 235
  • [5] Consistent echo cancellation design
    Bourget, Frederic
    Awad, Thomas
    Laurence, Martin
    Electronic Engineering (London), 2002, 74 (908): : 65 - 70
  • [6] Improved Echo cancellation in VOIP
    Halder, Patrashiya Magdolina
    Haque, A. K. M. Fazlul
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2011, 2 (11) : 122 - 125
  • [7] VOICE CHANNEL ECHO CANCELLATION
    FANG, GS
    IEEE COMMUNICATIONS MAGAZINE, 1983, 21 (09) : 11 - 14
  • [8] Consistent echo cancellation design
    Bourget, F
    Awad, T
    Laurence, M
    ELECTRONIC ENGINEERING DESIGN, 2002, 74 (908): : 65 - +
  • [9] Echo cancellation in IP networks
    Radecki, J
    Zilic, Z
    Radecka, K
    2002 45TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, CONFERENCE PROCEEDINGS, 2002, : 219 - 222
  • [10] Enhancement of Residual Echo for Robust Acoustic Echo Cancellation
    Wada, Ted S.
    Juang, Biing-Hwang
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (01): : 175 - 189