Application for Real-time Personalized Speaker Extraction

被引:0
|
作者
Ronssin, Damien [1 ]
Cernak, Milos [1 ]
机构
[1] Logitech Europe SA, CH-1015 Lausanne, Switzerland
来源
关键词
speaker extraction; personalized speech enhancement; real-time audio processing; speech separation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This short paper demonstrates an audio processing desktop application that allows isolating in real-time the voice of a specific speaker from the possibly noisy audio input after a short enrollment phase. The machine learning model embedded in this application suppresses all other sounds than the target voice from the incoming audio stream, including disturbing distractor voices. In the context of a growing need for video-collaboration solutions, personalized speech enhancement enables the use of such technologies in more challenging acoustic environments, i.e., in the presence of near distractor speech. In this situation, classical speech enhancement systems typically fail as they do not filter out any speech, hence the need for personalized methods. The presented application is an all-in-one solution for personalized speech enhancement: it allows the user to enroll and then to apply the effect seamlessly for one-to-one or one-to-many online meetings.
引用
下载
收藏
页码:1955 / 1956
页数:2
相关论文
共 50 条
  • [31] A REAL-TIME SPEAKER DIARIZATION SYSTEM BASED ON SPATIAL SPECTRUM
    Zheng, Siqi
    Huang, Weilong
    Wang, Xianliang
    Suo, Hongbin
    Feng, Jinwei
    Yan, Zhijie
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7208 - 7212
  • [32] Presentation of real-time system for automatic speaker identification and verification
    David, P
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IV, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING, 2003, : 372 - 376
  • [33] SENSAY ANALYTICS™: A REAL-TIME SPEAKER-STATE PLATFORM
    Tsiartas, A.
    Albright, C.
    Bassiou, N.
    Frandsen, M.
    Miller, I.
    Shriberg, E.
    Smith, J.
    Voss, L.
    Wagner, V.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 6582 - 6583
  • [34] Chronological Self-Training for Real-Time Speaker Diarization
    Padfield, Dirk
    Liebling, Daniel J.
    INTERSPEECH 2021, 2021, : 4613 - 4617
  • [35] Real-Time Speaker Verification System Implemented on Reconfigurable Hardware
    Rafael Ramos-Lara
    Mariano López-García
    Enrique Cantó-Navarro
    Luís Puente-Rodriguez
    Journal of Signal Processing Systems, 2013, 71 : 89 - 103
  • [36] Real-Time Speaker Identification System using Cepstral Features
    Barik, Monalisha
    Sarangi, Susanta Kumar
    Sahu, Sushanta Kumar
    2016 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION CONTROL AND INTELLIGENT SYSTEMS (CCIS), 2016, : 89 - 93
  • [37] Real-time Informatized caption enhancement based on speaker pronunciation time database
    Choi, Yong-Sik
    Kang, Jin-Gu
    Joo, Jong Wha J.
    Jung, Jin-Woo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (47-48) : 35667 - 35688
  • [38] Real-time Informatized caption enhancement based on speaker pronunciation time database
    Yong-Sik Choi
    Jin-Gu Kang
    Jong Wha J. Joo
    Jin-Woo Jung
    Multimedia Tools and Applications, 2020, 79 : 35667 - 35688
  • [39] Electric Loads as Real-Time tasks: an application of Real-Time Physical Systems
    Della Vedova, Marco L.
    di Palma, Ettore
    Facchinetti, Tullio
    2011 7TH INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING CONFERENCE (IWCMC), 2011, : 1117 - 1123
  • [40] Real-time pitch extraction of voiced speech
    George, DE
    Salari, E
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 1997, 20 (04) : 379 - 387