Blind source separation of speech in hardware

被引:2
|
作者
Hurley, N [1 ]
Harte, N [1 ]
Fearon, C [1 ]
Rickard, S [1 ]
机构
[1] Univ Coll Dublin, Dept Elect & Elect Engn, Dublin 2, Ireland
关键词
D O I
10.1109/SIPS.2005.1579909
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents preliminary work on a hardware implementation of a source separation algorithm employing time-frequency masking methods. DUET (Degenerate Unmixing Estimation Technique) has previously been shown to achieve excellent source separation in real time in software. The current work is a move towards a hardware realization of DUET that will allow integration of the algorithm into consumer devices. Initial stages involve investigating the performance of DUET when implemented in fixed-point arithmetic and a consideration of algorithmic changes to make DUET more amenable to implementation on a DSP processor. Performance is compared for floating-point and fixed-point implementations. A Weighted K-means clustering algorithm is presented as an alternative to gradient descent methods for peak tracking and demonstrated to achieve excellent performance without adversely affecting computational load. Preliminary performance figures are given for an implementation on a TMS320VC5510 DSK.
引用
收藏
页码:442 / 445
页数:4
相关论文
共 50 条
  • [41] Over-Determined Semi-Blind Speech Source Separation
    Togami, Masahito
    Scheibler, Robin
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 640 - 645
  • [42] Speech Recognition Using Blind Source Separation and Dereverberation Method for Mixed Sound of Speech and Music
    Wang, Longbiao
    Odani, Kyohei
    Kai, Atsuhiko
    Li, Weifeng
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [43] A hybrid algorithm for blind source separation of a convolutive mixture of three speech sources
    Shahab Faiz Minhas
    Patrick Gaydecki
    EURASIP Journal on Advances in Signal Processing, 2014
  • [44] A Blind Source Separation Based Approach for Speech Enhancement in Noisy and Reverberant Environment
    Pignotti, Alessio
    Marcozzi, Daniele
    Cifani, Simone
    Squartini, Stefano
    Piazza, Francesco
    CROSS-MODAL ANALYSIS OF SPEECH, GESTURES, GAZE AND FACIAL EXPRESSIONS, 2009, 5641 : 356 - 367
  • [45] The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech
    Araki, S
    Mukai, R
    Makino, S
    Nishikawa, T
    Saruwatari, H
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (02): : 109 - 116
  • [46] Online blind source separation and dereverberation of speech based on a joint diagonalizability constraint
    Yu, Ho-Gun
    Kim, Do-Hui
    Song, Min-Hwan
    Park, Hyung-Min
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 503 - 514
  • [47] Research on Speech Enhancement Algorithms Based on Blind Source Separation in Outdoor Environment
    Wang, Chunli
    Wang, Quanyu
    CYBER SECURITY INTELLIGENCE AND ANALYTICS, 2020, 928 : 837 - 842
  • [48] Blind Source Separation of 3-D located many speech signals
    Mukai, R
    Sawada, H
    Araki, S
    Makino, S
    2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 9 - 12
  • [49] An Efficient Multistage Approach for Blind Source Separation of Noisy Convolutive Speech Mixture
    Khan, Junaid Bahadar
    Jan, Tariqullah
    Khalil, Ruhul Amin
    Saeed, Nasir
    Almutiry, Muhannad
    APPLIED SCIENCES-BASEL, 2021, 11 (13):
  • [50] The Influence of Blind Source Separation on Mixed Audio Speech and Music Emotion Recognition
    Laugs, Casper
    Koops, Hendrik Vincent
    Odijk, Daan
    Kaya, Heysem
    Volk, Anja
    COMPANION PUBLICATON OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION (ICMI '20 COMPANION), 2020, : 67 - 71