Estimating Number of Speakers via Density-Based Clustering and Classification Decision

被引:4
|
作者
Yang, Junjie [1 ,2 ]
Guo, Yi [3 ]
Yang, Zuyuan [1 ,2 ]
Yang, Liu [4 ]
Xie, Shengli [1 ,2 ]
机构
[1] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China
[2] Guangdong Key Lab IoT Informat Proc, Guangzhou 510006, Peoples R China
[3] Western Sydney Univ Parramatta, Ctr Res Math & Data Sci, Parramatta, NSW 2150, Australia
[4] Guangzhou Univ, Sch Comp Sci & Cyber Engn, Guangzhou 510006, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Number of speakers; speeches; reverberation; audio source separation (ASS); local dominance; density-based clustering; TIME-FREQUENCY MASKING; BLIND SEPARATION;
D O I
10.1109/ACCESS.2019.2956772
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is crucial to robustly estimate the number of speakers (NoS) from the recorded audio mixtures in a reverberant environment. Some popular time-frequency (TF) methods approach this NoS estimation problem by assuming that only one of the speech components is active at each TF slot. However, this condition is violated in many scenarios where the speeches are convolved with long length of room impulse response coefficients, which causes degenerated performance of NoS estimation. To tackle this problem, a density-based clustering strategy is proposed to estimate NoS based on a local dominance assumption of speeches. Our method consists of several steps from clustering to classification of speakers with the consideration of robustness. First, the leading eigenvectors are extracted from the local covariance matrices of mixture TF components and ranked by the combination of local density and minimum distance to other leading eigenvectors with higher density. Second, a gap-based method is employed to determine the cluster centers from the ranked leading eigenvectors at each frequency bin. Third, a criterion based on averaged volume of cluster centers is proposed to select reliable clustering results at some frequency bins for the classification decision of NoS. The experiment results demonstrate that the proposed algorithm is superior to the existing methods in various reverberation cases with noise-free condition or noise condition.
引用
收藏
页码:176541 / 176551
页数:11
相关论文
共 50 条
  • [31] An ensemble density-based clustering method
    Xia, Luning
    Jing, Jiwu
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE 2007), 2007,
  • [32] Hierarchical density-based clustering of shapes
    Gautama, T
    Van Hulle, MM
    NEURAL NETWORKS FOR SIGNAL PROCESSING XI, 2001, : 213 - 222
  • [33] Deep density-based image clustering
    Ren, Yazhou
    Wang, Ni
    Li, Mingxia
    Xu, Zenglin
    KNOWLEDGE-BASED SYSTEMS, 2020, 197
  • [34] Anytime parallel density-based clustering
    Son T. Mai
    Ira Assent
    Jon Jacobsen
    Martin Storgaard Dieu
    Data Mining and Knowledge Discovery, 2018, 32 : 1121 - 1176
  • [35] Density-based clustering with topographic maps
    Van Hulle, MM
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (01): : 204 - 207
  • [36] Density-based clustering of social networks
    Menardi, Giovanna
    De Stefano, Domenico
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2022, 185 (03) : 1004 - 1029
  • [37] Scalable density-based distributed clustering
    Januzaj, E
    Kriegel, HP
    Pfeifle, M
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2004, PROCEEDINGS, 2004, 3202 : 231 - 244
  • [38] A unified view of density-based methods for semi-supervised clustering and classification
    Gertrudes, Jadson Castro
    Zimek, Arthur
    Sander, Jorg
    Campello, Ricardo J. G. B.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (06) : 1894 - 1952
  • [39] A unified view of density-based methods for semi-supervised clustering and classification
    Jadson Castro Gertrudes
    Arthur Zimek
    Jörg Sander
    Ricardo J. G. B. Campello
    Data Mining and Knowledge Discovery, 2019, 33 : 1894 - 1952
  • [40] DBHD: Density-based clustering for highly varying density
    Durani, Walid
    Mautz, Dominik
    Plant, Claudia
    Boehm, Christian
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 921 - 926