Access the cluster tendency by visual methods for robust speech clustering

被引:1
|
作者
Suneetha Rani T. [1 ]
Krishna Prasad M.H.M. [1 ]
机构
[1] Department of CSE, JNTUK, Kakinada
关键词
GMM; k-means; MST-based clustering; Speech clustering; VAT;
D O I
10.1007/s13198-015-0393-z
中图分类号
学科分类号
摘要
Identifying the cues for speech segments of speech data is an indispensable task in speaker clustering. The existing techniques perform the task of speech clustering without any prior knowledge of cluster tendency. Many techniques are investigated for finding a prior cluster tendency (CT). During the investigation, the visual access tendency (VAT) is recognized as a reasonable choice to find a cluster tendency. The speech clustering poses three important problems, which are as follows: modelling the speech data, cluster tendency, and effective speech clustering. Modelling is required for defining the shape of the speech segment based on the characteristics of speaker’s voice; hence it is useful for speech recognition. The GMM is a good choice for obtaining the precise model of speech data. Determining the number of speakers (or number of clusters) for the speech is known as cluster tendency. The quality of speech clustering depends on modelling and a prior clustering tendency. The classical algorithms [such as k-means, and minimum spanning tree (MST)-based-clustering] are merged with VAT for determining the effective clustering results along with a prior cluster tendency. We use linear subspace learning for representing the speech segments (or speech utterances) in a projected space of high-dimensional data. Various linear subspace learning techniques are used for improving the speech clustering results. The proposed approaches are hybrid approaches (i.e., k-means-CT, and MST–CT-based clustering), they use expensive steps. For this key reason, we propose another method, direct visualized clustering method, in which we derive the explicit speaker clustering results directly from VAT instead of using either k-means or MST-based clustering. We experimented the proposed methods on TSP speech datasets and done the comparative study for demonstrating the effectiveness of our work. © 2015, The Society for Reliability Engineering, Quality and Operations Management (SREQOM), India and The Division of Operation and Maintenance, Lulea University of Technology, Sweden.
引用
收藏
页码:465 / 477
页数:12
相关论文
共 50 条
  • [31] A robust unsupervised pattern discovery and clustering of speech signals
    Kumar, Kishore R.
    Birla, Lokendra
    Rao, Sreenivasa K.
    PATTERN RECOGNITION LETTERS, 2018, 116 : 254 - 261
  • [32] Modern robust data analysis methods: Measures of central tendency
    Wilcox, RR
    Keselman, HJ
    PSYCHOLOGICAL METHODS, 2003, 8 (03) : 254 - 274
  • [33] ROBUST FEATURE CLUSTERING FOR UNSUPERVISED SPEECH ACTIVITY DETECTION
    Dubey, Harishchandra
    Sangwan, Abhijeet
    Hansen, John H. L.
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2726 - 2730
  • [34] AN EFFICIENT VISUAL ANALYSIS METHOD FOR CLUSTER TENDENCY EVALUATION, DATA PARTITIONING AND INTERNAL CLUSTER VALIDATION
    Prabhu, Puniethaa
    Duraiswamy, Karuppusamy
    COMPUTING AND INFORMATICS, 2013, 32 (05) : 1013 - 1037
  • [36] Audio-Visual Deep Clustering for Speech Separation
    Lu, Rui
    Duan, Zhiyao
    Zhang, Changshui
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1697 - 1712
  • [37] A new formulation of the coVAT algorithm for visual assessment of clustering tendency in rectangular data
    Havens, Timothy C.
    Bezdek, James C.
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2012, 27 (06) : 590 - 612
  • [38] Clustering methods for the optimization of atomic cluster structure
    Bagattini, Francesco
    Schoen, Fabio
    Tigli, Luca
    JOURNAL OF CHEMICAL PHYSICS, 2018, 148 (14):
  • [39] Visual cluster validity for prototype generator clustering models
    Hathaway, RJ
    Bezdek, JC
    PATTERN RECOGNITION LETTERS, 2003, 24 (9-10) : 1563 - 1569
  • [40] A Visual Approach to Improve Clustering Based on Cluster Ensembles
    Zhou, Jianping
    Konecni, Shawn
    Marx, Kenneth
    Grinstein, Georges
    VISUALIZATION AND DATA ANALYSIS 2010, 2010, 7530