Access the cluster tendency by visual methods for robust speech clustering

Cited by: 1
Authors
Suneetha Rani T. [1]
Krishna Prasad M.H.M. [1]
Affiliations
[1] Department of CSE, JNTUK, Kakinada
Keywords
GMM; k-means; MST-based clustering; Speech clustering; VAT
DOI
10.1007/s13198-015-0393-z
Abstract
Identifying cues for the speech segments in speech data is an indispensable task in speaker clustering. Existing techniques perform speech clustering without any prior knowledge of the cluster tendency (CT). Many techniques were investigated for finding a prior cluster tendency; among them, visual assessment of cluster tendency (VAT) was recognized as a reasonable choice. Speech clustering poses three important problems: modelling the speech data, assessing the cluster tendency, and clustering the speech effectively. Modelling defines the shape of a speech segment based on the characteristics of the speaker's voice; hence it is useful for speech recognition. The Gaussian mixture model (GMM) is a good choice for obtaining a precise model of speech data. Cluster tendency here means determining the number of speakers (i.e., the number of clusters) in the speech. The quality of speech clustering depends on the modelling and on the prior cluster tendency. Classical algorithms, such as k-means and minimum spanning tree (MST)-based clustering, are combined with VAT to obtain effective clustering results together with a prior cluster tendency. We use linear subspace learning to represent the speech segments (or speech utterances) in a projected space of the high-dimensional data, and we apply various linear subspace learning techniques to improve the speech clustering results. The proposed approaches, k-means-CT and MST-CT-based clustering, are hybrid approaches and involve expensive steps. For this reason, we propose another method, direct visualized clustering, in which the explicit speaker clustering results are derived directly from VAT instead of from k-means or MST-based clustering. We evaluated the proposed methods on the TSP speech datasets and carried out a comparative study to demonstrate the effectiveness of our work.
© 2015, The Society for Reliability Engineering, Quality and Operations Management (SREQOM), India and The Division of Operation and Maintenance, Lulea University of Technology, Sweden.
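The VAT step named in the abstract can be illustrated with a minimal sketch. It assumes the standard VAT reordering (a Prim-style traversal of a pairwise dissimilarity matrix), which may differ from the exact variant used in the paper. The function names `vat_order` and `vat_image` are hypothetical, and the synthetic 2-D points stand in for speech-segment feature vectors, which the abstract does not specify.

```python
import numpy as np

def vat_order(dissimilarity):
    """Return a VAT ordering (permutation of indices) for a square dissimilarity matrix."""
    R = np.asarray(dissimilarity, dtype=float)
    n = R.shape[0]
    # Start from one endpoint of the largest dissimilarity in the matrix.
    first = int(np.unravel_index(np.argmax(R), R.shape)[0])
    order = [first]
    remaining = [i for i in range(n) if i != first]
    # min_dist[k] is the smallest dissimilarity from remaining[k] to any ordered point.
    min_dist = R[first, remaining].copy()
    while remaining:
        k = int(np.argmin(min_dist))      # closest unordered point to the ordered set
        nxt = remaining.pop(k)
        order.append(nxt)
        min_dist = np.delete(min_dist, k)
        if remaining:
            min_dist = np.minimum(min_dist, R[nxt, remaining])
    return np.array(order)

def vat_image(dissimilarity):
    """Return the reordered dissimilarity matrix and the ordering that produced it."""
    R = np.asarray(dissimilarity, dtype=float)
    order = vat_order(R)
    return R[np.ix_(order, order)], order

# Assumption: two well-separated synthetic groups play the role of two speakers.
rng = np.random.default_rng(0)
points = np.vstack([rng.normal(0.0, 1.0, (5, 2)), rng.normal(50.0, 1.0, (5, 2))])
labels = np.array([0] * 5 + [1] * 5)
shuffle = rng.permutation(10)
points, labels = points[shuffle], labels[shuffle]

# Pairwise Euclidean distances serve as the dissimilarity matrix.
D = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)
reordered, order = vat_image(D)
print(labels[order])  # members of each group appear as one contiguous run
```

When the reordered matrix is displayed as a grayscale image, each compact group shows up as a dark block on the diagonal, and counting those blocks gives a visual estimate of the number of clusters before any clustering algorithm is run.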
Pages: 465-477
Number of pages: 12