Estimating Speaker Clustering Quality Using Logistic Regression

被引:3
|
作者
Cohen, Yishai [1 ]
Lapidot, Itshak [1 ]
机构
[1] Afeka Tel Aviv Coll Engn, ACLP, Tel Aviv, Israel
关键词
Cluster validity; Logistic Regression; I-vectors; Mean-shift; PLDA; MEAN SHIFT;
D O I
10.21437/Interspeech.2017-492
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on estimating clustering validity by using logistic regression. For many applications it might be important to estimate the quality of the clustering, e.g. in case of speech segments' clustering, make a decision whether to use the clustered data for speaker verification. In the case of short segments speakers clustering, the common criteria for cluster validity are average cluster purity (ACP). average speaker purity (ASP) and K - the geometric mean between the two measures. As in practice, true labels are not available for evaluation. hence they have to be estimated from the clustering itself. In this paper. mean shift clustering with PLDA score is applied in order to cluster short speaker segments represented as i-vectors. Different statistical parameters are then estimated on the clustered data and arc used to train logistic regression to estimate ACP, ASP and K. It was found that logistic regression can be a good predictor of the actual ACP, ASP and K. and yields reasonable information regarding the clustering quality.
引用
收藏
页码:3577 / 3581
页数:5
相关论文
共 50 条
  • [1] Speaker clustering quality estimation with logistic regression
    Cohen, Yishai
    Lapidot, Itshak
    [J]. COMPUTER SPEECH AND LANGUAGE, 2021, 65
  • [2] A Procedure for Estimating the Number of Clusters in Logistic Regression Clustering
    Qian, Guoqi
    Wu, Yuehua
    Shao, Qing
    [J]. JOURNAL OF CLASSIFICATION, 2009, 26 (02) : 183 - 199
  • [3] A Procedure for Estimating the Number of Clusters in Logistic Regression Clustering
    Guoqi Qian
    Yuehua Wu
    Qing Shao
    [J]. Journal of Classification, 2009, 26 : 183 - 199
  • [4] Regularized Logistic Regression Fusion for Speaker Verification
    Hautamaki, Ville
    Lee, Kong Aik
    Kinnunen, Tomi
    Ma, Bin
    Li, Haizhou
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2756 - +
  • [5] Estimating sustainable harvest in wolverine populations using logistic regression
    Dalerum, Fredrik
    Shults, Brad
    Kunkel, Kyran
    [J]. JOURNAL OF WILDLIFE MANAGEMENT, 2008, 72 (05): : 1125 - 1132
  • [6] A simple method for estimating relative risk using logistic regression
    Diaz-Quijano, Fredi A.
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2012, 12
  • [7] A simple method for estimating relative risk using logistic regression
    Fredi A Diaz-Quijano
    [J]. BMC Medical Research Methodology, 12
  • [8] Defining and estimating the reliability of physician quality measures in hierarchical logistic regression models
    Hwang, Jessica
    Adams, John L.
    Paddock, Susan M.
    [J]. HEALTH SERVICES AND OUTCOMES RESEARCH METHODOLOGY, 2021, 21 (01) : 111 - 130
  • [9] Defining and estimating the reliability of physician quality measures in hierarchical logistic regression models
    Jessica Hwang
    John L. Adams
    Susan M. Paddock
    [J]. Health Services and Outcomes Research Methodology, 2021, 21 : 111 - 130
  • [10] Estimating the causes of traffic accidents using logistic regression and discriminant analysis
    Karacasu, Murat
    Ergul, Baris
    Yavuz, Arzu Altin
    [J]. INTERNATIONAL JOURNAL OF INJURY CONTROL AND SAFETY PROMOTION, 2014, 21 (04) : 305 - 312