Estimating Speaker Clustering Quality Using Logistic Regression

被引:3
|
作者
Cohen, Yishai [1 ]
Lapidot, Itshak [1 ]
机构
[1] Afeka Tel Aviv Coll Engn, ACLP, Tel Aviv, Israel
关键词
Cluster validity; Logistic Regression; I-vectors; Mean-shift; PLDA; MEAN SHIFT;
D O I
10.21437/Interspeech.2017-492
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on estimating clustering validity by using logistic regression. For many applications it might be important to estimate the quality of the clustering, e.g. in case of speech segments' clustering, make a decision whether to use the clustered data for speaker verification. In the case of short segments speakers clustering, the common criteria for cluster validity are average cluster purity (ACP). average speaker purity (ASP) and K - the geometric mean between the two measures. As in practice, true labels are not available for evaluation. hence they have to be estimated from the clustering itself. In this paper. mean shift clustering with PLDA score is applied in order to cluster short speaker segments represented as i-vectors. Different statistical parameters are then estimated on the clustered data and arc used to train logistic regression to estimate ACP, ASP and K. It was found that logistic regression can be a good predictor of the actual ACP, ASP and K. and yields reasonable information regarding the clustering quality.
引用
收藏
页码:3577 / 3581
页数:5
相关论文
共 50 条