Segment-oriented evaluation of speaker diarisation performance

被引:0
|
作者
Milner, Rosanna [1 ]
Hain, Thomas [1 ]
机构
[1] Univ Sheffield, Speech & Hearing Res Grp, Sheffield S10 2TN, S Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
speaker diarisation; diarisation error rate; boundary information; purity measures;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
High performance diarisation is a necessity for a variety of applications, and the task has been studied extensively in the context of broadcast news and meeting processing. Upon introduction of the task in NIST led evaluations, diarisation error rate (DER) was introduced as the standard metric for evaluation, and it has been consistently used to compare systems ever since. DER is a frame based metric that does not penalise for producing many short segments. However, practical systems that require diarisation input are typically not able to cope well with such artefacts. In this paper we illustrate the need for an alternative metric focussing on segments, instead of duration or boundaries only. We propose a segment based F-measure, which specifically addresses issues such as reference errors, matching start and end boundaries, and speaker pairing. The performance of the metric is analysed in the context of state-of-the-art systems and compared with other existing metrics. It is shown to give a deeper insight into the segmentation quality over the standard metrics, and thus better value for to understand impact on follow on tasks such as ASR.
引用
收藏
页码:5460 / 5464
页数:5
相关论文
共 50 条
  • [1] Segment-oriented approach to liver resection
    Liau, KH
    Blumgart, LH
    DeMatteo, RP
    [J]. SURGICAL CLINICS OF NORTH AMERICA, 2004, 84 (02) : 543 - +
  • [2] Segment-Oriented Depiction and Analysis for Hyperspectral Image Data
    Yin, Jihao
    Qv, Hui
    Luo, Xiaoyan
    Jia, Xiuping
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (07): : 3982 - 3996
  • [3] Adapting Speaker Embeddings for Speaker Diarisation
    Kwon, Youngki
    Jung, Jee-weon
    Heo, Hee-Soo
    Kim, You Jin
    Lee, Bong-Jin
    Chung, Joon Son
    [J]. INTERSPEECH 2021, 2021, : 3101 - 3105
  • [4] SEGMENT-ORIENTED LIVER RESECTION - PRINCIPLES, TECHNIQUES, THERAPEUTIC VALUE
    SCHEELE, J
    [J]. CHIRURG, 1989, 60 (04): : 251 - 265
  • [5] Segment-oriented hepatic resection in the management of malignant neoplasms of the liver
    Billingsley, KG
    Jarnagin, WR
    Fong, Y
    Blumgart, LH
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF SURGEONS, 1998, 187 (05) : 471 - 481
  • [6] CONTENT-AWARE SPEAKER EMBEDDINGS FOR SPEAKER DIARISATION
    Sun, G.
    Liu, D.
    Zhang, C.
    Woodland, P. C.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7168 - 7172
  • [7] Combination of deep speaker embeddings for diarisation
    Sun, Guangzhi
    Zhang, Chao
    Woodland, Philip C.
    [J]. NEURAL NETWORKS, 2021, 141 : 372 - 384
  • [8] DEVELOPMENT OF THE MARKETING COMMUNICATIONS OF COMMERCIAL BANKS THROUGH A SEGMENT-ORIENTED APPROACH
    Demko, M.
    Kosar, N.
    Kuzo, N.
    Jo, Pochopien
    [J]. FINANCIAL AND CREDIT ACTIVITY-PROBLEMS OF THEORY AND PRACTICE, 2021, 3 (38): : 35 - 45
  • [9] DNN APPROACH TO SPEAKER DIARISATION USING SPEAKER CHANNELS
    Milner, Rosanna
    Hain, Thomas
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4925 - 4929
  • [10] Speaker overlap detection with prosodic features for speaker diarisation
    Zelenak, M.
    Hernando, J.
    [J]. IET SIGNAL PROCESSING, 2012, 6 (08) : 798 - 804