Evaluation of BIC-based algorithms for audio segmentation

被引:31
|
作者
Cettolo, M
Vescovi, M
Rizzi, R
机构
[1] ITCirst, Ctr Ric Sci & Tecnol, I-38050 Trento, Italy
[2] Univ Trent, Fac Sci, I-38050 Trento, Italy
来源
COMPUTER SPEECH AND LANGUAGE | 2005年 / 19卷 / 02期
关键词
D O I
10.1016/j.csl.2004.05.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Bayesian Information Criterion (BIC) is a widely adopted method for audio segmentation, and has inspired a number of dominant algorithms for this application. At present, however, literature lacks in analytical and experimental studies on these algorithms. This paper tries to partially cover this gap. Typically, BIC is applied within a sliding variable-size analysis window where single changes in the nature of the audio are locally searched. Three different implementations of the algorithm are described and compared: (i) the first keeps updated a pair of sums, that of input vectors and that of square input vectors, in order to save computations in estimating covariance matrices on partially shared data; (ii) the second implementation, recently proposed in literature, is based on the encoding of the input signal with cumulative statistics for an efficient estimation of covariance matrices; (iii) the third implementation consists of a novel approach, and is characterized by the encoding of the input stream with the cumulative pair of sums of the first approach. Furthermore, a dynamic programming algorithm is presented that, within the BIC model, finds a globally optimal segmentation of the input audio stream. All algorithms are analyzed in detail from the viewpoint of the computational cost, experimentally evaluated on proper tasks, and compared. (C) 2004 Elsevier Ltd. All rights reserved.
引用
收藏
页码:147 / 170
页数:24
相关论文
共 50 条
  • [1] BIC-based audio segmentation by divide-and-conquer
    Cheng, Shih-Sian
    Wang, Hsin-Min
    Fu, Hsin-Chia
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4841 - +
  • [2] Auditory Features Analysis for BIC-based Audio Segmentation
    Maka, Tomasz
    [J]. 2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS (SIGMAP), 2014, : 48 - 53
  • [3] Efficient audio segmentation algorithms based on the BIC
    Cettolo, M
    Vescovi, M
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL VI, PROCEEDINGS: SIGNAL PROCESSING THEORY AND METHODS, 2003, : 537 - 540
  • [4] Computationally efficient and robust BIC-based speaker segmentation
    Kotti, Margarita
    Benetos, Emmanouil
    Kotropoulos, Constantine
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (05): : 920 - 933
  • [5] Systematic comparison of BIC-based speaker segmentation systems
    Moschou, Vassiliki
    Kotti, Margarita
    Benetos, Emman Uil
    Kotropoulos, Constantine
    [J]. 2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2007, : 66 - 69
  • [6] BIC-Based Speaker Segmentation Using Divide-and-Conquer Strategies With Application to Speaker Diarization
    Cheng, Shih-Sian
    Wang, Hsin-Min
    Fu, Hsin-Chia
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 141 - 157
  • [7] A BIC-based consistent metric between Markovian processes
    Garcia, Jesus E.
    Gholizadeh, R.
    Gonzalez-Lopez, V. A.
    [J]. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2018, 34 (06) : 868 - 878
  • [8] Speaker Clustering Using Direct Maximization of A BIC-based Score
    Tsai, Wei-Ho
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 497 - 500
  • [9] An adaptive BIC approach for robust audio stream segmentation
    Zibert, Janez
    Brodnik, Andrej
    Mihelic, France
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2507 - +
  • [10] BIC-based unit-root detection: Simulation-based evidence
    Fukuda, Kosei
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2006, 183 (01) : 518 - 521