Auditory Features Analysis for BIC-based Audio Segmentation

Cited by: 0
Authors
Maka, Tomasz [1]
Affiliations
[1] West Pomeranian Univ Technol, Fac Comp Sci & Informat Technol, Zolnierska 49, PL-71210 Szczecin, Poland
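Keywords
Auditory Features; Audio Segmentation; Delta-BIC Segmentation
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract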
Audio segmentation is one of the stages in the audio processing chain, and its accuracy plays a primary role in the final performance of audio recognition and processing tasks. This paper presents an analysis of auditory features for audio segmentation. A set of features is derived from a time-frequency representation of the input signal and calculated based on properties of the human auditory system. The efficiency of several sets of audio features for BIC-based (Bayesian Information Criterion) audio segmentation has been analyzed. The obtained results show that auditory features derived from different frequency scales are competitive with the widely used MFCC features in terms of accuracy and the number of detected points.
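For readers unfamiliar with the Delta-BIC criterion named in the keywords, the following is a minimal sketch of the commonly used formulation, not the paper's own implementation. It assumes each segment is modelled by a single full-covariance Gaussian, that a positive score suggests a change point, and that the feature frames (MFCC or auditory features) are already computed. The names `delta_bic`, `best_split`, `penalty_weight` and `margin` are hypothetical and chosen only for illustration.

```python
import numpy as np


def _log_det_cov(x):
    """Log-determinant of the (lightly regularised) ML covariance of x."""
    d = x.shape[1]
    cov = np.atleast_2d(np.cov(x, rowvar=False, bias=True))
    cov = cov + 1e-6 * np.eye(d)  # assumption: small ridge for numerical stability
    _, log_det = np.linalg.slogdet(cov)
    return log_det


def delta_bic(frames, split, penalty_weight=1.0):
    """Delta-BIC score for splitting `frames` (n_frames x n_dims) at `split`.

    Compares one Gaussian over the whole window against two Gaussians,
    one per side of the split. Under the sign convention assumed here,
    a positive value favours a change point at `split`.
    """
    n, d = frames.shape
    left, right = frames[:split], frames[split:]
    gain = 0.5 * (
        n * _log_det_cov(frames)
        - len(left) * _log_det_cov(left)
        - len(right) * _log_det_cov(right)
    )
    # Extra parameters of the two-model hypothesis: d means + d(d+1)/2 covariances.
    penalty = 0.5 * penalty_weight * (d + 0.5 * d * (d + 1)) * np.log(n)
    return gain - penalty


def best_split(frames, margin=20, penalty_weight=1.0):
    """Return (index, score) of the candidate split with the highest Delta-BIC."""
    candidates = range(margin, len(frames) - margin)
    scores = [delta_bic(frames, i, penalty_weight) for i in candidates]
    k = int(np.argmax(scores))
    return margin + k, scores[k]
```

In practice the score is evaluated over a sliding or growing window, and a boundary is reported only when the best score is positive; the weight on the penalty term is usually tuned on held-out data.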
Pages: 48-53
Page count: 6
Related papers
50 in total
  • [1] Evaluation of BIC-based algorithms for audio segmentation
    Cettolo, M
    Vescovi, M
    Rizzi, R
    [J]. COMPUTER SPEECH AND LANGUAGE, 2005, 19 (02): : 147 - 170
  • [2] BIC-based audio segmentation by divide-and-conquer
    Cheng, Shih-Sian
    Wang, Hsin-Min
    Fu, Hsin-Chia
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4841 - +
  • [3] Computationally efficient and robust BIC-based speaker segmentation
    Kotti, Margarita
    Benetos, Emmanouil
    Kotropoulos, Constantine
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (05): : 920 - 933
  • [4] Systematic comparison of BIC-based speaker segmentation systems
    Moschou, Vassiliki
    Kotti, Margarita
    Benetos, Emmanouil
    Kotropoulos, Constantine
    [J]. 2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2007, : 66 - 69
  • [5] Efficient audio segmentation algorithms based on the BIC
    Cettolo, M
    Vescovi, M
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL VI, PROCEEDINGS: SIGNAL PROCESSING THEORY AND METHODS, 2003, : 537 - 540
  • [6] BIC-Based Speaker Segmentation Using Divide-and-Conquer Strategies With Application to Speaker Diarization
    Cheng, Shih-Sian
    Wang, Hsin-Min
    Fu, Hsin-Chia
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 141 - 157
  • [7] A BIC-based consistent metric between Markovian processes
    Garcia, Jesus E.
    Gholizadeh, R.
    Gonzalez-Lopez, V. A.
    [J]. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2018, 34 (06) : 868 - 878
  • [8] Speaker Clustering Using Direct Maximization of A BIC-based Score
    Tsai, Wei-Ho
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 497 - 500
  • [9] Audio elements based auditory scene segmentation
    Lu, Lie
    Cai, Rui
    Hanjalic, Alan
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 4875 - 4878
  • [10] An adaptive BIC approach for robust audio stream segmentation
    Zibert, Janez
    Brodnik, Andrej
    Mihelic, France
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2507 - +