A dynamic programming approach to audio segmentation and speech/music discrimination

被引:0
|
作者
Goodwin, MM [1 ]
Laroche, J [1 ]
机构
[1] Creat Adv Technol Ctr, Scotts Valley, CA USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We consider the problem of segmenting an audio signal into characteristic regions based on feature-set similarities. In the proposed approach, a feature-space representation of the signal is generated; sequences of these feature-space samples are then aggregated into clusters corresponding to distinct signal regions. The algorithm consists of using linear discriminant analysis (LDA) to condition the feature space and dynamic programming (DP) to identify data clusters. In this paper, we consider the design of the dynamic program cost functions; we are able to derive effective cost functions without relying on significant prior information about the structure of the expected data clusters. We demonstrate the application of the LDA-DP segmentation algorithm to speech/music discrimination; experimental results are given and discussed.
引用
收藏
页码:309 / 312
页数:4
相关论文
共 50 条
  • [41] Perceptual Models for Speech, Audio, and Music Processing
    Jont B Allen
    Wai-Yip Geoffrey Chan
    Stephen Voran
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2007
  • [42] Speech-Music Segmentation System for Speech Recognition
    Demir, Cemil
    Dogan, Mehmet Ugur
    [J]. 2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 846 - 849
  • [43] Music Training for the Development of Speech Segmentation
    Francois, Clement
    Chobert, Julie
    Besson, Mireille
    Schoen, Daniele
    [J]. CEREBRAL CORTEX, 2013, 23 (09) : 2038 - 2043
  • [44] Audio segmentation by feature-space clustering using linear discriminant analysis and dynamic programming
    Goodwin, MM
    Laroche, J
    [J]. 2003 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS PROCEEDINGS, 2003, : 131 - 134
  • [45] Insertion, deletion robust audio watermarking: a set theoretic, dynamic programming approach
    Nadeau, Andrew
    Sharma, Gaurav
    [J]. MEDIA WATERMARKING, SECURITY, AND FORENSICS 2013, 2013, 8665
  • [46] Dynamic programming method for fine-tuning the boundary points in automatic segmentation of speech
    Szymanski, Marcin
    Grocholewski, Stefan
    [J]. ARCHIVES OF ACOUSTICS, 2007, 32 (01) : 127 - 134
  • [47] AUDIO SEGMENTATION FOR SPEECH RECOGNITION USING SEGMENT FEATURES
    Rybach, David
    Gollan, Christian
    Schlueter, Ralf
    Ney, Hermann
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4197 - 4200
  • [48] A machine learning approach to dynamic programming for stochastic process of speech recognition
    Ding, Ing-Jr
    Yen, Chih-Ta
    Hsu, Yen-Ming
    [J]. INNOVATION, COMMUNICATION AND ENGINEERING, 2014, : 351 - 354
  • [49] Special Issue on Dereverberation and Reverberation of Audio, Music, and Speech
    Spriet, Ann
    Goetze, Stefan
    van Waterschoot, Toon
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (1-2): : 6 - 7
  • [50] Novel features for effective speech and music discrimination
    Muharak, Omer Mohsin
    Ambikairajah, Eliathamby
    Epps, Julien
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ENGINEERING OF INTELLIGENT SYSTEMS, 2006, : 343 - +