Audio indexing using feature warping and fusion techniques

被引:0
|
作者
Sénac, C [1 ]
Ambikairajah, E [1 ]
机构
[1] UPS 47, UMR 5505 CNRS, INP, Inst Rech Informat Toulouse, Toulouse, France
关键词
audio indexing; classification; fusion; feature normalization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper reports on the improvement of speech and music indexation performance under various noisy conditions for radio broadcast using warped features fused with traditional features at the output stage. The system employs a bank of four parallel front ends followed by a classification in speech and music by Gaussian mixture models, where each front end employs a different feature extraction technique. Then an automatic gathering in macro classes is made. Indexing was performed on 8 hours of manually labelled radio broadcast from multilingual Radio France International recordings containing diverse speech an music content with different speaking styles, speakers, noise conditions and channels. For speech signal classification under the noisiest conditions, the warped features fused with traditional features produced an error rate three times smaller than that of either the warped features or the traditional features alone. Significant improvements were also found or speech classification under less noisy conditions.
引用
收藏
页码:359 / 362
页数:4
相关论文
共 50 条
  • [21] A Feature Level Fusion Fingerprint Indexing Approach Based on MV and MCC using SVM Classifier
    Parmar, Pooja A.
    Degadwala, Sheshang D.
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), VOL. 1, 2016, : 1024 - 1028
  • [22] AUDIO INDEXING FOR EFFICIENCY
    RAHMLOW, HF
    PEDRICK, L
    [J]. EDUCATIONAL TECHNOLOGY, 1978, 18 (01) : 52 - 54
  • [23] Time warping of audio signals
    Goldenstein, S
    Gomes, J
    [J]. COMPUTER GRAPHICS INTERNATIONAL, PROCEEDINGS, 1999, : 52 - 57
  • [24] Audio Indexing for YouTube
    Al Laham, Mohamad Nour
    Ayass, Imad
    Ghareeb, Majd
    El-Bazzal, Zouhair
    Raad, Mohamad
    [J]. 2015 FIFTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION AND COMMUNICATION TECHNOLOGY AND ITS APPLICATIONS (DICTAP), 2015, : 111 - 114
  • [25] Exact indexing of dynamic time warping
    Keogh, E
    Ratanamahatana, CA
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 7 (03) : 358 - 386
  • [26] Enhanced Feature Fusion Segmentation for Tumor Detection Using Intelligent Techniques
    Radha, R.
    Gopalakrishnan, R.
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 35 (03): : 3113 - 3127
  • [27] Domain Specific Audio Indexing Using Linguistic Information
    Pandey, L.
    Nathwani, K.
    Kaur, S.
    Husain, I.
    Pathak, R.
    Singh, G.
    Tiwari, S.
    Hegde, Rajesh M.
    [J]. 2014 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2014, : 364 - 369
  • [28] Diverse feature set based Keyphrase extraction and indexing techniques
    Sharma, Saurabh
    Gupta, Vishal
    Juneja, Mamta
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (03) : 4111 - 4142
  • [29] Diverse feature set based Keyphrase extraction and indexing techniques
    Saurabh Sharma
    Vishal Gupta
    Mamta Juneja
    [J]. Multimedia Tools and Applications, 2021, 80 : 4111 - 4142
  • [30] Exact indexing of dynamic time warping
    Eamonn Keogh
    Chotirat Ann Ratanamahatana
    [J]. Knowledge and Information Systems, 2005, 7 : 358 - 386