Audio indexing using feature warping and fusion techniques

Authors
Sénac, C [1]
Ambikairajah, E [1]
Affiliation
[1] UPS 47, UMR 5505 CNRS, INP, Inst Rech Informat Toulouse, Toulouse, France
Keywords
audio indexing; classification; fusion; feature normalization;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper reports on improved speech and music indexing performance for radio broadcast under various noise conditions, obtained by fusing warped features with traditional features at the output stage. The system employs a bank of four parallel front ends, each using a different feature extraction technique, followed by classification into speech and music with Gaussian mixture models. The results are then automatically grouped into macro classes. Indexing was performed on 8 hours of manually labelled multilingual Radio France International broadcast recordings containing diverse speech and music content with different speaking styles, speakers, noise conditions and channels. For speech classification under the noisiest conditions, the warped features fused with traditional features produced an error rate three times smaller than that of either the warped features or the traditional features alone. Significant improvements were also found for speech classification under less noisy conditions.
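The abstract names two techniques, feature warping and fusion at the output stage, without giving details. The following Python sketch illustrates one common reading of each and is not the authors' implementation. It assumes that warping maps each feature value to the standard normal quantile of its rank within a sliding window, and that fusion is a weighted sum of per-frame scores from two front ends. The function names, the window length, the fusion weight and the decision threshold are all hypothetical.

```python
from statistics import NormalDist


def warp_feature(values, window=301):
    """Warp one feature stream toward a standard normal distribution.

    Assumption: each value is replaced by the normal quantile of its
    rank inside a sliding window centred on the current frame.
    """
    std_normal = NormalDist()  # mean 0, standard deviation 1
    half = window // 2
    warped = []
    for t, v in enumerate(values):
        # The window is truncated at the edges of the stream.
        win = values[max(0, t - half):min(len(values), t + half + 1)]
        n = len(win)
        below = sum(1 for x in win if x < v)
        equal = sum(1 for x in win if x == v)  # includes the frame itself
        rank = below + (equal + 1) / 2  # 1-based mid-rank, handles ties
        # (rank - 0.5) / n always lies strictly between 0 and 1.
        warped.append(std_normal.inv_cdf((rank - 0.5) / n))
    return warped


def fuse_scores(scores_a, scores_b, weight=0.5):
    """Weighted sum of per-frame scores from two front ends (assumed rule)."""
    return [weight * a + (1 - weight) * b for a, b in zip(scores_a, scores_b)]


def label_frames(fused, threshold=0.0):
    """Label a frame as speech when its fused score exceeds the threshold."""
    return ["speech" if s > threshold else "non-speech" for s in fused]


if __name__ == "__main__":
    # Toy per-frame scores standing in for GMM log-likelihood ratios.
    traditional = [1.0, -2.0, 0.5]
    warped_front_end = [3.0, 0.0, -1.5]
    print(label_frames(fuse_scores(traditional, warped_front_end)))
```

In a full system the scores passed to `fuse_scores` would come from the Gaussian mixture models attached to each front end; here they are toy numbers chosen only to exercise the functions.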
Pages: 359-362
Number of pages: 4