Audio indexing using feature warping and fusion techniques

被引：0

作者：

Sénac, C ^{[1
]}

Ambikairajah, E ^{[1
]}

机构：

[1] UPS 47, UMR 5505 CNRS, INP, Inst Rech Informat Toulouse, Toulouse, France

来源：

2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING | 2004年

关键词：

audio indexing; classification; fusion; feature normalization;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper reports on the improvement of speech and music indexation performance under various noisy conditions for radio broadcast using warped features fused with traditional features at the output stage. The system employs a bank of four parallel front ends followed by a classification in speech and music by Gaussian mixture models, where each front end employs a different feature extraction technique. Then an automatic gathering in macro classes is made. Indexing was performed on 8 hours of manually labelled radio broadcast from multilingual Radio France International recordings containing diverse speech an music content with different speaking styles, speakers, noise conditions and channels. For speech signal classification under the noisiest conditions, the warped features fused with traditional features produced an error rate three times smaller than that of either the warped features or the traditional features alone. Significant improvements were also found or speech classification under less noisy conditions.

引用

页码：359 / 362

页数：4

共 50 条

[1] Image indexing using the color and bit pattern feature fusion
Guo, Jing-Ming
Prasetyo, Heri
Su, Huai-Sheng
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2013, 24 (08) : 1360 - 1379
[2] Mobile agent audio source indexing using acoustic source localization techniques
Gelowitz, CM
Benedicenti, L
[J]. 6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XV, PROCEEDINGS: MOBILE/WIRELESS COMPUTING AND COMMUNICATION SYSTEMS III, 2002, : 395 - 398
[3] Audio Anti-Spoofing Based on Audio Feature Fusion
Zhang, Jiachen
Tu, Guoqing
Liu, Shubo
Cai, Zhaohui
[J]. ALGORITHMS, 2023, 16 (07)
[4] Using multi-audio feature fusion for android malware detection
Tarwireyi, Paul
Terzoli, Alfredo
Adigun, Matthew
[J]. COMPUTERS & SECURITY, 2023, 131
[5] Video indexing using speech recognition techniques in audio channel preliminary system design
Gu, LY
[J]. Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004, : 342 - 345
[6] Face recognition using fusion of feature learning techniques
Umer, Saiyed
Dhara, Bibhas Chandra
Chanda, Bhabatosh
[J]. MEASUREMENT, 2019, 146 : 43 - 54
[7] An efficient audio watermarking by using spectrum warping
Choi, KP
Lee, KY
[J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2002, E85A (06): : 1257 - 1264
[8] Speech/Music Discrimination using Hybrid-Based Feature Extraction for Audio Data Indexing
Wang, Kun-Ching
Yang, Yung-Ming
Yang, Ying-Ru
[J]. 2017 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2017, : 515 - 519
[9] Wind Sounds Classification Using Different Audio Feature Extraction Techniques
Jasim, Wala'a Nsaif
Saddam, Saba Abdual Wahid
Harfash, Esra'a Jasem
[J]. INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2021, 45 (07): : 57 - 65
[10] A BLIND AUDIO STEGANALYSIS BASED ON FEATURE FUSION
Wei Yifang Guo Li Wang Yujie Wang Cuiping (Department of Electronic Science and Technology
[J]. Journal of Electronics(China), 2011, 28 (03) : 265 - 276

← 1 2 3 4 5 →