Speech/music discrimination for multimedia applications

被引：0

作者：

El-Maleh, K ^{[1
]}

Klein, M ^{[1
]}

Petrucci, G ^{[1
]}

Kabal, P ^{[1
]}

机构：

[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 2A7, Canada

来源：

2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI | 2000年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

dAutomatic discrimination of speech and music is an important tool in many multimedia applications. Previous work has focused on using long-term features such as differential parameters, variances, and time-averages of spectral parameters. These classifiers use features estimated over windows of 0.5-5 seconds, and are relatively complex. In this paper, we present our results of combining the line spectral frequencies (LSFs) and zero-crossing-based features for frame-level narrowband speech/music discrimination. Our classification results for different types of music and speech show the good discriminating power of these features. Our classification algorithms operate using only a frame delay of 20 ms, making them suitable for real-time multimedia applications.

引用

页码：2445 / 2448

页数：4

共 50 条

[1] Speech-Music-Noise Discrimination in Sound Indexing of Multimedia Documents
Bouafif, Lamia
Ellouze, Noureddine
[J]. SOUND AND VIBRATION, 2018, 52 (06): : 2 - 10
[2] MUSIC TONALITY FEATURES FOR SPEECH/MUSIC DISCRIMINATION
Sell, Gregory
Clark, Pascal
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[3] Speech/music discrimination for robust speech recognition in robots
Choi, Mu Yeol
Song, Hwa Jeon
Kim, Hyung Soon
[J]. 2007 RO-MAN: 16TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, VOLS 1-3, 2007, : 118 - +
[4] A New Feature for Speech/Music Discrimination
Huang, Houjun
Xu, Yunfei
Zhou, Ruohua
[J]. INTERNATIONAL ACADEMIC CONFERENCE ON THE INFORMATION SCIENCE AND COMMUNICATION ENGINEERING (ISCE 2014), 2014, : 133 - 137
[5] A multifeature speech/music discrimination system
Saad, EM
El-Adawy, MI
Abu-El-Wafa, ME
Wahba, AA
[J]. IEEE CCEC 2002: CANADIAN CONFERENCE ON ELECTRCIAL AND COMPUTER ENGINEERING, VOLS 1-3, CONFERENCE PROCEEDINGS, 2002, : 1055 - 1058
[6] Efficient audio-driven multimedia indexing through similarity-based speech/music discrimination
Tsipas, Nikolaos
Vrysis, Lazaros
Dimoulas, Charalampos
Papanikolaou, George
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (24) : 25603 - 25621
[7] Feature extraction for speech and music discrimination
Hou, Huiyu
Sadka, Abdul
Jiang, Richard M.
[J]. 2008 INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING, 2008, : 154 - 157
[8] Efficient audio-driven multimedia indexing through similarity-based speech / music discrimination
Nikolaos Tsipas
Lazaros Vrysis
Charalampos Dimoulas
George Papanikolaou
[J]. Multimedia Tools and Applications, 2017, 76 : 25603 - 25621
[9] A fast and robust speech/music discrimination approach
Wang, WQ
Gao, W
Ying, DW
[J]. ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1325 - 1329
[10] A comparison of features for speech, music discrimination.
Carey, MJ
Parris, ES
Lloyd-Thomas, H
[J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 149 - 152

← 1 2 3 4 5 →