Speech-Music-Noise Discrimination in Sound Indexing of Multimedia Documents

Cited by: 0

Authors:

Bouafif, Lamia [1]

Ellouze, Noureddine [2]

Affiliations:

[1] Natl Inst Biomed Studies Tunis, Tunis 1092, Tunisia

[2] Univ Tunis El Manar, Image & Signal Proc Lab, ENIT BP 37, Tunis 1064, Tunisia

Source:

SOUND AND VIBRATION | 2018 / Vol. 52 / No. 06

Keywords:

Speech processing; audio indexing; training and recognition

DOI:

Not available

Chinese Library Classification:

O42 [Acoustics]

Subject classification codes:

070206; 082403

Abstract:

Sound indexing and segmentation of digital documents, especially on the internet and in digital libraries, are very useful for simplifying and accelerating multimedia document retrieval. Multimedia files could then be retrieved not only by keywords but also by the semantic content of their speech. The main difficulties of this operation are the parameterization and modelling of the soundtrack and the discrimination of speech, music, and noise segments. In this paper, we present a Speech/Music/Noise indexing interface designed for audio discrimination in multimedia documents. The program uses a statistical method based on ANN and HMM classifiers. After pre-emphasis and segmentation, the audio segments are analysed by cepstral acoustic analysis. The developed system was evaluated on a database consisting of songs with Arabic speech segments under several noisy environments.
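The front end named in the abstract (pre-emphasis, segmentation, cepstral analysis) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no parameter values, so the pre-emphasis coefficient (0.97), the frame length and hop, the Hamming window, and the number of cepstral coefficients (13) are all assumptions, and the function names are hypothetical. The sketch computes a plain real cepstrum, which may differ from the cepstral variant used in the paper, and it does not cover the ANN and HMM classifiers.

```python
import cmath
import math


def pre_emphasis(signal, alpha=0.97):
    """First-order high-pass filter y[n] = x[n] - alpha * x[n-1] (alpha is assumed)."""
    if not signal:
        return []
    return [signal[0]] + [signal[n] - alpha * signal[n - 1]
                          for n in range(1, len(signal))]


def split_frames(signal, frame_len, hop):
    """Cut the signal into overlapping frames, dropping any incomplete tail."""
    return [signal[start:start + frame_len]
            for start in range(0, len(signal) - frame_len + 1, hop)]


def hamming(frame):
    """Apply a Hamming window to reduce spectral leakage at the frame edges."""
    n = len(frame)
    if n == 1:
        return list(frame)
    return [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
            for i, x in enumerate(frame)]


def dft(values):
    """Naive O(N^2) discrete Fourier transform, adequate for short frames."""
    n = len(values)
    return [sum(v * cmath.exp(-2j * math.pi * k * t / n)
                for t, v in enumerate(values))
            for k in range(n)]


def real_cepstrum(frame, n_coeffs=13, floor=1e-10):
    """Real cepstrum: inverse DFT of the log magnitude spectrum of the frame."""
    log_mag = [math.log(max(abs(c), floor)) for c in dft(hamming(frame))]
    n = len(log_mag)
    # The log magnitude spectrum of a real frame is real and symmetric,
    # so the inverse transform is real; keep only the first n_coeffs values.
    return [sum(m * cmath.exp(2j * math.pi * k * q / n)
                for k, m in enumerate(log_mag)).real / n
            for q in range(n_coeffs)]


def cepstral_features(signal, frame_len=64, hop=32, n_coeffs=13):
    """Pre-emphasise, frame, and return one cepstral vector per frame."""
    emphasised = pre_emphasis(signal)
    return [real_cepstrum(frame, n_coeffs)
            for frame in split_frames(emphasised, frame_len, hop)]


if __name__ == "__main__":
    # Synthetic test tone standing in for an audio segment.
    tone = [math.sin(2 * math.pi * 5 * t / 64) for t in range(256)]
    features = cepstral_features(tone)
    print(len(features), "frames x", len(features[0]), "coefficients")
```

In a full system, each per-frame vector would be passed to a trained classifier; the naive DFT is used here only to keep the sketch free of external dependencies.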


Pages: 2-10

Page count: 9
