Speech-Music-Noise Discrimination in Sound Indexing of Multimedia Documents

Authors
Bouafif, Lamia [1]
Ellouze, Noureddine [2]
Affiliations
[1] Natl Inst Biomed Studies Tunis, Tunis 1092, Tunisia
[2] Univ Tunis El Manar, Image & Signal Proc Lab, ENIT BP 37, Tunis 1064, Tunisia
Source
SOUND AND VIBRATION | 2018 / Vol. 52 / No. 06
Keywords
Speech processing; audio indexing; training and recognition;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Sound indexing and segmentation of digital documents, especially on the internet and in digital libraries, are very useful for simplifying and accelerating multimedia document retrieval. Multimedia files could then be retrieved not only by keywords but also by the semantic content of their speech. The main difficulty lies in the parameterization and modelling of the sound track and in discriminating between speech, music and noise segments. In this paper, we present a Speech/Music/Noise indexing interface designed for audio discrimination in multimedia documents. The program uses a statistical method based on ANN and HMM classifiers. After pre-emphasis and segmentation, the audio segments are analysed using cepstral acoustic analysis. The developed system was evaluated on a database consisting of songs with Arabic speech segments, recorded under several noisy environments.
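The front end named in the abstract (pre-emphasis, segmentation, cepstral analysis) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record gives no parameter values, so the pre-emphasis coefficient (0.97), the Hamming window, the frame length and hop size, and the choice of the plain real cepstrum (rather than, say, mel-frequency cepstral coefficients) are all assumptions made for illustration. The ANN/HMM classification stage is not shown.

```python
import cmath
import math


def pre_emphasis(signal, alpha=0.97):
    """First-order high-pass filter y[n] = x[n] - alpha * x[n-1].

    alpha=0.97 is a commonly used value, assumed here (not from the paper).
    """
    if not signal:
        return []
    return [signal[0]] + [
        signal[n] - alpha * signal[n - 1] for n in range(1, len(signal))
    ]


def frame_signal(signal, frame_len, hop):
    """Split a signal into overlapping frames and apply a Hamming window.

    The Hamming window is an assumption; trailing samples that do not
    fill a whole frame are dropped.
    """
    window = [
        0.54 - 0.46 * math.cos(2 * math.pi * n / (frame_len - 1))
        for n in range(frame_len)
    ]
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frames.append([signal[start + n] * window[n] for n in range(frame_len)])
    return frames


def dft(x):
    """Naive O(N^2) discrete Fourier transform (fine for short frames)."""
    size = len(x)
    return [
        sum(x[n] * cmath.exp(-2j * math.pi * k * n / size) for n in range(size))
        for k in range(size)
    ]


def real_cepstrum(frame, floor=1e-10):
    """Real cepstrum: inverse DFT of the log magnitude spectrum.

    `floor` avoids log(0) on empty spectral bins (illustrative choice).
    """
    log_mag = [math.log(max(abs(value), floor)) for value in dft(frame)]
    size = len(log_mag)
    return [
        sum(
            log_mag[k] * cmath.exp(2j * math.pi * k * n / size)
            for k in range(size)
        ).real / size
        for n in range(size)
    ]


if __name__ == "__main__":
    # Synthetic two-tone test signal standing in for an audio segment.
    samples = [
        math.sin(2 * math.pi * 5 * n / 32) + 0.5 * math.sin(2 * math.pi * 3 * n / 32)
        for n in range(100)
    ]
    frames = frame_signal(pre_emphasis(samples), frame_len=32, hop=16)
    features = [real_cepstrum(frame)[:12] for frame in frames]
    print(len(features), "frames,", len(features[0]), "coefficients each")
```

In a system of the kind described, per-frame cepstral vectors such as these would be the input to the classifiers; a production front end would normally use an FFT rather than the naive transform used here for self-containment.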
Pages: 2 - 10
Page count: 9