Speech/music discrimination for multimedia applications

被引:0
|
作者
El-Maleh, K [1 ]
Klein, M [1 ]
Petrucci, G [1 ]
Kabal, P [1 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 2A7, Canada
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
dAutomatic discrimination of speech and music is an important tool in many multimedia applications. Previous work has focused on using long-term features such as differential parameters, variances, and time-averages of spectral parameters. These classifiers use features estimated over windows of 0.5-5 seconds, and are relatively complex. In this paper, we present our results of combining the line spectral frequencies (LSFs) and zero-crossing-based features for frame-level narrowband speech/music discrimination. Our classification results for different types of music and speech show the good discriminating power of these features. Our classification algorithms operate using only a frame delay of 20 ms, making them suitable for real-time multimedia applications.
引用
收藏
页码:2445 / 2448
页数:4
相关论文
共 50 条
  • [1] Speech-Music-Noise Discrimination in Sound Indexing of Multimedia Documents
    Bouafif, Lamia
    Ellouze, Noureddine
    [J]. SOUND AND VIBRATION, 2018, 52 (06): : 2 - 10
  • [2] MUSIC TONALITY FEATURES FOR SPEECH/MUSIC DISCRIMINATION
    Sell, Gregory
    Clark, Pascal
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [3] Speech/music discrimination for robust speech recognition in robots
    Choi, Mu Yeol
    Song, Hwa Jeon
    Kim, Hyung Soon
    [J]. 2007 RO-MAN: 16TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, VOLS 1-3, 2007, : 118 - +
  • [4] A New Feature for Speech/Music Discrimination
    Huang, Houjun
    Xu, Yunfei
    Zhou, Ruohua
    [J]. INTERNATIONAL ACADEMIC CONFERENCE ON THE INFORMATION SCIENCE AND COMMUNICATION ENGINEERING (ISCE 2014), 2014, : 133 - 137
  • [5] A multifeature speech/music discrimination system
    Saad, EM
    El-Adawy, MI
    Abu-El-Wafa, ME
    Wahba, AA
    [J]. IEEE CCEC 2002: CANADIAN CONFERENCE ON ELECTRCIAL AND COMPUTER ENGINEERING, VOLS 1-3, CONFERENCE PROCEEDINGS, 2002, : 1055 - 1058
  • [6] Efficient audio-driven multimedia indexing through similarity-based speech/music discrimination
    Tsipas, Nikolaos
    Vrysis, Lazaros
    Dimoulas, Charalampos
    Papanikolaou, George
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (24) : 25603 - 25621
  • [7] Feature extraction for speech and music discrimination
    Hou, Huiyu
    Sadka, Abdul
    Jiang, Richard M.
    [J]. 2008 INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING, 2008, : 154 - 157
  • [8] Efficient audio-driven multimedia indexing through similarity-based speech / music discrimination
    Nikolaos Tsipas
    Lazaros Vrysis
    Charalampos Dimoulas
    George Papanikolaou
    [J]. Multimedia Tools and Applications, 2017, 76 : 25603 - 25621
  • [9] A fast and robust speech/music discrimination approach
    Wang, WQ
    Gao, W
    Ying, DW
    [J]. ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1325 - 1329
  • [10] A comparison of features for speech, music discrimination.
    Carey, MJ
    Parris, ES
    Lloyd-Thomas, H
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 149 - 152