Rhythm detection for speech-music discrimination in MPEG compressed domain

被引:3
|
作者
Jarina, R [1 ]
O'Connor, N [1 ]
Marlow, S [1 ]
Murphy, N [1 ]
机构
[1] Dublin City Univ, Ctr Digital Video Proc, Dublin 9, Ireland
关键词
D O I
10.1109/ICDSP.2002.1027851
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A novel approach to speech-music discrimination based on rhythm (or beat) detection is introduced. Rhythmic pulses are detected by applying a long-term autocorrelation method on band-passed signals. This approach is combined with another, in which the features describe the energy peaks of the signal. The discriminator uses just three features that are computed from data directly taken from an MPEG-I bitstream. The discriminator was tested on more than 3 hours of audio data. Average recognition rate is 97.7%.
引用
收藏
页码:129 / 132
页数:4
相关论文
共 50 条
  • [1] SPEECH-MUSIC DISCRIMINATION: A DEEP LEARNING PERSPECTIVE
    Pikrakis, Aggelos
    Theodoridis, Sergios
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 616 - 620
  • [2] Speech-music discrimination using deep visual feature extractors
    Papakostas, Michalis
    Giannakopoulos, Theodoros
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 114 : 334 - 344
  • [3] Improvement to speech-music discrimination using sinusoidal model based features
    Jalil Shirazi
    Shahrokh Ghaemmaghami
    Multimedia Tools and Applications, 2010, 50 : 415 - 435
  • [4] Improvement to speech-music discrimination using sinusoidal model based features
    Shirazi, Jalil
    Ghaemmaghami, Shahrokh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2010, 50 (02) : 415 - 435
  • [5] Speech-Music Segmentation System for Speech Recognition
    Demir, Cemil
    Dogan, Mehmet Ugur
    2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 846 - 849
  • [6] An RNN-Based Speech-Music Discrimination Used for Hybrid Audio Coder
    Yang, Wanzhao
    Tu, Weiping
    Zheng, Jiaxi
    Zhang, Xiong
    Yang, Yuhong
    Song, Yucheng
    MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 : 81 - 92
  • [7] From close listening to distant listening: Developing tools for Speech-Music discrimination of Danish music radio
    Have, Iben
    Enevoldsen, Kenneth
    DIGITAL HUMANITIES QUARTERLY, 2021, 15 (01):
  • [8] Wipe transition detection method in MPEG compressed domain
    Liu, Yang
    Wu, Zhi-Mei
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2002, 30 (05): : 741 - 744
  • [9] Fast object detection and segmentation in MPEG compressed domain
    Sukmarg, O
    Rao, KR
    IEEE 2000 TENCON PROCEEDINGS, VOLS I-III: INTELLIGENT SYSTEMS AND TECHNOLOGIES FOR THE NEW MILLENNIUM, 2000, : B364 - B368
  • [10] Event detection from MPEG video in the compressed domain
    Yoon, K
    DeMenthon, D
    Doermann, D
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 819 - 822