Rhythm detection for speech-music discrimination in MPEG compressed domain

被引:3
|
作者
Jarina, R [1 ]
O'Connor, N [1 ]
Marlow, S [1 ]
Murphy, N [1 ]
机构
[1] Dublin City Univ, Ctr Digital Video Proc, Dublin 9, Ireland
关键词
D O I
10.1109/ICDSP.2002.1027851
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A novel approach to speech-music discrimination based on rhythm (or beat) detection is introduced. Rhythmic pulses are detected by applying a long-term autocorrelation method on band-passed signals. This approach is combined with another, in which the features describe the energy peaks of the signal. The discriminator uses just three features that are computed from data directly taken from an MPEG-I bitstream. The discriminator was tested on more than 3 hours of audio data. Average recognition rate is 97.7%.
引用
收藏
页码:129 / 132
页数:4
相关论文
共 50 条
  • [41] Fast Camera Motion Estimation in MPEG Compressed Domain
    Weng, Ying
    Jiang, Jianmin
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (03) : 1329 - 1335
  • [42] Compressed-domain registration techniques for MPEG video
    Lee, MS
    Shen, M
    Kuo, CCJ
    IMAGE AND VIDEO COMMUNICATIONS AND PROCESSING 2005, PTS 1 AND 2, 2005, 5685 : 1043 - 1052
  • [43] A scalable video scrambling method in MPEG compressed domain
    Takayama, Makoto
    Tanakat, Kiyoshi
    Takagi, Koichi
    Nakajima, Yasuyuki
    2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 1035 - +
  • [44] Compressed domain spatial scaling of MPEG video sequences
    Ghandi, MM
    Modirzadeh, ME
    Hashemi, MR
    Fatemi, O
    2002 INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, DIGEST OF TECHNICAL PAPERS, 2002, : 138 - 139
  • [45] Compressed-domain video watermarking of MPEG streams
    Simitopoulos, D
    Tsaftaris, SA
    Boulgouris, NV
    Strintzis, MG
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : 569 - 572
  • [46] Shot detection from MPEG compressed video
    Hwang, HC
    Kim, DG
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2004, E87A (06): : 1509 - 1513
  • [47] Target Detection from MPEG Video Based on Low-Rank Filtering in the Compressed Domain
    Viangteeravat, Teeradache
    Krootjohn, Soradech
    Wilkes, D. Mitchell
    SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION XIX, 2010, 7697
  • [48] Defeating the blocking effect in the filter steered robust compressed domain object detection in MPEG videos
    Ahmad, AMA
    Lee, SY
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VI, PROCEEDINGS: IMAGE, ACOUSTIC, SIGNAL PROCESSING AND OPTICAL SYSTEMS, TECHNOLOGIES AND APPLICATIONS, 2004, : 1 - 6
  • [49] Clean vs. Overlapped Speech-Music Detection Using Harmonic-Percussive Features and Multi-Task Learning
    Bhattacharjee, Mrinmoy
    Prasanna, S. R. M.
    Guha, Prithwijit
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1 - 10
  • [50] Replay boundary detection in MPEG compressed video
    Ouyang, JQ
    Li, JT
    Zhang, YD
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 2800 - 2804