Sound analysis using MPEG compressed audio

Cited by: 0
Authors
Tzanetakis, G [1]
Cook, P [1]
Affiliation
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
Keywords
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
A huge amount of the available audio data is compressed using the MPEG audio compression standard. Sound analysis is based on computing short-time feature vectors that describe the instantaneous spectral content of the sound. An interesting possibility is to calculate features directly from the compressed data. Since the bulk of the feature calculation is performed during the encoding stage, this approach has a significant performance advantage when the available data is already compressed. Combining decoding and analysis in one stage is also very important for audio streaming applications. In this paper, we describe the calculation of features directly from MPEG audio compressed data. Two of the basic processes of analyzing sound are segmentation and classification. To illustrate the effectiveness of the calculated features, we have implemented two case studies: a general audio segmentation algorithm and a Music/Speech classifier. Experimental data is provided to show that the results obtained are comparable with those of sound analysis algorithms working directly with audio samples.
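As a rough illustration of short-time features computed from subband data rather than from audio samples, the sketch below takes per-frame subband magnitudes and derives three common spectral descriptors. It is a minimal sketch under stated assumptions, not the authors' implementation: this record does not list the paper's feature set, so centroid, rolloff, and flux are assumed here as typical choices, and the input is assumed to be already-extracted magnitudes for the 32 subbands of the MPEG-1 audio filterbank. All function names are hypothetical, and no bitstream parsing is shown.

```python
from typing import Dict, List, Sequence

NUM_SUBBANDS = 32  # MPEG-1 audio uses a 32-band polyphase filterbank


def centroid(mags: Sequence[float]) -> float:
    """Magnitude-weighted mean subband index (0.0 for a silent frame)."""
    total = sum(mags)
    if total == 0:
        return 0.0
    return sum(i * m for i, m in enumerate(mags)) / total


def rolloff(mags: Sequence[float], fraction: float = 0.85) -> int:
    """Lowest subband index below which `fraction` of the magnitude lies."""
    total = sum(mags)
    if total == 0:
        return 0
    threshold = fraction * total
    running = 0.0
    for i, m in enumerate(mags):
        running += m
        if running >= threshold:
            return i
    return len(mags) - 1


def flux(prev: Sequence[float], cur: Sequence[float]) -> float:
    """Sum of squared differences between consecutive frames."""
    return sum((c - p) ** 2 for p, c in zip(prev, cur))


def frame_features(frames: List[Sequence[float]]) -> List[Dict[str, float]]:
    """One feature dict per frame; flux of the first frame is set to 0."""
    features = []
    prev = None
    for cur in frames:
        features.append({
            "centroid": centroid(cur),
            "rolloff": float(rolloff(cur)),
            "flux": 0.0 if prev is None else flux(prev, cur),
        })
        prev = cur
    return features


if __name__ == "__main__":
    # Two synthetic frames: energy in subband 4, then spread uniformly.
    narrow = [0.0] * NUM_SUBBANDS
    narrow[4] = 1.0
    flat = [1.0] * NUM_SUBBANDS
    for row in frame_features([narrow, flat]):
        print(row)
```

Feature vectors of this kind could then feed a segmentation or Music/Speech classification stage; how the paper actually does so is not stated in this record.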
Pages: 761-764
Number of pages: 4
Related papers
50 in total
  • [1] Data hiding in MPEG compressed audio using wet paper codes
    Quan, Xiaomei
    Zhang, Hongbin
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 727 - +
  • [2] MPEG Standards for Compressed Representation of Immersive Audio
    Quackenbush, Schuyler R.
    Herre, Juergen
    [J]. PROCEEDINGS OF THE IEEE, 2021, 109 (09) : 1578 - 1589
  • [3] Sound enhancement technology for compressed audio
    Satoh, Yasushi
    [J]. Journal of the Institute of Electronics, Information and Communication Engineers, 2013, 96 (11) : 888 - 893
  • [4] Quantifying perceptual distortion in scalably compressed MPEG audio
    Creusere, CD
    [J]. CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 265 - 269
  • [5] Audio enhancement in compressed domain based on MPEG-AAC codec
    [J]. Deng, Feng, 1600, Chinese Institute of Electronics (42):
  • [6] Data embedding in MPEG-1/audio layer II compressed domain using side information
    Matsuoka, Akihiro
    Tanaka, Kiyoshi
    Yoneyama, Akio
    Nakajima, Yasuyuki
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1585 - +
  • [7] Efficient MPEG Compressed Video Analysis Using Macroblock Type Information
    Pei, Soo-Chang
    Chou, Yu-Zuong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 1999, 1 (04) : 321 - 333
  • [8] Performance of MPEG-7 low level audio descriptors with compressed data
    Lukasiak, J
    Stirling, D
    Harders, N
    Perrow, S
    [J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 273 - 276
  • [9] Audio contents adaptation using user's preference on sound fields in MPEG-21 DIA
    Seo, J
    Kang, K
    Hong, JK
    [J]. 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS: BROADBAND CONVERGENCE NETWORK INFRASTRUCTURE, 2004, : 1081 - 1086
  • [10] A new paradigm for analysis of MPEG compressed videos
    Farag, WE
    Abdel-Wahab, H
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2002, 25 (02) : 109 - 127