Sound analysis using MPEG compressed audio

被引：0

作者：

Tzanetakis, G ^{[1
]}

Cook, P ^{[1
]}

机构：

[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA

来源：

2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI | 2000年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

There is a huge amount of audio data available that is compressed using the MPEG audio compression standard. Sound analysis is based on the computation of short time feature vectors that describe the instantaneous spectral content of the sound. An interesting possibility is the calculation of features directly from compressed data. Since the bulk of the feature calculation is performed during the encoding stage this process has a significant performance advantage if the available data is compressed. Combining decoding and analysis in one stage is also very important for audio streaming applications. In this paper, we describe the calculation of features directly from MPEG audio compressed data. Two of the basic processes of analyzing sound are: segmentation and classification. To illustrate the effectiveness of the calculated features we have implemented two case studies: a general audio segmentation algorithm and a Music/Speech classifier. Experimental data is provided to show that the results obtained are comparable with sound analysis algorithms working directly with audio samples.

引用

页码：761 / 764

页数：4

共 50 条

[1] Data hiding in MPEG compressed audio using wet paper codes
Quan, Xiaomei
Zhang, Hongbin
[J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 727 - +
[2] MPEG Standards for Compressed Representation of Immersive Audio
Quackenbush, Schuyler R.
Herre, Juergen
[J]. PROCEEDINGS OF THE IEEE, 2021, 109 (09) : 1578 - 1589
[3] Sound enhancement technology for compressed audio
Satoh, Yasushi
[J]. Journal of the Institute of Electronics, Information and Communication Engineers, 2013, 96 (11): : 888 - 893
[4] Quantifying perceptual distortion in scalably compressed MPEG audio
Creusere, CD
[J]. CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 265 - 269
[5] Audio enhancement in compressed domain based on MPEG-AAC codec
[J]. Deng, Feng, 1600, Chinese Institute of Electronics (42):
[6] Data embedding in MPEG-1/audio layer II compressed domain using side information
Matsuoka, Akihiro
Tanaka, Kiyoshi
Yoneyama, Akio
Nakajima, Yasuyuki
[J]. 2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1585 - +
[7] Efficient MPEG Compressed Video Analysis Using Macroblock Type Information
Pei, Soo-Chang
Chou, Yu-Zuong
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 1999, 1 (04) : 321 - 333
[8] Performance of MPEG-7 low level audio descriptors with compressed data
Lukasiak, J
Stirling, D
Harders, N
Perrow, S
[J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 273 - 276
[9] Audio contents adaptation using user's preference on sound fields in MPEG-21 DIA
Seo, J
Kang, K
Hong, JK
[J]. 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS: BROADBAND CONVERGENCE NETWORK INFRASTRUCTURE, 2004, : 1081 - 1086
[10] A new paradigm for analysis of MPEG compressed videos
Farag, WE
Abdel-Wahab, H
[J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2002, 25 (02) : 109 - 127

← 1 2 3 4 5 →