How to Talk about Speech and Audio Quality with Speech and Audio People

被引：0

作者：

Raake, Alexander ^{[1
]}

Waeltermann, Marcel ^{[1
]}

Wuestenhagen, Ulf ^{[2
]}

Feiten, Bernhard ^{[2
]}

机构：

[1] TU Berlin, Telekom Innovat Labs T Labs, Berlin, Germany

[2] Telekom Innovat Labs T Labs, Deutsch Telekom, Germany

来源：

JOURNAL OF THE AUDIO ENGINEERING SOCIETY | 2012年 / 60卷 / 03期

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We present the results of two speech and two audio quality listening tests to compare the two test methods ACR (Absolute Category Rating) and MUSHRA (MUlti Stimulus with Hidden Reference and Anchors) for different content types. These comparisons have three primary goals: (i) analyze the resolution of the collected ratings, (ii) find relations for transforming test results obtained with one method onto the scale of the other, and (iii) refine audio quality results obtained using the ACR method by using MUSHRA-results for the upper quality regime, where MUSHRA typically shows a sightly better resolution than ACR. The aim is to contribute to the harmonization of speech and audio assessment methods considered meaningful in the light of the convergence between speech and audio coding and transmission.

引用

页码：147 / 155

页数：9

共 50 条

[31] Speech and crosstalk detection in multichannel audio
Wrigley, SN
Brown, GJ
Wan, V
Renals, S
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (01): : 84 - 91
[32] Speech and music classification in audio documents
Pinquier, J
Sénac, C
André-Obrecht, R
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4164 - 4164
[33] Speech, audio, and acoustic processing for multimedia
Juang, BH
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 1997, 14 (04) : 34 - 36
[34] Speech/Laughter Classification in Meeting Audio
Khine, Swe Zin Kalayar
Nwe, Tin Lay
Li, Haizhou
[J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 793 - 796
[35] APPLYING SPEECH ENHANCEMENT TO AUDIO SURVEILLANCE
OSHAUGHNESSY, D
KABAL, P
BERNARDI, D
BARBEAU, L
CHU, CC
MONCET, JL
[J]. JOURNAL OF FORENSIC SCIENCES, 1990, 35 (05) : 1163 - 1172
[36] COMPARISON OF WINDOWING IN SPEECH AND AUDIO CODING
Baeckstroem, Tom
[J]. 2013 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2013,
[37] Fusing audio and visual features of speech
Pan, H
Liang, ZP
Huang, TS
[J]. 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 214 - 217
[38] Speech/audio coding technologies and their applications
[J]. Kaneko, Takao, 2000, NTT, Tokyo, Japan (49):
[39] Joint speech recognition and audio captioning
Carnegie Mellon University, United States
不详
[J]. arXiv, 1600,
[40] MPEG Unified Speech and Audio Coding
Quackenbush, Schuyler
[J]. IEEE MULTIMEDIA, 2013, 20 (02) : 72 - 78

← 1 2 3 4 5 →