How to Talk about Speech and Audio Quality with Speech and Audio People

被引:0
|
作者
Raake, Alexander [1 ]
Waeltermann, Marcel [1 ]
Wuestenhagen, Ulf [2 ]
Feiten, Bernhard [2 ]
机构
[1] TU Berlin, Telekom Innovat Labs T Labs, Berlin, Germany
[2] Telekom Innovat Labs T Labs, Deutsch Telekom, Germany
来源
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present the results of two speech and two audio quality listening tests to compare the two test methods ACR (Absolute Category Rating) and MUSHRA (MUlti Stimulus with Hidden Reference and Anchors) for different content types. These comparisons have three primary goals: (i) analyze the resolution of the collected ratings, (ii) find relations for transforming test results obtained with one method onto the scale of the other, and (iii) refine audio quality results obtained using the ACR method by using MUSHRA-results for the upper quality regime, where MUSHRA typically shows a sightly better resolution than ACR. The aim is to contribute to the harmonization of speech and audio assessment methods considered meaningful in the light of the convergence between speech and audio coding and transmission.
引用
收藏
页码:147 / 155
页数:9
相关论文
共 50 条
  • [31] Speech and crosstalk detection in multichannel audio
    Wrigley, SN
    Brown, GJ
    Wan, V
    Renals, S
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (01): : 84 - 91
  • [32] Speech and music classification in audio documents
    Pinquier, J
    Sénac, C
    André-Obrecht, R
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4164 - 4164
  • [33] Speech, audio, and acoustic processing for multimedia
    Juang, BH
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 1997, 14 (04) : 34 - 36
  • [34] Speech/Laughter Classification in Meeting Audio
    Khine, Swe Zin Kalayar
    Nwe, Tin Lay
    Li, Haizhou
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 793 - 796
  • [35] APPLYING SPEECH ENHANCEMENT TO AUDIO SURVEILLANCE
    OSHAUGHNESSY, D
    KABAL, P
    BERNARDI, D
    BARBEAU, L
    CHU, CC
    MONCET, JL
    [J]. JOURNAL OF FORENSIC SCIENCES, 1990, 35 (05) : 1163 - 1172
  • [36] COMPARISON OF WINDOWING IN SPEECH AND AUDIO CODING
    Baeckstroem, Tom
    [J]. 2013 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2013,
  • [37] Fusing audio and visual features of speech
    Pan, H
    Liang, ZP
    Huang, TS
    [J]. 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 214 - 217
  • [38] Speech/audio coding technologies and their applications
    [J]. Kaneko, Takao, 2000, NTT, Tokyo, Japan (49):
  • [39] Joint speech recognition and audio captioning
    Carnegie Mellon University, United States
    不详
    [J]. arXiv, 1600,
  • [40] MPEG Unified Speech and Audio Coding
    Quackenbush, Schuyler
    [J]. IEEE MULTIMEDIA, 2013, 20 (02) : 72 - 78