How to Talk about Speech and Audio Quality with Speech and Audio People

被引:0
|
作者
Raake, Alexander [1 ]
Waeltermann, Marcel [1 ]
Wuestenhagen, Ulf [2 ]
Feiten, Bernhard [2 ]
机构
[1] TU Berlin, Telekom Innovat Labs T Labs, Berlin, Germany
[2] Telekom Innovat Labs T Labs, Deutsch Telekom, Germany
来源
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present the results of two speech and two audio quality listening tests to compare the two test methods ACR (Absolute Category Rating) and MUSHRA (MUlti Stimulus with Hidden Reference and Anchors) for different content types. These comparisons have three primary goals: (i) analyze the resolution of the collected ratings, (ii) find relations for transforming test results obtained with one method onto the scale of the other, and (iii) refine audio quality results obtained using the ACR method by using MUSHRA-results for the upper quality regime, where MUSHRA typically shows a sightly better resolution than ACR. The aim is to contribute to the harmonization of speech and audio assessment methods considered meaningful in the light of the convergence between speech and audio coding and transmission.
引用
收藏
页码:147 / 155
页数:9
相关论文
共 50 条
  • [1] Understanding the quality losses of embedded speech and audio coders
    Ramprashad, SA
    [J]. 2002 IEEE SPEECH CODING WORKSHOP PROCEEDINGS: A PARADIGM SHIFT TOWARD NEW CODING FUNCTIONS FOR THE BROADBAND AGE, 2002, : 11 - 13
  • [2] Audio Mixer Algorithm for Enhancing Speech Quality of Multi-party Audio Telephony
    Ryu, Sang-Hyeon
    Kim, Hyoung-Gook
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2013, 32 (06): : 541 - 547
  • [3] Combining Audio and Brain Activity for Predicting Speech Quality
    Parmonangan, Ivan Halim
    Tanaka, Hiroki
    Sakti, Sakriani
    Nakamura, Satoshi
    [J]. INTERSPEECH 2020, 2020, : 2762 - 2766
  • [4] Objective assessment of speech and audio quality - Technology and applications
    Rix, Antony W.
    Beerends, John G.
    Kim, Doh-Suk
    Kroon, Peter
    Ghitza, Oded
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06): : 1890 - 1901
  • [5] Speech in Smartwatch based Audio
    Liaqat, Daniyal
    Wu, Robert
    Gershon, Andrea
    Alshaer, Hisham
    Rudzicz, Frank
    de lara, Eyal
    [J]. MOBISYS'18: PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS, APPLICATIONS, AND SERVICES, 2018, : 523 - 523
  • [6] Reviewing Speech Input with Audio
    Hong, Jonggi
    Vaing, Christine
    Kacorri, Hernisa
    Findlater, Leah
    [J]. ACM Transactions on Accessible Computing, 2020, 13 (01):
  • [7] Speech processing for audio indexing
    Lamel, Lori
    Gauvain, Jean-Luc
    [J]. ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2008, 5221 : 4 - 15
  • [8] Applied speech and audio processing
    Puder, H
    Schmidt, G
    [J]. SIGNAL PROCESSING, 2006, 86 (06) : 1121 - 1123
  • [9] On Exploring Audio Anomaly in Speech
    Roxo, Tiago
    Costa, Joana Cabral
    Inacio, Pedro R. M.
    Proenca, Hugo
    [J]. 2023 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY, WIFS, 2023,
  • [10] SPEECH DETECTION ON BROADCAST AUDIO
    Zubari, Unal
    Ozan, Ezgi Can
    Acar, Banu Oskay
    Ciloglu, Tolga
    Esen, Ersin
    Ates, Tugrul K.
    Onur, Duygu Oskay
    [J]. 18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 85 - 89