Quality Dimensions of Narrowband and Wideband Speech Transmission

被引:29
|
作者
Waeltermann, M. [1 ]
Raake, A. [1 ]
Moeller, S. [1 ]
机构
[1] Berlin Inst Technol, Deutsch Telekom Labs, Qual & Usabil Lab, Berlin, Germany
关键词
INDIVIDUAL-DIFFERENCES; IMPAIRMENT FACTOR; NOISE;
D O I
10.3813/AAA.918370
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The study presented in this paper aims at exploring the perceptual spaces evoked for users of two different telephone scenarios: traditional narrowband speech transmission, and mixed narrowband/wideband speech transmission that may be encountered in today's Voice-over-IP services. Underlying dimensions that constitute the skeleton of these spaces are revealed by auditory experiments, following two different paradigms of judgment: a) Similarity-scaling, and b) Attribute-scaling (Semantic Differential) with subsequent a) Multidimensional Scaling, and b) Principal Component Analysis of a diverse set of stimuli. Similar configurations are obtained which are unequivocally interpretable. Three common dimensions, valid for both the narrowband and the wideband scenario can be identified: "Discontinuity", "Noisiness", and "Coloration". In addition, the wideband space is extended by a further, wideband-specific dimension. Integral listening-quality can well be modeled by means of these dimensions. In both scenarios, "Discontinuity" represents the most important quality feature. The presented work forms the basis for instrumental diagnostic quality measures.
引用
收藏
页码:1090 / 1103
页数:14
相关论文
共 50 条
  • [31] On comparing speech quality of various narrow- and wideband speech codecs
    Rämö, N
    Toukomaa, H
    ISSPA 2005: THE 8TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2005, : 603 - 606
  • [33] Wideband re-synthesis of narrowband CELP coded speech using multiband excitation model
    Chan, CF
    Hui, WK
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 322 - 325
  • [34] Narrowband-to-wideband expansion of telephony speech using piece wise deviation linear transformation
    Hu, H.T.
    Yu, C.
    International Journal of Electrical Engineering, 2010, 17 (01): : 7 - 17
  • [35] WIDEBAND SPEECH CODING WITH HYBRID DIGITAL-ANALOG TRANSMISSION
    Ruengeler, Matthias
    Kleifgen, Fabian
    Vary, Peter
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 784 - 788
  • [36] Direct Quantification of Latent Speech Quality Dimensions
    Waeltermann, Marcel
    Raake, Alexander
    Moeller, Sebastian
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2012, 60 (04): : 246 - 254
  • [37] Identifying Speech Quality Dimensions in a Telephone Conversation
    Koester, Friedemann
    Guse, Dennis
    Moeller, Sebastian
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2017, 103 (03) : 506 - 522
  • [38] Perceptual Speech Quality Dimensions in a Conversational Situation
    Koester, Friedemann
    Moeller, Sebastian
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2544 - 2548
  • [39] An Instrumental Measure for End-to-end Speech Transmission Quality Based on Perceptual Dimensions: Framework and Realization
    Waeltermann, Marcel
    Scholz, Kirstin
    Moeller, Sebastian
    Huo, Lu
    Raake, Alexander
    Heute, Ulrich
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 61 - +
  • [40] Predicting the quality of enhanced wideband speech with a cochlear model
    Wirtzfeld, Michael R.
    Pourmand, Nazanin
    Parsa, Vijay
    Bruce, Ian C.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (03): : EL319 - EL325