SUBJECTIVE RATINGS OF INSTANTANEOUS AND GRADUAL TRANSITIONS FROM NARROWBAND TO WIDEBAND ACTIVE SPEECH

被引:4
|
作者
Voran, Stephen D. [1 ]
机构
[1] Natl Telecommun & Informat Adm, Inst Telecommun Sci, Boulder, CO 80303 USA
关键词
Narrowband speech; speech coding; subjective testing; wideband speech;
D O I
10.1109/ICASSP.2010.5495187
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In advanced heterogeneous telecommunication networks, network resources can dynamically dictate the type of speech coding that is used. An increase in resources allows for lower coding distortion or it might also be used to provide wideband speech instead of narrowband speech. Existing studies have demonstrated that wideband speech is preferred to narrowband speech, but they have also demonstrated that an abrupt transition from narrowband to wideband is perceived as an impairment, even though it is a transition to a higher quality signal. We describe our recent work that resulted in subjective scores for abrupt and gradual transitions from narrowband to wideband at the midpoint of a six-second segment of active speech. On average, signals that start narrowband and end wideband are rated slightly lower than constant narrowband signals and results are nearly the same for abrupt and gradual (2.5 second) transitions. Scores from 20 listeners show a wide range of individual opinions so we conclude that studies of bandwidth transitions may be quite sensitive to the listener population sample.
引用
收藏
页码:4674 / 4677
页数:4
相关论文
共 4 条
  • [1] Generation of wideband speech from narrowband speech
    NTT Human Interface Lab
    NTT R&D, 10 (1027-1032):
  • [2] Statistical Recovery of Wideband Speech from Narrowband Speech
    Cheng, Yan Ming
    O'Shaughnessy, Douglas
    Mermelstein, Paul
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04): : 544 - 548
  • [3] Predicting Multidimensional Subjective Ratings of Children' Readings from the Speech Signals for the Automatic Assessment of Fluency
    Bailly, Gerard
    Godde, Erika
    Piat-Marchand, Anne-Laure
    Bosse, Marie-Line
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 317 - 322
  • [4] An investigation on the degradation of different features extracted from the compressed American English speech using narrowband and wideband codecs
    Sankar, M. S. Arun
    Sathidevi, P. S.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (04) : 861 - 876