Listener detection of talker stress in low-rate coded speech

Cited by: 1
Author
Voran, Stephen [1]
Affiliation
[1] Natl Telecommun & Informat Adm, Inst Telecommun Sci, Boulder, CO 80303 USA
Keywords
speech coding; speech intelligibility; stress detection; subjective testing; talker stress
DOI
10.1109/ICASSP.2008.4518734
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
We describe an experiment where listeners were asked to detect two specific forms of stress in talkers' recorded voices heard via six different simulated communication systems. Both task-induced stress and dramatized urgency were used. Communication systems included low-rate digital speech coding combined with bit errors, packet loss, and packet loss concealment. Twenty-four listeners participated in a total of 11,520 detection trials. A parallel investigation of word intelligibility in sentence context used 576 trials. Intelligibility results showed wide variance due to communication system and stress detection results showed less variance. More specifically, we found that listener detection of dramatized talker urgency was 4.7 times more robust to communication system degradations than word intelligibility in sentence context.
Pages: 4813-4816
Number of pages: 4