The Effect of Filled Pauses and Speaking Rate on Speech Comprehension in Natural, Vocoded and Synthetic Speech

被引:0
|
作者
Dall, Rasmus [1 ]
Wester, Mirjam [1 ]
Corley, Martin [2 ]
机构
[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9YL, Midlothian, Scotland
[2] Univ Edinburgh, PPLS, Edinburgh EH8 9YL, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
HMM-synthesis; speech synthesis; reaction time; filled pause; disfluency; speaking rate; speech perception; UH; UM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It has been shown that in natural speech filled pauses can be beneficial to a listener. In this paper, we attempt to discover whether listeners react in a similar way to filled pauses in synthetic and vocoded speech compared to natural speech. We present two experiments focusing on reaction time to a target word. In the first, we replicate earlier work in natural speech, namely that listeners respond faster to a target word following a filled pause than following a silent pause. This is replicated in vocoded but not in synthetic speech. Our second experiment investigates the effect of speaking rate on reaction times as this was potentially a confounding factor in the first experiment. Evidence suggests that slower speech rates lead to slower reaction times in synthetic and in natural speech. Moreover, in synthetic speech the response to a target word after a filled pause is slower than after a silent pause. This finding, combined with an overall slower reaction time, demonstrates a shortfall in current synthesis techniques. Remedying this could help make synthesis less demanding and more pleasant for the listener, and reaction time experiments could thus provide a measure of improvement in synthesis techniques.
引用
收藏
页码:56 / 60
页数:5
相关论文
共 50 条
  • [1] L2 comprehension of filled pauses and fillers in unscripted speech
    Carney, Nathaniel
    [J]. SYSTEM, 2022, 105
  • [2] CALIBRATION AND THE COMPREHENSION OF SYNTHETIC AND NATURAL SPEECH
    ZEIGLER, BL
    BOGGS, GJ
    KAUFMAN, LS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 : S79 - S79
  • [3] Comprehension of synthetic speech and digitized natural speech by adults with aphasia
    Hux, Karen
    Knollman-Porter, Kelly
    Brown, Jessica
    Wallace, Sarah E.
    [J]. JOURNAL OF COMMUNICATION DISORDERS, 2017, 69 : 15 - 26
  • [4] Filled pauses in speech synthesis: Towards conversational speech
    Adell, Jordi
    Bonafonte, Antonio
    Escudero, David
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2007, 4629 : 358 - +
  • [5] SPEAKING RATE - EFFECTS ON CHILDRENS COMPREHENSION OF NORMAL SPEECH
    BERRY, MD
    ERICKSON, RL
    [J]. JOURNAL OF SPEECH AND HEARING RESEARCH, 1973, 16 (03): : 367 - 374
  • [6] APHASIC SUBJECTS COMPREHENSION OF SYNTHETIC AND NATURAL SPEECH
    HUNTRESS, LM
    LEE, L
    CREAGHEAD, NA
    WHEELER, DD
    BRAVERMAN, KM
    [J]. JOURNAL OF SPEECH AND HEARING DISORDERS, 1990, 55 (01): : 21 - 27
  • [7] The Effect of Filled Pauses in a Lecture Speech on Impressive Evaluation of Listeners
    Nishizaki, Hiromitsu
    Sohmiya, Mitsuhiro
    Kobayashi, Kenji
    Sekiguchi, Yoshihiro
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 565 - +
  • [8] Presentation rate in comprehension of natural and synthesized speech
    Reynolds, ME
    Givens, J
    [J]. PERCEPTUAL AND MOTOR SKILLS, 2001, 92 (03) : 958 - 968
  • [9] Effect of Speaking Rate on Recognition of Synthetic and Natural Speech by Normal-Hearing and Cochlear Implant Listeners
    Ji, Caili
    Galvin, John J., III
    Xu, Anting
    Fu, Qian-Jie
    [J]. EAR AND HEARING, 2013, 34 (03): : 313 - 323
  • [10] Automatic identification of filled pauses in spontaneous speech
    O'Shaughnessy, D
    Gabrea, M
    [J]. 2000 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS 1 AND 2: NAVIGATING TO A NEW ERA, 2000, : 620 - 624