Visual Hearing Aids: Artificial Visual Speech Stimuli for Audiovisual Speech Perception in Noise

被引:0
|
作者
Choudhary, Zubin Datta [1 ]
Bruder, Gerd [1 ]
Welch, Gregory F. [1 ]
机构
[1] Univ Cent Florida, Orlando, FL 32816 USA
基金
美国国家科学基金会;
关键词
Speech perception; background noise; hearing; speechreading; visualizations; virtual humans; user study; SOCIAL PRESENCE; VIRTUAL HUMANS; FACE;
D O I
10.1145/3611659.3615682
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Speech perception is optimal in quiet environments, but noise can impair comprehension and increase errors. In these situations, lip reading can help, but it is not always possible, such as during an audio call or when wearing a face mask. One approach to improve speech perception in these situations is to use an artificial visual lip reading aid. In this paper, we present a user study (N = 17) in which we compared three levels of audio stimuli visualizations and two levels of modulating the appearance of the visualization based on the speech signal, and we compared them against two control conditions: an audio-only condition, and a real human speaking. We measured participants' speech reception thresholds (SRTs) to understand the effects of these visualizations on speech perception in noise. These thresholds indicate the decibel levels of the speech signal that are necessary for a listener to receive the speech correctly 50% of the time. Additionally, we measured the usability of the approaches and the user experience. We found that the different artificial visualizations improved participants' speech reception compared to the audio-only baseline condition, but they were significantly poorer than the real human condition. This suggests that different visualizations can improve speech perception when the speaker's face is not available. However, we also discuss limitations of current plug-and-play lip sync software and abstract representations of the speaker in the context of speech perception.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Visual Speech Recognition: Improving Speech Perception in Noise through Artificial Intelligence
    Raghavan, Arun M.
    Lipschitz, Noga
    Breen, Joseph T.
    Samy, Ravi N.
    Kohlberg, Gavriel D.
    [J]. OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2020, 163 (04) : 771 - 777
  • [2] Visual attention modulates audiovisual speech perception
    Tiippana, K
    Andersen, TS
    Sams, M
    [J]. EUROPEAN JOURNAL OF COGNITIVE PSYCHOLOGY, 2004, 16 (03): : 457 - 472
  • [3] COMPARATIVE ANALYSIS OF AUDIOVISUAL, AUDITIVE AND VISUAL PERCEPTION OF SPEECH
    EWERTSEN, HW
    NIELSEN, HB
    [J]. ACTA OTO-LARYNGOLOGICA, 1971, 72 (03) : 201 - &
  • [4] The contribution of dynamic visual cues to audiovisual speech perception
    Jaekl, Philip
    Pesquita, Ana
    Alsius, Agnes
    Munhall, Kevin
    Soto-Faraco, Salvador
    [J]. NEUROPSYCHOLOGIA, 2015, 75 : 402 - 410
  • [5] The role of visual spatial attention in audiovisual speech perception
    Andersen, Tobias S.
    Tiippana, Kaisa
    Laarni, Jari
    Kojo, Ilpo
    Sams, Mikko
    [J]. SPEECH COMMUNICATION, 2009, 51 (02) : 184 - 193
  • [6] Visual and Auditory Components in the Perception of Asynchronous Audiovisual Speech
    Garcia-Perez, Miguel A.
    Alcala-Quintana, Rocio
    [J]. I-PERCEPTION, 2015, 6 (06): : 1 - 20
  • [7] An audiovisual test of kinematic primitives for visual speech perception
    Rosenblum, LD
    Saldana, HM
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1996, 22 (02) : 318 - 331
  • [8] Effects of Visual Speech Envelope on Audiovisual Speech Perception in Multitalker Listening Environments
    Yuan, Yi
    Meyers, Kelli
    Borges, Kayla
    Lleo, Yasneli
    Fiorentino, Katarina A.
    Oh, Yonghee
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2021, 64 (07): : 2845 - 2853
  • [9] Auditory, visual and audiovisual perception of segmental speech features by severely hearing-impaired children
    Lamoré, PJJ
    Huiskamp, TMI
    van Son, NJDMM
    Bosman, AJ
    Smoorenburg, GF
    [J]. AUDIOLOGY, 1998, 37 (06): : 396 - 419
  • [10] PERCEPTION OF VISUAL TRANSFORMS OF SPEECH STIMULI - A PRELIMINARY EXPERIMENT
    HOUSE, AS
    HUGHES, GW
    GOLDSTEI.DP
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1966, 39 (06): : 1257 - &