Investigating the Visual Lombard Effect with Gabor Based Features

被引:2
|
作者
Chiu, Waito [1 ]
Xu, Yan [1 ]
Abel, Andrew [1 ]
Lin, Chun [2 ]
Tu, Zhengzheng [2 ]
机构
[1] Xian Jiaotong Liverpool Univ, Dept Comp Sci & Software Engn, Suzhou 215123, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
来源
关键词
Lombard Effect; Gabor Features; Lip Features; SPEECH PRODUCTION; INTELLIGIBILITY; BABBLE; NOISE;
D O I
10.21437/Interspeech.2020-1291
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
The Lombard Effect shows that speakers increase their vocal effort in the presence of noise, and research into acoustic speech, has demonstrated varying effects, depending on the noise level and speaker, with several differences, including timing and vocal effort. Research also identified several differences, including between gender, and noise type. However, most research has focused on the audio domain, with very limited focus on the visual effect. This paper presents a detailed study of the visual Lombard Effect, using a pilot Lombard Speech corpus developed for our needs, and a recently developed Gabor based lip feature extraction approach. Using Kernel Density Estimation, we identify clear differences between genders, and also show that speakers handle different noise types differently.
引用
收藏
页码:4606 / 4610
页数:5
相关论文
共 50 条
  • [1] Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
    Ma, Pingchuan
    Petridis, Stavros
    Pantic, Maja
    INTERSPEECH 2019, 2019, : 4090 - 4094
  • [2] VISUAL OBJECT TRACKING VIA GABOR-BASED SALIENT FEATURES EXTRACTION
    Zoidi, O.
    Tefas, A.
    Pitas, I.
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 1925 - 1929
  • [3] Analysis of the visual Lombard effect and automatic recognition experiments
    Heracleous, Panikos
    Ishi, Carlos T.
    Sato, Miki
    Ishiguro, Hiroshi
    Hagita, Norihiro
    COMPUTER SPEECH AND LANGUAGE, 2013, 27 (01): : 288 - 300
  • [4] Robust Formant Features for Speaker Verification in the Lombard Effect
    Kwak, Ileun
    Kang, Hong-Goo
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 114 - 118
  • [5] TEXTURE FEATURES BASED ON GABOR PHASE
    DUBUF, JMH
    HEITKAMPER, P
    SIGNAL PROCESSING, 1991, 23 (03) : 227 - 244
  • [6] Investigating Noise Interference on Speech Towards Applying the Lombard Effect Automatically
    Korvel, Grazina
    Kakol, Krzysztof
    Treigys, Povilas
    Kostek, Bozena
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2022), 2022, 13515 : 399 - 407
  • [7] Visual Speech Recognition with Lightweight Psychologically Motivated Gabor Features
    Zhang, Xuejie
    Xu, Yan
    Abel, Andrew K.
    Smith, Leslie S.
    Watt, Roger
    Hussain, Amir
    Gao, Chengxiang
    ENTROPY, 2020, 22 (12) : 1 - 24
  • [8] Generic visual categorization using composite Gabor and moment features
    Arivazhagan, S.
    Priyadharshini, R. Ahila
    OPTIK, 2015, 126 (21): : 2912 - 2916
  • [9] DENSE-GWP: AN IMPROVED PRIMARY VISUAL ENCODING MODEL BASED ON DENSE GABOR FEATURES
    Cui, Yibo
    Zhang, Chi
    Wang, Linyuan
    Yan, Bin
    Tong, Li
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2021, 21 (05)
  • [10] The impact of the Lombard effect on audio and visual speech recognition systems
    Marxer, Ricard
    Barker, Jon
    Alghamdi, Najwa
    Maddock, Steve
    SPEECH COMMUNICATION, 2018, 100 : 58 - 68