On the Impact of Non-speech Sounds on Speaker Recognition

被引:0
|
作者
Janicki, Artur [1 ]
机构
[1] Warsaw Univ Technol, Inst Telecommun, PL-00665 Warsaw, Poland
来源
关键词
speaker recognition; GMM-UBM; non-speech sounds; TIMIT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the impact of non-speech sounds on the performance of speaker recognition. Various experiments were conducted to check what the accuracy of speaker classification would be if non-speech sounds, such as breaths, were removed from the training and/or testing speech. Experiments were run using the GMM-UBM algorithm and speech taken from the TIMIT speech corpus, either original or transcoded using the G.711 or GSM 06.10 codecs. The results show a remarkable contribution of non-speech sounds to the overall speaker recognition performance.
引用
收藏
页码:566 / 572
页数:7
相关论文
共 50 条
  • [31] Effects of audio-visual integration on the detection of masked speech and non-speech sounds
    Eramudugolla, Ranmalee
    Henderson, Rachel
    Mattingley, Jason B.
    BRAIN AND COGNITION, 2011, 75 (01) : 60 - 66
  • [32] RECOGNITION OF NON-SPEECH SOUNDS USING MEL-FREQUENCY CEPSTRUM COEFFICIENTS AND DYNAMIC TIME WARPING METHOD
    Disken, Gokay
    Ibrikci, Turgay
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 144 - 147
  • [33] Language Model Based Non-speech Recognition Method
    Zhang, Qinglin
    Chen, Jianfeng
    Bai, Jisheng
    CONFERENCE PROCEEDINGS OF 2019 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2019), 2019,
  • [34] DICHOTIC AND MONOTIC INTERACTIONS BETWEEN SPEECH AND NON-SPEECH SOUNDS AT DIFFERENT STIMULUS ONSET ASYNCHRONIES
    PORTER, RJ
    MIRABILE, PJ
    PERCEPTION & PSYCHOPHYSICS, 1977, 21 (05): : 408 - 412
  • [35] Of words and whistles: Statistical learning operates similarly for identical sounds perceived as speech and non-speech
    Sweet, Sierra J.
    Van Hedger, Stephen C.
    Batterink, Laura J.
    COGNITION, 2024, 242
  • [36] Discrimination of speech and non-speech sounds following theta-burst stimulation of the motor cortex
    Rogers, Jack C.
    Moettoenen, Riikka
    Boyles, Rowan
    Watkins, Kate E.
    FRONTIERS IN PSYCHOLOGY, 2014, 5
  • [37] An event-related potential (ERP) study of duration changes in speech and non-speech sounds
    Jaramillo, M
    Alku, P
    Paavilainen, P
    NEUROREPORT, 1999, 10 (16) : 3301 - 3305
  • [38] Auditory spatial attention to speech and complex non-speech sounds in children with autism spectrum disorder
    Soskey, Laura N.
    Allen, Paul D.
    Bennetto, Loisa
    AUTISM RESEARCH, 2017, 10 (08) : 1405 - 1416
  • [39] An Algorithm for Detection of Breath Sounds in Spontaneous Speech with Application to Speaker Recognition
    Dumpala, Sri Harsha
    Alluri, K. N. R. K. Raju
    SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 98 - 108
  • [40] Discrimination and categorization of speech and non-speech sounds in an MEG delayed-match-to-sample study
    Luo, H
    Husain, FT
    Horwitz, B
    Poeppel, D
    NEUROIMAGE, 2005, 28 (01) : 59 - 71