On the Impact of Non-speech Sounds on Speaker Recognition

被引:0
|
作者
Janicki, Artur [1 ]
机构
[1] Warsaw Univ Technol, Inst Telecommun, PL-00665 Warsaw, Poland
来源
关键词
speaker recognition; GMM-UBM; non-speech sounds; TIMIT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the impact of non-speech sounds on the performance of speaker recognition. Various experiments were conducted to check what the accuracy of speaker classification would be if non-speech sounds, such as breaths, were removed from the training and/or testing speech. Experiments were run using the GMM-UBM algorithm and speech taken from the TIMIT speech corpus, either original or transcoded using the G.711 or GSM 06.10 codecs. The results show a remarkable contribution of non-speech sounds to the overall speaker recognition performance.
引用
收藏
页码:566 / 572
页数:7
相关论文
共 50 条
  • [41] Classification of Non-Speech Human Sounds: Feature Selection and Snoring Sound Analysis
    Liao, Wen-Hung
    Lin, Yu-Kai
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2695 - 2700
  • [42] CATEGORICAL PERCEPTION OF NON-SPEECH SOUNDS BY 2-MONTH-OLD INFANTS
    JUSCZYK, PW
    ROSNER, BS
    CUTTING, JE
    FOARD, CF
    SMITH, LB
    PERCEPTION & PSYCHOPHYSICS, 1977, 21 (01): : 50 - 54
  • [43] Priming of non-speech vocalizations in male adults: The influence of the speaker's gender
    Fecteau, S
    Armony, JL
    Joanette, Y
    Belin, P
    BRAIN AND COGNITION, 2004, 55 (02) : 300 - 302
  • [44] Lombard Speech Impact on Perceptual Speaker Recognition
    Ikeno, Ayako
    Hansen, John H. L.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2025 - 2028
  • [45] Using speech/non-speech detection to bias recognition search on noisy data
    Beaufays, F
    Boies, D
    Weintraub, M
    Zhu, QF
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 424 - 427
  • [46] Auditory evoked potentials using speech and non-speech sounds in a dichotic-listening paradigm.
    Hen-Tov, JK
    Cottone, JG
    Harkavy, LA
    Squires, NK
    JOURNAL OF COGNITIVE NEUROSCIENCE, 1999, : 101 - 101
  • [47] Perceiving identical sounds as speech or non-speech modulates activity in the left posterior superior temporal sulcus
    Möttönen, R
    Calvert, GA
    Jääskeläinen, IP
    Matthews, PM
    Thesen, T
    Tuomainen, J
    Sams, M
    NEUROIMAGE, 2006, 30 (02) : 563 - 569
  • [48] Hemispheric asymmetry of the auditory short-term habituation:: speech vs. non-speech sounds
    Sörös, P
    Knecht, S
    Teismann, I
    Manemann, E
    Imai, T
    Lütkenhöner, B
    Pantev, C
    NEUROIMAGE, 2001, 13 (06) : S940 - S940
  • [49] VOWELS, CONSONANTS, SPEECH, AND NON-SPEECH
    ADES, AE
    PSYCHOLOGICAL REVIEW, 1977, 84 (06) : 524 - 530
  • [50] Robust speech and non-speech detection
    Tian, Y
    Wang, ZY
    Lu, DJ
    CHINESE JOURNAL OF ELECTRONICS, 2002, 11 (01): : 79 - 82