On the Impact of Non-speech Sounds on Speaker Recognition

被引:0
|
作者
Janicki, Artur [1 ]
机构
[1] Warsaw Univ Technol, Inst Telecommun, PL-00665 Warsaw, Poland
来源
关键词
speaker recognition; GMM-UBM; non-speech sounds; TIMIT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the impact of non-speech sounds on the performance of speaker recognition. Various experiments were conducted to check what the accuracy of speaker classification would be if non-speech sounds, such as breaths, were removed from the training and/or testing speech. Experiments were run using the GMM-UBM algorithm and speech taken from the TIMIT speech corpus, either original or transcoded using the G.711 or GSM 06.10 codecs. The results show a remarkable contribution of non-speech sounds to the overall speaker recognition performance.
引用
收藏
页码:566 / 572
页数:7
相关论文
共 50 条
  • [21] Auditory hallucinations and the mismatch negativity: Processing speech and non-speech sounds in schizophrenia
    Fisher, Derek J.
    Labelle, Alain
    Knott, Verner J.
    INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2008, 70 (01) : 3 - 15
  • [22] Sensitivity of the human auditory cortex to acoustic degradation of speech and non-speech sounds
    Miettinen, Ismo
    Tiitinen, Hannu
    Alku, Paavo
    May, Patrick J. C.
    BMC NEUROSCIENCE, 2010, 11
  • [23] Listening to speech and non-speech sounds activates phonological and semantic knowledge differently
    Bartolotti, James
    Schroeder, Scott R.
    Hayakawa, Sayuri
    Rochanavibhata, Sirada
    Chen, Peiyao
    Marian, Viorica
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2020, 73 (08): : 1135 - 1149
  • [24] Speech/Non-Speech Segmentation Based on Phoneme Recognition Features
    Janez Žibert
    Nikola Pavešić
    France Mihelič
    EURASIP Journal on Advances in Signal Processing, 2006
  • [25] Speech/non-speech segmentation based on phoneme recognition features
    Zibert, Janez
    Pavesic, Nikola
    Mihelic, France
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)
  • [26] Central processing of speech sounds and non-speech sounds with similar spectral distribution: An auditory evoked potential study
    Kaneshiro, Shinsuke
    Hiraumi, Harukazu
    Sato, Hiroaki
    AURIS NASUS LARYNX, 2020, 47 (05) : 727 - 733
  • [27] The processing of speech and non-speech sounds in aphasic patients as reflected by the mismatch negativity (MMN)
    Ilvonen, T
    Kujala, T
    Kozou, H
    Kiesiläinen, A
    Salonen, O
    Alku, P
    Näätänen, R
    NEUROSCIENCE LETTERS, 2004, 366 (03) : 235 - 240
  • [28] Perception of speech and non-speech sounds by listeners with real and simulated sensorineural hearing loss
    Lum, DS
    Braida, LD
    JOURNAL OF PHONETICS, 2000, 28 (03) : 343 - 366
  • [29] Plastic cortical changes induced by learning to communicate with non-speech sounds
    Kujala, A
    Huotilainen, M
    Uther, M
    Shtyrov, Y
    Monto, S
    Ilmoniemi, RJ
    Näätänen, R
    NEUROREPORT, 2003, 14 (13) : 1683 - 1687
  • [30] Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms
    Maganti, Hari Krishna
    Motlicek, Petr
    Gatica-Perez, Daniel
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1037 - +