On the Impact of Non-speech Sounds on Speaker Recognition

被引：0

作者：

Janicki, Artur ^{[1
]}

机构：

[1] Warsaw Univ Technol, Inst Telecommun, PL-00665 Warsaw, Poland

来源：

TEXT, SPEECH AND DIALOGUE, TSD 2012 | 2012年 / 7499卷

关键词：

speaker recognition; GMM-UBM; non-speech sounds; TIMIT;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper investigates the impact of non-speech sounds on the performance of speaker recognition. Various experiments were conducted to check what the accuracy of speaker classification would be if non-speech sounds, such as breaths, were removed from the training and/or testing speech. Experiments were run using the GMM-UBM algorithm and speech taken from the TIMIT speech corpus, either original or transcoded using the G.711 or GSM 06.10 codecs. The results show a remarkable contribution of non-speech sounds to the overall speaker recognition performance.

引用

页码：566 / 572

页数：7

共 50 条

[21] Auditory hallucinations and the mismatch negativity: Processing speech and non-speech sounds in schizophrenia
Fisher, Derek J.
Labelle, Alain
Knott, Verner J.
INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2008, 70 (01) : 3 - 15
[22] Sensitivity of the human auditory cortex to acoustic degradation of speech and non-speech sounds
Miettinen, Ismo
Tiitinen, Hannu
Alku, Paavo
May, Patrick J. C.
BMC NEUROSCIENCE, 2010, 11
[23] Listening to speech and non-speech sounds activates phonological and semantic knowledge differently
Bartolotti, James
Schroeder, Scott R.
Hayakawa, Sayuri
Rochanavibhata, Sirada
Chen, Peiyao
Marian, Viorica
QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2020, 73 (08): : 1135 - 1149
[24] Speech/Non-Speech Segmentation Based on Phoneme Recognition Features
Janez Žibert
Nikola Pavešić
France Mihelič
EURASIP Journal on Advances in Signal Processing, 2006
[25] Speech/non-speech segmentation based on phoneme recognition features
Zibert, Janez
Pavesic, Nikola
Mihelic, France
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)
[26] Central processing of speech sounds and non-speech sounds with similar spectral distribution: An auditory evoked potential study
Kaneshiro, Shinsuke
Hiraumi, Harukazu
Sato, Hiroaki
AURIS NASUS LARYNX, 2020, 47 (05) : 727 - 733
[27] The processing of speech and non-speech sounds in aphasic patients as reflected by the mismatch negativity (MMN)
Ilvonen, T
Kujala, T
Kozou, H
Kiesiläinen, A
Salonen, O
Alku, P
Näätänen, R
NEUROSCIENCE LETTERS, 2004, 366 (03) : 235 - 240
[28] Perception of speech and non-speech sounds by listeners with real and simulated sensorineural hearing loss
Lum, DS
Braida, LD
JOURNAL OF PHONETICS, 2000, 28 (03) : 343 - 366
[29] Plastic cortical changes induced by learning to communicate with non-speech sounds
Kujala, A
Huotilainen, M
Uther, M
Shtyrov, Y
Monto, S
Ilmoniemi, RJ
Näätänen, R
NEUROREPORT, 2003, 14 (13) : 1683 - 1687
[30] Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms
Maganti, Hari Krishna
Motlicek, Petr
Gatica-Perez, Daniel
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1037 - +

← 1 2 3 4 5 →