Investigations on inter-speaker variability in the feature space

被引:8
|
作者
Haeb-Umbach, R [1 ]
机构
[1] Philips Res Labs, D-52066 Aachen, Germany
关键词
D O I
10.1109/ICASSP.1999.758146
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We apply Fisher variate analysis to measure the effectiveness of speaker normalization techniques. A trace criterion, which measures the ratio of the variations due to different phonemes compared to variations due to different speakers, serves as a first assessment of a feature set without the need for recognition experiments. By using this measure and by recognition experiments we demonstrate that cepstral mean normalization also has a speaker normalization effect, in addition to the well-known channel normalization effect. Similarly vocal tract normalization (VTN) is shown to remove inter-speaker variability. For VTN we show that normalization on a per sentence basis performs better than normalization on a per speaker basis. Recognition results are given on Wallstreet Journal and Hub-4 databases.
引用
收藏
页码:397 / 400
页数:4
相关论文
共 50 条
  • [1] Modeling inter-speaker variability in speech recognition
    Cloarec, Gwenael
    Jouvet, Denis
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4529 - 4532
  • [2] Eliminating inter-speaker variability prior to discriminant transforms
    Saon, G
    Padmanabhan, M
    Gopinath, R
    [J]. ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 73 - 76
  • [3] Biomechanical Tongue Models: An Approach to Studying Inter-speaker Variability
    Winkler, Ralf
    Fuchs, Susanne
    Perrier, Pascal
    Tiede, Mark
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 280 - +
  • [4] INTER-SPEAKER VARIABILITY IN FORENSIC VOICE COMPARISON: A PRELIMINARY EVALUATION
    Ajili, Moez
    Bonastre, Jean-Francois
    Rossato, Solange
    Kahn, Juliette
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2114 - 2118
  • [5] Inter-speaker variability in audio-visual classification of word prominence
    Heckmann, Martin
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1790 - 1794
  • [6] Inter-speaker variability: speaker normalisation and quantitative estimation of articulatory invariants in speech production for French
    Serrurier, Antoine
    Badin, Pierre
    Boe, Louis-Jean
    Lamalle, Laurent
    Neuschaefer-Rube, Christiane
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2272 - 2276
  • [7] Capture inter-speaker information with a neural network for speaker identification
    Wang, L
    Chen, K
    Chi, HH
    [J]. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL V, 2000, : 247 - 252
  • [8] INTER-SPEAKER VARIATION IN COMPOUND PROMINENCE
    Bell, Melanie J.
    [J]. LINGUE E LINGUAGGIO, 2015, 14 (01) : 61 - 78
  • [9] Intra-speaker and inter-speaker variability in speech sound pressure level across repeated readings
    Castellana, Antonella
    Carullo, Alessio
    Astolfi, Arianna
    Puglisi, Giuseppina Emma
    Fugiglando, Umberto
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (04): : 2353 - 2363
  • [10] Studies on inter-speaker variability in speech and its application in automatic speech recognition
    S UMESH
    [J]. Sadhana, 2011, 36 : 853 - 883