Investigations on inter-speaker variability in the feature space

被引:8
|
作者
Haeb-Umbach, R [1 ]
机构
[1] Philips Res Labs, D-52066 Aachen, Germany
关键词
D O I
10.1109/ICASSP.1999.758146
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We apply Fisher variate analysis to measure the effectiveness of speaker normalization techniques. A trace criterion, which measures the ratio of the variations due to different phonemes compared to variations due to different speakers, serves as a first assessment of a feature set without the need for recognition experiments. By using this measure and by recognition experiments we demonstrate that cepstral mean normalization also has a speaker normalization effect, in addition to the well-known channel normalization effect. Similarly vocal tract normalization (VTN) is shown to remove inter-speaker variability. For VTN we show that normalization on a per sentence basis performs better than normalization on a per speaker basis. Recognition results are given on Wallstreet Journal and Hub-4 databases.
引用
收藏
页码:397 / 400
页数:4
相关论文
共 50 条
  • [21] Intra- and inter-speaker variation in eight Russian fricativesa)
    Ulrich, Natalja
    Pellegrino, Francois
    Allassonniere-Tang, Marc
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (04): : 2285 - 2297
  • [22] R-Norm: Improving Inter-Speaker Variability Modelling at the Score Level via Regression Score Normalisation
    Vandyke, David
    Wagner, Michael
    Goecke, Roland
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3116 - 3120
  • [23] Inter-speaker speech variability assessment using statistical deformable models from 3.0 Tesla magnetic resonance images
    Vasconcelos, Maria J. M.
    Ventura, Sandra M. R.
    Freitas, Diamantino R. S.
    Tavares, Joao Manuel R. S.
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART H-JOURNAL OF ENGINEERING IN MEDICINE, 2012, 226 (H3) : 185 - 196
  • [24] Voice conversion based on probabilistic parameter transformation and extended inter-speaker residual prediction
    Hanzlicek, Zdenek
    Matousek, Jindrich
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2007, 4629 : 480 - 487
  • [26] Inter-speaker synchronization in audiovisual database for lip-readable speech to animation conversion
    Feldhoffer, Gergely
    Oroszi, Balazs
    Takacs, Gyoergy
    Tihanyi, Attila
    Bardi, Tamas
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2007, 4629 : 447 - 454
  • [27] Intra- and inter-speaker variations of formant pattern for lateral syllables in Standard Chinese
    Zhang, C
    van de Weijer, J
    Cui, JX
    [J]. FORENSIC SCIENCE INTERNATIONAL, 2006, 158 (2-3) : 117 - 124
  • [28] Normal non-fluency in adult males: An intra-and inter-speaker study
    Duckworth, M. S.
    McDougall, K.
    [J]. 10TH OXFORD DYSFLUENCY CONFERENCE, ODC 2014, 2015, 193 : 302 - 303
  • [29] SPECTRAL DISTRIBUTION CUES - COMPARATIVE-STUDY BASED ON 2 INTRA-SPEAKER AND INTER-SPEAKER DISCRIMINATING ANALYSES
    CAELEN, G
    VIGOUROUX, N
    [J]. SPEECH COMMUNICATION, 1983, 2 (2-3) : 133 - 136
  • [30] A Study on the Mixed Model Approach and Symbol Probability Weighting Function for Maximization of Inter-Speaker Variation
    Chin, Se-Noon
    Kang, Chul-Ho
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2005, 24 (07): : 410 - 415