DYNAMIC SPECTRAL SHAPE-FEATURES AS ACOUSTIC CORRELATES FOR INITIAL STOP CONSONANTS

被引:51
|
作者
NOSSAIR, ZB
ZAHORIAN, SA
机构
[1] Department of Electrical and Computer Engineering, Old Dominion University, Norfolk
来源
关键词
D O I
10.1121/1.400735
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A comprehensive investigation of two acoustic feature sets for English stop consonants spoken in syllable initial position was conducted to determine the relative invariance of the features that cue place and voicing. The features evaluated were overall spectral shape, encoded as the cosine transform coefficients of the nonlinearly scaled amplitude spectrum, and formants. In addition, features were computed both for the static case, i.e., from one 25-ms frame starting at the burst, and for the dynamic case, i.e., as parameter trajectories over several frames of speech data. All features were evaluated with speaker-independent automatic classification experiments using the data from 15 speakers to train the classifier and the data from 15 different speakers for testing. The primary conclusions from these experiments, as measured via automatic recognition rates, are as follows: (1) spectral shape features are superior to both formants, and formants plus amplitudes; (2) features extracted from the dynamic spectrum are superior to features extracted from the static spectrum; and (3) features extracted from the speech signal beginning with the burst onset are superior to features extracted from the speech signal beginning with the vowel transition. Dynamic features extracted from the smoothed spectra over a 60-ms interval timed to begin with the burst onset appear to account for the primary vowel context effects. Automatic recognition results for the 6 stops (93.7%) based on 20 features was better than the rates obtained with human listeners for a 50-ms segment (89.9%) and only slightly worse than the rates obtained by human listeners for a 100-ms interval (96.6%). Thus the basic conclusion from our work is that dynamic spectral shape features are acoustically invariant cues for both place and voicing in initial stop consonants.
引用
收藏
页码:2978 / 2991
页数:14
相关论文
共 13 条
  • [1] PERCEPTION OF STATIC AND DYNAMIC ACOUSTIC CUES TO PLACE OF ARTICULATION IN INITIAL STOP CONSONANTS
    KEWLEYPORT, D
    PISONI, DB
    STUDDERTKENNEDY, M
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1983, 73 (05): : 1779 - 1793
  • [2] THE ROLE OF THE GROSS SPECTRAL SHAPE AS A PERCEPTUAL CUE TO PLACE OF ARTICULATION IN INITIAL STOP CONSONANTS
    BLUMSTEIN, SE
    ISAACS, E
    MERTUS, J
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1982, 72 (01): : 43 - 50
  • [3] ACOUSTIC FEATURES IN MANNER DIFFERENTIATION OF KOREAN STOP CONSONANTS
    WEITZMAN, RS
    HAN, MS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1966, 40 (05): : 1272 - &
  • [4] Acoustic and articulatory correlates of stop consonants in a parrot and a human subject
    Patterson, DK
    Pepperberg, IM
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1998, 103 (04): : 2197 - 2215
  • [5] EVIDENCE AGAINST ACOUSTIC INVARIANCE IN INITIAL VOICED STOP CONSONANTS
    ZAHORIAN, SA
    NOSSAIR, ZB
    COLEMAN, RF
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 : S36 - S36
  • [6] Acoustic-phonetic features for the automatic classification of stop consonants
    Ali, AMA
    Van der Spiegel, J
    Mueller, P
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08): : 833 - 841
  • [7] SPECTRAL-SHAPE FEATURES VERSUS FORMANTS AS ACOUSTIC CORRELATES FOR VOWELS
    ZAHORIAN, SA
    JAGHARGHI, AJ
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1993, 94 (04): : 1966 - 1982
  • [8] TIME-VARYING FEATURES AS CORRELATES OF PLACE OF ARTICULATION IN STOP CONSONANTS
    KEWLEYPORT, D
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1983, 73 (01): : 322 - 335
  • [9] DISSOCIATION OF SPECTRAL AND TEMPORAL CUES TO VOICING DISTINCTION IN INITIAL STOP CONSONANTS
    SUMMERFIELD, Q
    HAGGARD, M
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 62 (02): : 435 - 448
  • [10] ACOUSTIC AND PERCEPTUAL ANALYSIS OF WORD-INITIAL STOP CONSONANTS IN PHONOLOGICALLY DISORDERED CHILDREN
    FORREST, K
    ROCKMAN, BK
    [J]. JOURNAL OF SPEECH AND HEARING RESEARCH, 1988, 31 (03): : 449 - 459