Formant measurement in children's speech based on spectral filtering

被引:22
|
作者
Story, Brad H. [1 ]
Bunton, Kate [1 ]
机构
[1] Univ Arizona, Dept Speech Language & Hearing Sci, Speech Acoust Lab, POB 210071, Tucson, AZ 85721 USA
基金
美国国家科学基金会;
关键词
Formant; Vocal tract; Speech analysis; Children's speech; Speech modeling; LINEAR PREDICTION; VOCAL-TRACT; FREQUENCY; VOWELS; SIMULATION; MODEL; COORDINATION; HARMONICS; CEPSTRUM; AIRWAY;
D O I
10.1016/j.specom.2015.11.001
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Children's speech presents a challenging problem for formant frequency measurement. In part, this is because high fundamental frequencies, typical of a children's speech production, generate widely spaced harmonic components that may undersample the spectral shape of the vocal tract transfer function. In addition, there is often a weakening of upper harmonic energy and a noise component due to glottal turbulence. The purpose of this study was to develop a formant measurement technique based on cepstral analysis that does not require modification of the cepstrum itself or transformation back to the spectral domain. Instead, a narrow-band spectrum is low-pass filtered with a cutoff point (i.e., cutoff "quefrency" in the terminology of cepstral analysis) to preserve only the spectral envelope. To test the method, speech representative of a 2-3 year-old child was simulated with an airway modulation model of speech production. The model, which includes physiologically-scaled vocal folds and vocal tract, generates sound output analogous to a microphone signal. The vocal tract resonance frequencies can be calculated independently of the output signal and thus provide test cases that allow for assessing the accuracy of the formant tracking algorithm. When applied to the simulated child-like speech, the spectral filtering approach was shown to provide a clear spectrographic representation of formant change over the time course of the signal, and facilitates tracking formant frequencies for further analysis. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:93 / 111
页数:19
相关论文
共 50 条
  • [1] Formant estimation of whispered speech based on spectral segmentation
    Gong Chenghui
    Zhao Heming
    Lu Gang
    Liu Hanxin
    [J]. 2006 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2006, : 562 - +
  • [2] SUBTRACTIVE FILTERING METHOD IN FORMANT ANALYSIS OF SPEECH
    PATRYN, R
    [J]. ACUSTICA, 1982, 50 (04): : 285 - 286
  • [3] SPEECH SPECTRAL SEGMENTATION FOR SPECTRAL ESTIMATION AND FORMANT MODELING
    CHHATWAL, HS
    CONSTANTINIDES, AG
    [J]. PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 127 - 144
  • [4] A formant modification method for improved ASR of children's speech
    Kathania, Hemant Kumar
    Kadiri, Sudarsana Reddy
    Alku, Paavo
    Kurimo, Mikko
    [J]. SPEECH COMMUNICATION, 2022, 136 : 98 - 106
  • [5] LPC-based formant enhancement method in Kalman filtering for speech enhancement
    Mellahi, Tarek
    Hamdi, Rachid
    [J]. AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2015, 69 (02) : 545 - 554
  • [6] Modeling Formant Dynamics in Speech Spectral Envelopes
    Craciun, Alexandra
    Paulus, Jouni
    Sevkin, Goekhan
    Backstrom, Tom
    [J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 1619 - 1623
  • [7] Adaptation of children's speech with limited data based on formant-like peak alignment
    Cui, Xiaodong
    Alwan, Abeer
    [J]. COMPUTER SPEECH AND LANGUAGE, 2006, 20 (04): : 400 - 419
  • [8] FORMANT BASED SPEECH SYNTHESIS
    HUGHES, PM
    [J]. BRITISH TELECOM TECHNOLOGY JOURNAL, 1988, 6 (02): : 84 - 90
  • [9] Correlation based speech formant recovery
    Nelson, D
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1643 - 1646
  • [10] Formant frequency measurement on spectrally reduced speech signals
    Deutsch, WA
    Moosmueller, S
    [J]. FORENSIC SCIENCE INTERNATIONAL, 2003, 136 : 372 - 372