On Pole-Zero Model Estimation Methods Minimizing a Logarithmic Criterion for Speech Analysis

被引:19
|
作者
Marelli, Damian [1 ]
Balazs, Peter [2 ]
机构
[1] Univ Newcastle, Sch Elect Engn & Comp Sci, Callaghan, NSW 2308, Australia
[2] Austrian Acad Sci, Acoust Res Inst, A-1040 Vienna, Austria
关键词
Bark scale; estimation; iterative methods; logarithmic arithmetic; nasals; numerator and denominator; poles and zeroes; speech analysis; transfer functions; FREQUENCY-DOMAIN; DESIGN; APPROXIMATIONS; FILTERS; IDENTIFICATION;
D O I
10.1109/TASL.2009.2025544
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A speech production model consists of a linear, slowly time-varying filter. Pole-zero models are required for a good representation of certain types of speech sounds, like nasals and laterals. From a perceptual point of view, designing them by minimizing a logarithmic criterion appears as a very suitable approach. The most accurate available results are obtained by using Newton-like search algorithms to optimize pole and zero positions, or the coefficients of a decomposition into quadratic factors. In this paper, we propose to optimize the numerator and denominator coefficients instead. Experimental results show that this is the computationally most efficient approach, especially when the optimization criterion considers a psychoacoustical frequency scale. To illustrate its applicability in speech processing, we used the proposed method for formant and anti-formant tracking as well as speech resynthesis.
引用
收藏
页码:237 / 248
页数:12
相关论文
共 50 条
  • [1] SPEECH ANALYSIS BY POLE-ZERO DECOMPOSITION
    YEGNANARAYANA, B
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 : S67 - S67
  • [2] A LOGARITHMIC BASED POLE-ZERO VOCAL TRACT MODEL ESTIMATION FOR SPEAKER VERIFICATION
    Enzinger, Ewald
    Balazs, Peter
    Marelli, Damian
    Becker, Timo
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4820 - 4823
  • [3] ADAPTIVE ANALYSIS OF SPEECH BASED ON A POLE-ZERO REPRESENTATION
    MORIKAWA, H
    FUJISAKI, H
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1982, 30 (01): : 77 - 88
  • [4] Pole-zero estimation of speech signal based on zero-tracking algorithm
    Ouaaline, N
    Radouane, L
    [J]. INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 1998, 12 (01) : 1 - 12
  • [5] AN ALGORITHM FOR POLE-ZERO SYSTEM MODEL ORDER ESTIMATION
    DAVILA, CE
    CHIANG, HL
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1995, 43 (04) : 1013 - 1017
  • [6] Pole-zero approximations for head-related transfer functions using a logarithmic error criterion
    Blommer, Michael A.
    Wakefield, Gregory H.
    [J]. IEEE, New York, NY, United States (05):
  • [7] Pole-zero approximations for head-related transfer functions using a logarithmic error criterion
    Blommer, MA
    Wakefield, GH
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (03): : 278 - 287
  • [8] HIGH-RESOLUTION POLE-ZERO ANALYSIS OF PARKINSONIAN SPEECH
    YAIR, E
    GATH, I
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 1991, 38 (02) : 161 - 167
  • [9] LINEAR PREDICTION ANALYSIS OF SPEECH BASED ON A POLE-ZERO REPRESENTATION
    ATAL, BS
    SCHROEDER, MR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 58 : S96 - S96
  • [10] LINEAR PREDICTION ANALYSIS OF SPEECH BASED ON A POLE-ZERO REPRESENTATION
    ATAL, BS
    SCHROEDER, MR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 (05): : 1310 - 1318