Applications of Positive Time-Frequency Distributions to Speech Processing

Cited by: 28
Authors
Pitton, James W. [1]
Atlas, Les E. [2]
Loughlin, Patrick J. [3]
Affiliations
[1] AT&T Bell Labs, Murray Hill, NJ 07974 USA
[2] Univ Washington, Dept Elect Engn, Interact Syst Design Lab, Seattle, WA 98195 USA
[3] Univ Pittsburgh, Dept Elect Engn, Pittsburgh, PA 15261 USA
Source
Keywords
DOI
10.1109/89.326614
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Much of our current knowledge and intuition of speech is derived from analyses involving assumptions of short-time stationarity (e.g., the speech spectrogram). Such methods are, by their very nature, incapable of revealing the true nonstationary nature of speech. A careful consideration of the theory of time-frequency distributions (TFD's), however, allows the construction of methods that reveal far more of the nonstationarities of speech, thereby highlighting just what it is that conventional approaches miss. We apply two iterative methods for generating positive TFD's to speech analysis. Both methods make use of multiple sources of information (e.g., multiple spectrograms) to yield a high-resolution estimate of the joint time-frequency energy density of speech. Plosive events and formant harmonic structure are simultaneously preserved in these TFD's. Rapidly time-varying formants are also resolved by these TFD's, and harmonic structure is revealed, independent of sweep rate; this result is quite different from that seen with conventional speech spectrograms. The speech features observed in these distributions demonstrate that conventional sliding window techniques lose or distort much of the rich nonstationary structure of speech. Examples for synthetic formants and real speech are provided. The differences between joint distributions and conditional distributions are also illustrated.
Pages: 554-566
Number of pages: 13