Principal component analysis of speech spectrogram images

被引:32
|
作者
Pinkowski, B
机构
[1] Computer Science Department, Western Michigan University, Kalamazoo
基金
美国国家卫生研究院;
关键词
principal components; Karhunen-Loeve transform; Fourier descriptors; cluster analysis; speech spectrogram;
D O I
10.1016/S0031-3203(96)00103-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent research has demonstrated that spectrograms containing human speech utterances can be analyzed using image processing techniques to yield a high recognition rate. In particular, Fourier descriptors (FDs) have been proved very useful for characterizing the boundary of segmented isolated words containing the English semivowels /w/, /y/, /l/, and /r/. This study examines the appropriateness of FDs combined with 17 other general features for classifying objects contained in binary spectrogram images. Principal components (PCs) are used for feature reduction on a speaker-dependent data set consisting of 80 sounds representing 20 speaker-dependent words containing English semivowels. With only eight features, including four 32-point FDs and four general features obtained from principal component analysis, a 97.5% recognition rate was obtained. (C) 1997 Pattern Recognition Society.
引用
收藏
页码:777 / 787
页数:11
相关论文
共 50 条
  • [1] Principal component analysis of the spectrogram of the speech signal: Interpretation and application to dysarthric speech
    Kacha, Abdellah
    Grenez, Francis
    Rafael Orozco-Arroyave, Juan
    Schoentgen, Jean
    [J]. COMPUTER SPEECH AND LANGUAGE, 2020, 59 : 114 - 122
  • [2] Speech Enhancement Algorithm Based on Robust Principal Component Analysis with Whitened Spectrogram Rearrangement in Colored Noise
    Luo Yongjiang
    Yang Tengfei
    Zhao Dong
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (12) : 3671 - 3679
  • [3] PRINCIPAL COMPONENT ANALYSIS OF MULTIVARIATE IMAGES
    GELADI, P
    ISAKSSON, H
    LINDQVIST, L
    WOLD, S
    ESBENSEN, K
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1989, 5 (03) : 209 - 220
  • [4] Principal component analysis of scintimammographic images
    Bonifazzi, Claudio
    Cinti, Maria Nerina
    De Vincentis, Giuseppe
    Finos, Livio
    Muzzioli, Valerio
    Betti, Margherita
    Lanconelli, Nico
    Tartari, Agostino
    Pani, Roberto
    [J]. PHYSICA MEDICA-EUROPEAN JOURNAL OF MEDICAL PHYSICS, 2006, 21 : 91 - 93
  • [5] Exposing Speech Resampling Manipulation by Local Texture Analysis on Spectrogram Images
    Zhang, Yujin
    Dai, Shuxian
    Song, Wanqing
    Zhang, Lijun
    Li, Dongmei
    [J]. ELECTRONICS, 2020, 9 (01)
  • [6] Transform principal component analysis of spectral images
    Bochko, H
    Jaaskelainen, T
    Parkkinen, J
    [J]. CGIV 2004: SECOND EUROPEAN CONFERENCE ON COLOR IN GRAPHICS, IMAGING, AND VISION - CONFERENCE PROCEEDINGS, 2004, : 120 - 124
  • [7] Fundus Images Filtering by Principal Component Analysis
    Moret, F.
    Lagreze, W. A.
    Bach, M.
    [J]. INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2010, 51 (13)
  • [8] Principal component analysis for compression of hyperspectral images
    Lim, S
    Sohn, KH
    Lee, C
    [J]. IGARSS 2001: SCANNING THE PRESENT AND RESOLVING THE FUTURE, VOLS 1-7, PROCEEDINGS, 2001, : 97 - 99
  • [9] Quaternion principal component analysis of color images
    Le Bihan, N
    Sangwine, SJ
    [J]. 2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 1, PROCEEDINGS, 2003, : 809 - 812
  • [10] Texture analysis of images using Principal Component Analysis
    Bharati, MH
    MacGregor, JF
    [J]. PROCESS IMAGING FOR AUTOMATIC CONTROL, 2001, 4188 : 27 - 37