A comparison of cepstral coefficients and spectral moments in the classification of Romanian fricatives

被引:21
|
作者
Spinu, Laura [1 ]
Lilley, Jason [2 ]
机构
[1] Univ Western Ontario, London, ON N6A 3K7, Canada
[2] Nemours Biomed Res, Wilmington, DE 19803 USA
基金
美国国家科学基金会;
关键词
Fricatives; Cepstral coefficients; Spectral moments; Place of articulation; Secondary palatalization; Classification; Romanian; ACOUSTIC CHARACTERISTICS; STATISTICAL-ANALYSIS; EUROPEAN PORTUGUESE; ENGLISH FRICATIVES; PERCEPTION; ARTICULATION; CHILDREN; CUES; ADOLESCENTS; OBSTRUENTS;
D O I
10.1016/j.wocn.2016.05.002
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
In this paper we explore two methods for the classification of fricatives. First, for the coding of the speech, we compared two sets of acoustic measures obtained from a corpus of Romanian fricatives: (a) spectral moments and (b) cepstral coefficients. Second, we compared two methods of determining the regions of the segments from which the measures would be extracted. In the first method, the phonetic segments were divided into three regions of approximately equal duration. In the second method, Hidden Markov Models (HMMs) were used to divide each segment into three regions such that the variances of the measures within each region were minimized. The corpus we analyzed consists of 3674 plain and palatalized word-final fricatives from four places of articulation, produced by 31 native speakers of Romanian (20 females). We used logistic regression to classify fricatives by place, voicing, palatalization status, and gender. We found that cepstral coefficients reliably outperformed spectral moments in all classification tasks, and that using regions determined by HMM yielded slightly higher correct classification rates than using regions of equal duration. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:40 / 58
页数:19
相关论文
共 50 条
  • [1] Predictive power of cepstral coefficients and spectral moments in the classification of Azerbaijani fricatives
    Mokari, Payam Ghaffarvand
    Sardhaei, Nasim Mahdinezhad
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 147 (03): : EL228 - EL234
  • [2] Acoustic classification of Russian plain and palatalized sibilant fricatives: Spectral vs. cepstral measures
    Spinu, Laura
    Kochetov, Alexei
    Lilley, Jason
    SPEECH COMMUNICATION, 2018, 100 : 41 - 45
  • [3] Using Spectral Moments as a Speaker Specific Feature in Nasals and Fricatives
    Schindler, Carola
    Draxler, Christoph
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2792 - 2795
  • [4] Spectral and Textural Features for Automatic Classification of Fricatives
    Frid, Alex
    Lavner, Yizhar
    2014 XXII ANNUAL PACIFIC VOICE CONFERENCE (PVC), 2014,
  • [5] Spectral moments vs discrete cosine transformation coefficients: Evaluation of acoustic measures distinguishing two merging German fricatives
    Jannedy, Stefanie
    Weirich, Melanie
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (01): : 395 - 405
  • [6] A comparison of reconstructed phase spaces and cepstral coefficients for multi-band phoneme classification
    Indrebo, KM
    Povinelli, RJ
    Johnson, MT
    2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 634 - 637
  • [7] On approximating line spectral frequencies to LPC cepstral coefficients
    Kim, Hong Kook, 2000, IEEE, Piscataway, NJ, United States (08):
  • [8] On approximating line spectral frequencies to LPC cepstral coefficients
    Kim, HK
    Choi, SH
    Lee, HS
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (02): : 195 - 199
  • [9] Breathing site classification via joint mel frequency cepstral coefficients and gammatone frequency cepstral coefficients approach
    Zhang, Jiarui
    Ling, Bingo Wing-Kuen
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (02) : 3623 - 3634
  • [10] DYSPHONIA DETECTION BASED ON MODULATION SPECTRAL FEATURES AND CEPSTRAL COEFFICIENTS
    Markaki, M.
    Stylianou, Y.
    Arias-Londono, J. D.
    Godino-Llorente, J. I.
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5162 - 5165