Phonetic feature extraction for context-sensitive glottal source processing

被引:9
|
作者
Kane, John [1 ]
Aylett, Matthew [2 ,3 ]
Yanushevskaya, Irena [1 ]
Gobl, Christer [1 ]
机构
[1] Trinity Coll Dublin, Sch Linguist Speech & Commun Sci, Phonet & Speech Lab, Dublin, Ireland
[2] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
[3] CereProc Ltd, Edinburgh, Midlothian, Scotland
基金
爱尔兰科学基金会;
关键词
Voice quality; Phonation type; Glottal source; Expressive speech; Speech synthesis; DEEP NEURAL-NETWORKS; SPEECH;
D O I
10.1016/j.specom.2013.12.003
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The effectiveness of glottal source analysis is known to be dependent on the phonetic properties of its concomitant supraglottal features. Phonetic classes like nasals and fricatives are particularly problematic. Their acoustic characteristics, including zeros in the vocal tract spectrum and aperiodic noise, can have a negative effect on glottal inverse filtering, a necessary pre-requisite to glottal source analysis. In this paper, we first describe and evaluate a set of binary feature extractors, for phonetic classes with relevance for glottal source analysis. As voice quality classification is typically achieved using feature data derived by glottal source analysis, we then investigate the effect of removing data from certain detected phonetic regions on the classification accuracy. For the phonetic feature extraction, classification algorithms based on Artificial Neural Networks (ANNs), Gaussian Mixture Models (GMMs) and Support Vector Machines (SVMs) are compared. Experiments demonstrate that the discriminative classifiers (i.e. ANNs and SVMs) in general give better results compared with the generative learning algorithm (i.e. GMMs). This accuracy generally decreases according to the sparseness of the feature (e.g., accuracy is lower for nasals compared to syllabic regions). We find best classification of voice quality when just using glottal source parameter data derived within detected syllabic regions. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:10 / 21
页数:12
相关论文
共 50 条
  • [31] LES LANGAGES CONTEXT-SENSITIVE
    FRIANT, J
    ANNALES DE L INSTITUT HENRI POINCARE SECTION B-CALCUL DES PROBABILITES ET STATISTIQUE, 1967, 3 (01): : 35 - &
  • [32] A hallmark of context-sensitive design
    Moler, Steve
    Public Roads, 2002, 65 (06)
  • [33] Context-sensitive state estimation
    Steinberg, AN
    FUSION 2003: PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE OF INFORMATION FUSION, VOLS 1 AND 2, 2003, : 881 - 888
  • [34] Context-sensitive dependency pairs
    Alarcon, Beatriz
    Gutierrez, Raul
    Lucas, Salvador
    FSTTCS 2006: FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEORETICAL COMPUTER SCIENCE, PROCEEDINGS, 2006, 4337 : 297 - +
  • [35] Competition for Context-Sensitive Consumers
    Apffelstaedt, Arno
    Mechtenberg, Lydia
    MANAGEMENT SCIENCE, 2021, 67 (05) : 2828 - 2844
  • [36] Dynamic Context-Sensitive Deliberation
    Jensen, Maarten
    Vanhee, Lois
    Dignum, Frank
    MULTI-AGENT-BASED SIMULATION XXIV, MABS 2023, 2024, 14558 : 112 - 126
  • [37] Context-Sensitive Document Ranking
    常利军
    于旭
    秦璐
    Journal of Computer Science & Technology, 2010, 25 (03) : 444 - 457
  • [38] Context-sensitive query expansion
    Li, Weijiang
    Zhao, Tiejun
    Wang, Xiangang
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (02): : 300 - 304
  • [39] THE CENTERS OF CONTEXT-SENSITIVE LANGUAGES
    STAIGER, L
    NEHRLICH, W
    LECTURE NOTES IN COMPUTER SCIENCE, 1986, 233 : 594 - 601
  • [40] Practical Context-Sensitive CFI
    van der Veen, Victor
    Andriesse, Dennis
    Goktas, Enes
    Gras, Ben
    Sambuc, Lionel
    Slowinska, Asia
    Bos, Herbert
    Giuffrida, Cristiano
    CCS'15: PROCEEDINGS OF THE 22ND ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2015, : 927 - 940