Speech events are recoverable from unlabeled articulatory data: Using an unsupervised clustering approach on data obtained from Electromagnetic Midsaggital Articulography (EMA)

被引:0
|
作者
Duran, Daniel [1 ]
Bruni, Jagoda [1 ]
Dogil, Grzegorz [1 ]
Schuetze, Hinrich [1 ]
机构
[1] Univ Stuttgart, Inst Nat Language Proc, Stuttgart, Germany
关键词
Speech production/perception; modeling; clustering; EMA; SYLLABLE STRUCTURE; SEGMENTATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Some models of speech perception/production and language acquisition make use of a quasi-continuous representation of the acoustic speech signal. We investigate whether such models could potentially profit from incorporating articulatory information in an analogous fashion. In particular, we investigate how articulatory information represented by EMA measurements can influence unsupervised phonetic speech categorization. By incorporation of the acoustic signal and non-synthetic, raw articulatory data, we present first results of a clustering procedure, which is similarly applied in numerous language acquisition and speech perception models. It is observed that non-labeled articulatory data, i.e. without previously assumed landmarks, perform fine clustering results. A more effective clustering outcome for plosives than for vowels seems to support the motor view of speech perception.
引用
下载
收藏
页码:2212 / 2215
页数:4
相关论文
共 40 条
  • [1] Multimodal speech animation from electromagnetic articulography data
    Inserm, U846, 18 avenue Doyen Lépine, 69500 Bron, France
    不详
    不详
    不详
    不详
    不详
    不详
    European Signal Proces. Conf., (2807-2811):
  • [2] MULTIMODAL SPEECH ANIMATION FROM ELECTROMAGNETIC ARTICULOGRAPHY DATA
    Gibert, Guillaume
    Attina, Virginie
    Tiede, Mark
    Bundgaard-Nielsen, Rikke
    Kroos, Christian
    Kasisopa, Benjawan
    Vatikiotis-Bateson, Eric
    Best, Catherine T.
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2807 - 2811
  • [3] Articulatory Synthesis of French Connected Speech from EMA Data
    Toutios, Asterios
    Narayanan, Shrikanth S.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2737 - 2741
  • [4] PREDICTION OF VOICING AND THE F0 CONTOUR FROM ELECTROMAGNETIC ARTICULOGRAPHY DATA FOR ARTICULATION-TO-SPEECH SYNTHESIS
    Stone, Simon
    Schmidt, Philipp
    Birkholz, Peter
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7329 - 7333
  • [5] AUTOMATIC ALIGNMENT OF A PHONETIC TRANSCRIPTION WITH ARTICULATORY EVENTS FROM X-RAY DATA OF CONTINUOUS SPEECH UTTERANCES
    NELSON, WL
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 : S22 - S22
  • [6] Unsupervised learning from incomplete data using a mixture model approach
    Hunt, L
    Jorgensen, M
    STATISTICAL DATA MINING AND KNOWLEDGE DISCOVERY, 2004, : 173 - 191
  • [7] Generating concrete test cases from vehicle data using models obtained from clustering
    Chetouane, Nour
    Wotawa, Franz
    2023 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS, ICSTW, 2023, : 70 - 77
  • [8] UNSUPERVISED CLASSIFICATION OF FOREST FROM POLARIMETRIC INTERFEROMETRIC SAR DATA USING FUZZY CLUSTERING
    Luo, Huan-Min
    Chen, Er-Xue
    Li, Xiao-Wen
    Cheng, Jian
    Li, Min
    PROCEEDINGS OF THE 2010 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, 2010, : 201 - 206
  • [9] Lithofacies Identification from Wire-Line Logs Using an Unsupervised Data Clustering Algorithm
    Ul Hasan, Md Monjur
    Hasan, Tanzeer
    Shahidi, Reza
    James, Lesley
    Peters, Dennis
    Gosine, Ray
    ENERGIES, 2023, 16 (24)
  • [10] A robust ensemble approach to learn from positive and unlabeled data using SVM base models
    Claesen, Marc
    De Smet, Frank
    Suykens, Johan A. K.
    De Moor, Bart
    NEUROCOMPUTING, 2015, 160 : 73 - 84