MRI-Based Vocal Tract Representations for the Three-Dimensional Finite Element Synthesis of Diphthongs

被引:19
|
作者
Arnela, Marc [1 ]
Dabbaghchian, Saeed [2 ]
Guasch, Oriol [1 ]
Engwall, Olov [2 ]
机构
[1] Univ Ramon Llull, GTM Grp Recerca Tecnol Media, Barcelona 08022, Spain
[2] KTH Royal Inst Technol, Sch Elect Engn & Comp Sci, Dept Speech Mus & Hearing, SE-10044 Stockholm, Sweden
关键词
Vocal tract acoustics; Finite Element Method; diphthongs; semi-polar grid; adaptive grid; speech synthesis; WAVE-EQUATION; GEOMETRY SIMPLIFICATIONS; PROPAGATION MODES; SIMULATION; HEAD;
D O I
10.1109/TASLP.2019.2942439
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The synthesis of diphthongs in three-dimensions (3D) involves the simulation of acoustic waves propagating through a complex 3D vocal tract geometry that deforms over time. Accurate 3D vocal tract geometries can be extracted from Magnetic Resonance Imaging (MRI), but due to long acquisition times, only static sounds can be currently studied with an adequate spatial resolution. In this work, 3D dynamic vocal tract representations are built to generate diphthongs, based on a set of cross-sections extracted from MRI-based vocal tract geometries of static vowel sounds. A diphthong can then be easily generated by interpolating the location, orientation and shape of these cross-sections, thus avoiding the interpolation of full 3D geometries. Two options are explored to extract the cross-sections. The first one is based on an adaptive grid (AG), which extracts the cross-sections perpendicular to the vocal tract midline, whereas the second one resorts to a semi-polar grid (SPG) strategy, which fixes the cross-section orientations. The finite element method (FEM) has been used to solve the mixed wave equation and synthesize diphthongs [${\alpha i}$] and [${\alpha u}$] in the dynamic 3D vocal tracts. The outputs from a 1D acoustic model based on the Transfer Matrix Method have also been included for comparison. The results show that the SPG and AG provide very close solutions in 3D, whereas significant differences are observed when using them in 1D. The SPG dynamic vocal tract representation is recommended for 3D simulations because it helps to prevent the collision of adjacent cross-sections.
引用
收藏
页码:2173 / 2182
页数:10
相关论文
共 50 条
  • [21] A Three-Dimensional Mixed Finite Element for Flexoelectricity
    Deng, Feng
    Deng, Qian
    Shen, Shengping
    JOURNAL OF APPLIED MECHANICS-TRANSACTIONS OF THE ASME, 2018, 85 (03):
  • [22] Three-dimensional finite element analysis shelves
    Zhao, Xiuting
    Meng, Jin
    MATERIALS AND COMPUTATIONAL MECHANICS, PTS 1-3, 2012, 117-119 : 639 - 642
  • [23] MRI-based three-dimensional thermal physiological characterization of thyroid gland of human body
    Jin, Chao
    He, Zhi Zhu
    Yang, Yang
    Liu, Jing
    MEDICAL ENGINEERING & PHYSICS, 2014, 36 (01) : 16 - 25
  • [24] MRI-BASED THREE DIMENSIONAL MODELING OF STENTING PROCEDURE
    Zhao, Shijia
    Gu, Linxia
    Ganpule, Shailesh
    PROCEEDINGS OF THE ASME INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION 2011, VOL 2, 2012, : 569 - 570
  • [25] The three-dimensional beam theory: Finite element formulation based on curvature
    Zupan, D
    Saje, M
    COMPUTERS & STRUCTURES, 2003, 81 (18-19) : 1875 - 1888
  • [26] Three-dimensional inversion of magnetotelluric based on adaptive finite element method
    Qin Ce
    Liu XingFei
    Wang XuBen
    Sun WeiBin
    Zhao Ning
    CHINESE JOURNAL OF GEOPHYSICS-CHINESE EDITION, 2022, 65 (06): : 2311 - 2325
  • [27] Three-dimensional Solid Element Based on the Finite Element Absolute Nodal Coordinate Formulation
    Chao, Ma
    Hao, Liu
    Yang, Zhao
    PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL & ELECTRONICS ENGINEERING AND COMPUTER SCIENCE (ICEEECS 2016), 2016, 50 : 320 - 323
  • [28] Vocal tract length estimation based on vowels using a database consisting of 385 speakers and a database with MRI-based vocal tract shape information
    Kawahara, Hideki
    Kitamura, Tatsuya
    Takemoto, Hironori
    Nisimura, Ryuichi
    Irino, Toshio
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 870 - 874
  • [29] Morphologic Differences in the Vocal Tract Resonance Cavities of Voice Professionals: An MRI-Based Study
    Rua Ventura, Sandra M.
    Freitas, Diamantino Rui S.
    Ramos, Isabel Maria A. P.
    Tavares, Joao Manuel R. S.
    JOURNAL OF VOICE, 2013, 27 (02) : 132 - 140
  • [30] A Symmetric Approach in the Three-Dimensional Digital Waveguide Modeling of the Vocal Tract
    Mushtaq, Tahir
    Kamran, Ahmad
    Qureshi, Muhammad Zubair Akbar
    Iqbal, Zafar
    ARCHIVES OF ACOUSTICS, 2023, 48 (03) : 317 - 324