Low-dimensional Style Token Control for Hyperarticulated Speech Synthesis

被引:1
|
作者
Nishihara, Miku [1 ]
Wells, Dan [2 ]
Richmond, Korin [2 ]
Pine, Aidan [3 ]
机构
[1] Nagoya Inst Technol, Dept Comp Sci, Nagoya, Aichi, Japan
[2] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh, Midlothian, Scotland
[3] Natl Res Council Canada, Ottawa, ON, Canada
来源
关键词
controllable speech synthesis; speech style embedding; hyperarticulated speech; SPEAKING; HEARING;
D O I
10.21437/Interspeech.2024-2074
中图分类号
学科分类号
摘要
Global style tokens (GSTs) allow for rich modelling of the variation in a speech corpus and subsequent control of text-to-speech synthesis (TTS). However, certain styles of speech may be marked by variation along multiple dimensions, complicating the interpretation and control of learned style tokens. One example is hyperarticulated or 'clear' speech, for example as directed toward listeners with hearing impairments or language learners in the classroom, which in English is characterised by reduced speaking rate, increased F0, more careful articulation of vowels and plosive consonants, and other factors. We present a method for simplifying control of style tokens by applying principal components analysis (PCA) to GST weights from a TTS system trained on both plain and clear speech. We identify the axes of variation in PCA space with the acoustic correlates of clear speech in English and show that we can synthesise either style by moving along a single dimension in that space. Index Terms: controllable speech synthesis, speech style
引用
收藏
页码:3385 / 3389
页数:5
相关论文
共 50 条
  • [21] Exploring Low-Dimensional Structures of Modulation Spectra for Robust Speech Recognition
    Yan, Bi-Cheng
    Shih, Chin-Hong
    Liu, Shih-Hung
    Chen, Berlin
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3637 - 3641
  • [22] Low-Dimensional Motor Control Representations in Throwing Motions
    Ruiz, Ana Lucia Cruz
    Pontonnier, Charles
    Dumont, Georges
    APPLIED BIONICS AND BIOMECHANICS, 2017, 2017
  • [23] Low-Dimensional Carbon Nanomaterials: Synthesis, Properties, and Applications
    Zhang, Sulin
    Li, Teng
    Huang, Jianyu
    Shenoy, Vivek
    JOURNAL OF NANOMATERIALS, 2011, 2011
  • [24] Synthesis of low-dimensional nano-structural GaN
    Li, ZJ
    Li, HJ
    Chen, XL
    Li, KZ
    Cao, YG
    Li, JY
    RARE METAL MATERIALS AND ENGINEERING, 2002, 31 (05) : 321 - 324
  • [25] SYNTHESIS AND PROPERTIES OF LOW-DIMENSIONAL METAL CHALCOGENIDES.
    Rouxel, Jean
    1600, (64):
  • [26] Synthesis, properties, and optical applications of low-dimensional perovskites
    Zhang, Yupeng
    Liu, Jingying
    Wang, Ziyu
    Xue, Yunzhou
    Ou, Qingdong
    Polavarapu, Lakshminarayana
    Zheng, Jialu
    Qi, Xiang
    Bao, Qiaoliang
    CHEMICAL COMMUNICATIONS, 2016, 52 (94) : 13637 - 13655
  • [27] Nanoscale control of low-dimensional spin structures in manganites
    王静
    Iftikhar Ahmed Malik
    梁仁荣
    黄文
    郑仁奎
    张金星
    Chinese Physics B, 2016, (06) : 49 - 56
  • [28] Control of energy dissipation in sliding low-dimensional materials
    Cammarata, Antonio
    Polcar, Tomas
    PHYSICAL REVIEW B, 2020, 102 (08)
  • [29] Performance animation from low-dimensional control signals
    Chai, JX
    Hodgins, JK
    ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03): : 686 - 696
  • [30] SYNTHESIS, STRUCTURE, AND PROPERTIES OF NOVEL LOW-DIMENSIONAL SOLIDS
    HWU, SJ
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1993, 206 : 552 - INOR