A Study on Tailor-Made Speech Synthesis Based on Deep Neural Networks

Cited by: 1
Authors
Yamada, Shuhei [1]
Nose, Takashi [1]
Ito, Akinori [1]
Affiliations
[1] Tohoku Univ, Grad Sch Engn, Aoba Ku, Aramaki Aza Aoba 6-6-05, Sendai, Miyagi 9808579, Japan
Source
ADVANCES IN INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, VOL 1 | 2017, Vol. 63
Keywords
DNN-based speech synthesis; Prosody control; F0; context; Context label; Model training; Unsupervised labeling
DOI
10.1007/978-3-319-50209-0_20
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose "tailor-made speech synthesis," a speech synthesis technique that enables users to control synthetic speech naturally and intuitively. As a first step toward realizing tailor-made speech synthesis, we introduce F0 context into the speaker model training of speech synthesis based on deep neural networks (DNNs). F0 context represents the relative log F0 at the mora or accent-phrase level of the training data. In contrast to the conventional F0 context used in HMM-based techniques, it allows users to control the F0 of synthetic speech steplessly. Experiments showed that F0 context was effective for controlling F0: the F0 of the synthetic voice followed the value of the F0 context.
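The abstract describes F0 context as a relative log F0 value at the mora or accent-phrase level. A minimal sketch of how such a value might be computed is given below. It is not the authors' implementation: the function name `relative_log_f0`, the frame-index segment format, and the reading of "relative" as "segment mean log F0 minus utterance mean log F0" are all assumptions made for illustration.

```python
import math


def relative_log_f0(f0_hz, segments):
    """Return one relative log-F0 value per segment (mora or accent phrase).

    f0_hz    : list of frame-level F0 values in Hz; 0 marks an unvoiced frame.
    segments : list of (start, end) frame index pairs, end exclusive.

    ASSUMPTION: "relative" is taken to mean the segment's mean log F0 minus
    the utterance's mean log F0. The paper may define it differently.
    """
    voiced = [math.log(f) for f in f0_hz if f > 0]
    if not voiced:
        raise ValueError("utterance has no voiced frames")
    utt_mean = sum(voiced) / len(voiced)

    contexts = []
    for start, end in segments:
        seg = [math.log(f) for f in f0_hz[start:end] if f > 0]
        # ASSUMPTION: a fully unvoiced segment gets the neutral value 0.0.
        contexts.append(sum(seg) / len(seg) - utt_mean if seg else 0.0)
    return contexts


if __name__ == "__main__":
    # Toy utterance: a low segment followed by a segment one octave higher.
    f0 = [0, 100, 100, 200, 200, 0]
    print(relative_log_f0(f0, [(0, 3), (3, 6)]))
```

Under this reading, a value below zero marks a segment pitched lower than the utterance average and a value above zero marks a higher one. Because the value is continuous rather than a discrete class, it could in principle be varied steplessly at synthesis time, which is the property the abstract emphasizes.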
Pages: 159-166
Number of pages: 8