On the transformation of the speech spectrum for voice conversion

被引:0
|
作者
Baudoin, G
Stylianou, Y
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In many speech applications, control of the speech individuality is required. These applications include the personalization of the voice of speech synthesizers, the restoral of voice individuality for interpreting telephony, the improvment of abnormal speech intelligibility. It is generally admitted that both prosodic and spectral parameters have to be changed in order to modify the speech individuality. Several algorithms have recently been proposed for the spectrum control. This paper presents some improvments added to these previously proposed methods and compares 4 approaches in the same common framework of voice conversion for application to text to speech synthesizers.
引用
收藏
页码:1405 / 1408
页数:4
相关论文
共 50 条
  • [41] Voice quality conversion in TD-PSOLA speech synthesis
    Sun, XJ
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 953 - 956
  • [42] HMM adaptation and voice conversion for the synthesis of child speech: a comparison
    Watts, Oliver
    Yamagishi, Junichi
    King, Simon
    Berkling, Kay
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2595 - +
  • [43] TEXT-INFORMED SPEECH INPAINTING VIA VOICE CONVERSION
    Prablanc, Pierre
    Ozerov, Alexey
    Duong, Ngoc Q. K.
    Perez, Patrick
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 878 - 882
  • [44] Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion
    Chien, Yung-Lun
    Chen, Hsin-Hao
    Yen, Ming-Chi
    Tsai, Shu-Wei
    Wang, Hsin-Min
    Tsao, Yu
    Chi, Tai-Shih
    INTERSPEECH 2023, 2023, : 5023 - 5026
  • [45] Improving Body Transmitted Unvoiced Speech with Statistical Voice Conversion
    Nakagiri, Mikihiro
    Toda, Tomoki
    Kashioka, Hideki
    Shikano, Kiyohiro
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2270 - 2273
  • [46] Enrichment of Oesophageal Speech: Voice Conversion with Duration-Matched Synthetic Speech as Target
    Raman, Sneha
    Sarasola, Xabier
    Navas, Eva
    Hernaez, Inma
    APPLIED SCIENCES-BASEL, 2021, 11 (13):
  • [47] Speech-to-Singing Voice Conversion The challenges and strategies for improving vocal conversion processes
    Vijayan, Karthika
    Li, Haizhou
    Toda, Tomoki
    IEEE SIGNAL PROCESSING MAGAZINE, 2019, 36 (01) : 95 - 102
  • [48] Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech
    Chen, Li-Wei
    Lee, Hung-Yi
    Tsao, Yu
    INTERSPEECH 2019, 2019, : 719 - 723
  • [49] Efficient Modeling of Temporal Structure of Speech For Applications in Voice Transformation
    Nguyen, Binh Phu
    Akagi, Masato
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1599 - 1602
  • [50] Synthesizing Speech from Electromyography using Voice Transformation Techniques
    Toth, Arthur R.
    Wand, Michael
    Schultz, Tanja
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 644 - 647