High-Quality Analysis/Synthesis Method Based on Temporal Decomposition for Speech Modification

Cited by: 0
Authors
Nguyen, Binh Phu [1]
Shibata, Takeshi [1]
Akagi, Masato [1]
Affiliations
[1] Japan Advanced Institute of Science and Technology, School of Information Science, Nomi, Ishikawa 923-1292, Japan
Keywords
analysis/synthesis method; speech modification; temporal decomposition
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The challenge in speech modification is to modify speech flexibly without degrading its quality. Conventional methods cannot flexibly control speech signals in the time and frequency domains, which degrades the quality of the modified speech. This paper proposes a high-quality analysis/synthesis method for speech modification. To control temporal evolution, we use a speech analysis technique called temporal decomposition (TD), which decomposes a speech signal into event targets and event functions. The event functions describe the temporal evolution of the spectral parameters, and the event targets represent the "ideal" spectral parameters. The same event functions evaluated for the spectral parameters are also used to model the temporal evolution of the excitation parameters. To control speech signals flexibly in both the time and frequency domains, we propose new methods for modeling the event functions and the event targets. Experimental results show that the proposed analysis/synthesis method produces high-quality synthesized speech and allows speech signals to be modified flexibly.
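As a rough illustration of the decomposition described in the abstract, TD approximates a parameter trajectory y(n) as a weighted sum of event targets a_k and overlapping event functions phi_k(n), i.e. y(n) ≈ sum_k a_k * phi_k(n). The sketch below is not the authors' method: the piecewise-linear event functions, the least-squares target estimate, and all function names (`event_functions`, `estimate_targets`, `synthesize`, `time_scale`) are assumptions chosen only to show how targets and event functions separate "what" from "when", and how stretching the event functions changes timing while the targets stay fixed.

```python
import numpy as np

def event_functions(centers, n_frames):
    """Assumed event functions: piecewise-linear bumps, one per event center.

    Returns an array of shape (K, n_frames). Each function equals 1 at its own
    center and 0 at the neighbouring centers; at every frame the functions sum
    to 1. `centers` must be strictly increasing.
    """
    frames = np.arange(n_frames)
    one_hot = np.eye(len(centers))
    return np.stack([np.interp(frames, centers, one_hot[k])
                     for k in range(len(centers))])

def estimate_targets(params, phi):
    """Least-squares event targets so that params ≈ targets @ phi.

    params: (P, N) parameter trajectories (rows could be spectral or
    excitation parameters, since both share the same event functions).
    phi: (K, N) event functions. Returns targets of shape (P, K).
    """
    targets_t, *_ = np.linalg.lstsq(phi.T, params.T, rcond=None)
    return targets_t.T

def synthesize(targets, phi):
    """Rebuild the parameter trajectories from targets and event functions."""
    return targets @ phi

def time_scale(centers, n_frames, rate):
    """Stretch (rate > 1) or compress (rate < 1) the timing of the events."""
    new_centers = np.asarray(centers, dtype=float) * rate
    new_n_frames = int(round((n_frames - 1) * rate)) + 1
    return event_functions(new_centers, new_n_frames)

# Synthetic example: 2 parameter tracks, 4 events, 41 frames.
centers = [0, 10, 25, 40]
n_frames = 41
phi = event_functions(centers, n_frames)
true_targets = np.array([[1.0, 2.0, 0.5, 1.5],
                         [0.0, -1.0, 1.0, 0.0]])
params = synthesize(true_targets, phi)

# Analysis: recover the targets from the trajectories.
targets = estimate_targets(params, phi)

# Modification: double the duration while keeping the targets unchanged.
stretched = synthesize(targets, time_scale(centers, n_frames, 2.0))
```

In this toy setting, timing is changed only through the event functions and the parameter values only through the targets, which is the kind of separate control over the time and frequency domains that the abstract refers to.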
Pages: 662-665
Number of pages: 4