Spectral Envelope Recovery beyond the Nyquist Limit for High-Quality Manipulation of Speech Sounds

被引:0
|
作者
Kawahara, Hideki [1 ]
Morise, Masanori
Banno, Hideki
Takahashi, Toru
Nisimura, Ryuichi [1 ]
Irino, Toshio [1 ]
机构
[1] Wakayama Univ, Dept Design Informat Sci, Wakayama, Japan
关键词
speech analysis; sampling theory; speech modification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A simple new method to recover details in a spectral envelope is proposed based on a recently introduced speech analysis, modification and resynthesis framework called TANDEM-STRAIGHT. Spectral envelope recovery of voiced sounds is a discrete-to-analog conversion in the frequency domain. However, there is a fundamental problem because the spatial frequency contents of vocal tract functions generally exceed the Nyquist limit of the equivalent sampling rate determined by the fundamental frequency. TANDEM-STRAIGHT yields a method to recover a spectral envelope based on the consistent sampling theory and provides base information for exceeding this limit. At the final stage, the AR spectral envelope estimated from the TANDEM-STRAIGHT spectrum is divided by the F0 adaptively smoothed version of itself to supply the missing high-spatial-frequency details of the envelope. The underlying principle of the proposed method can also be applied to other speech synthesis frameworks.
引用
收藏
页码:650 / 653
页数:4
相关论文
共 50 条
  • [41] Probing Beyond: Looking into the Patterns within a High-Quality Diet
    Frankenfeld, Cara L.
    [J]. JOURNAL OF NUTRITION, 2022, 152 (03): : 653 - 654
  • [42] Factors that determine and limit the resistivity of high-quality individual ZnO nanowires
    Lord, Alex M.
    Maffeis, Thierry G.
    Walton, Alex S.
    Kepaptsoglou, Despoina M.
    Ramasse, Quentin M.
    Ward, Michael B.
    Koeble, Juergen
    Wilks, Steve P.
    [J]. NANOTECHNOLOGY, 2013, 24 (43)
  • [43] Beyond convenience: The disruptive high-quality, high-impact online MBA
    Geoghegan, Will
    Wanger, Sarah
    [J]. BUSINESS HORIZONS, 2024, 67 (03) : 299 - 309
  • [44] EFFICIENT HEAT-RECOVERY AND HIGH-QUALITY REFINER PULP
    FRANZEN, R
    [J]. PULP & PAPER-CANADA, 1983, 84 (06) : 71 - &
  • [45] Finding intelligible consonant-vowel sounds using high-quality articulatory synthesis
    van Niekerk, Daniel R.
    Xu, Anqi
    Gerazov, Branislav
    Krug, Paul K.
    Birkholz, Peter
    Xu, Yi
    [J]. INTERSPEECH 2020, 2020, : 4457 - 4461
  • [46] VOC - AN INTEGRATED HIGH-QUALITY SPEECH SYNTHESIZER BASED ON LPC TECHNIQUES
    ITALIANO, P
    PONTE, G
    SARTORI, M
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1985, 31 (03) : 501 - 504
  • [47] An Advanced NLP Framework for High-Quality Text-to-Speech Synthesis
    Ungurean, Catalin
    Burileanu, Dragos
    [J]. 2011 6TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2011,
  • [48] A Waveform Representation Framework for High-quality Statistical Parametric Speech Synthesis
    Fan, Bo
    Lee, Siu Wa
    Tian, Xiaohai
    Xie, Lei
    Dong, Minghui
    [J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 530 - 536
  • [49] TIME-FREQUENCY PROCESSING OF PARTIALS FOR HIGH-QUALITY SPEECH SYNTHESIS
    Ciobanu, Amelia
    Negrescu, Cristian
    Burileanu, Dragos
    Stanomir, Dumitru
    [J]. FROM SPEECH PROCESSING TO SPOKEN LANGUAGE TECHNOLOGY, 2009, : 67 - 75
  • [50] HIGH-QUALITY CODING OF TELEPHONE SPEECH AND WIDE-BAND AUDIO
    JAYANT, NS
    [J]. IEEE COMMUNICATIONS MAGAZINE, 1990, 28 (01) : 10 - 20