Spectral Envelope Recovery beyond the Nyquist Limit for High-Quality Manipulation of Speech Sounds

被引：0

作者：

Kawahara, Hideki ^{[1
]}

Morise, Masanori

Banno, Hideki

Takahashi, Toru

Nisimura, Ryuichi ^{[1
]}

Irino, Toshio ^{[1
]}

机构：

[1] Wakayama Univ, Dept Design Informat Sci, Wakayama, Japan

来源：

INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008年

关键词：

speech analysis; sampling theory; speech modification;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A simple new method to recover details in a spectral envelope is proposed based on a recently introduced speech analysis, modification and resynthesis framework called TANDEM-STRAIGHT. Spectral envelope recovery of voiced sounds is a discrete-to-analog conversion in the frequency domain. However, there is a fundamental problem because the spatial frequency contents of vocal tract functions generally exceed the Nyquist limit of the equivalent sampling rate determined by the fundamental frequency. TANDEM-STRAIGHT yields a method to recover a spectral envelope based on the consistent sampling theory and provides base information for exceeding this limit. At the final stage, the AR spectral envelope estimated from the TANDEM-STRAIGHT spectrum is divided by the F0 adaptively smoothed version of itself to supply the missing high-spatial-frequency details of the envelope. The underlying principle of the proposed method can also be applied to other speech synthesis frameworks.

引用

页码：650 / 653

页数：4

共 50 条

[41] Probing Beyond: Looking into the Patterns within a High-Quality Diet
Frankenfeld, Cara L.
[J]. JOURNAL OF NUTRITION, 2022, 152 (03): : 653 - 654
[42] Factors that determine and limit the resistivity of high-quality individual ZnO nanowires
Lord, Alex M.
Maffeis, Thierry G.
Walton, Alex S.
Kepaptsoglou, Despoina M.
Ramasse, Quentin M.
Ward, Michael B.
Koeble, Juergen
Wilks, Steve P.
[J]. NANOTECHNOLOGY, 2013, 24 (43)
[43] Beyond convenience: The disruptive high-quality, high-impact online MBA
Geoghegan, Will
Wanger, Sarah
[J]. BUSINESS HORIZONS, 2024, 67 (03) : 299 - 309
[44] EFFICIENT HEAT-RECOVERY AND HIGH-QUALITY REFINER PULP
FRANZEN, R
[J]. PULP & PAPER-CANADA, 1983, 84 (06) : 71 - &
[45] Finding intelligible consonant-vowel sounds using high-quality articulatory synthesis
van Niekerk, Daniel R.
Xu, Anqi
Gerazov, Branislav
Krug, Paul K.
Birkholz, Peter
Xu, Yi
[J]. INTERSPEECH 2020, 2020, : 4457 - 4461
[46] VOC - AN INTEGRATED HIGH-QUALITY SPEECH SYNTHESIZER BASED ON LPC TECHNIQUES
ITALIANO, P
PONTE, G
SARTORI, M
[J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1985, 31 (03) : 501 - 504
[47] An Advanced NLP Framework for High-Quality Text-to-Speech Synthesis
Ungurean, Catalin
Burileanu, Dragos
[J]. 2011 6TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2011,
[48] A Waveform Representation Framework for High-quality Statistical Parametric Speech Synthesis
Fan, Bo
Lee, Siu Wa
Tian, Xiaohai
Xie, Lei
Dong, Minghui
[J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 530 - 536
[49] TIME-FREQUENCY PROCESSING OF PARTIALS FOR HIGH-QUALITY SPEECH SYNTHESIS
Ciobanu, Amelia
Negrescu, Cristian
Burileanu, Dragos
Stanomir, Dumitru
[J]. FROM SPEECH PROCESSING TO SPOKEN LANGUAGE TECHNOLOGY, 2009, : 67 - 75
[50] HIGH-QUALITY CODING OF TELEPHONE SPEECH AND WIDE-BAND AUDIO
JAYANT, NS
[J]. IEEE COMMUNICATIONS MAGAZINE, 1990, 28 (01) : 10 - 20

← 1 2 3 4 5 →