ROBUST FULL-BAND ADAPTIVE SINUSOIDAL ANALYSIS AND SYNTHESIS OF SPEECH

Cited by: 0

Authors:

Kafentzis, George P. ^{[1,3]}

Rosec, Olivier ^{[2]}

Stylianou, Yannis ^{[3]}

Affiliations:

[1] Orange Labs, TECH ACTS MAS, Lannion, France

[2] Voxygen S A, Pole Phonix, Pleumeur Bodou, France

[3] Univ Crete, Multimedia Informat Lab, Dept Comp Sci, Iraklion, Greece

Source:

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014

Keywords:

Extended adaptive quasi-harmonic model; Speech modelling; Speech analysis; Sinusoidal modelling;

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

Recent advances in speech analysis have shown that voiced speech can be very well represented using quasi-harmonic frequency tracks and local parameter adaptivity to the underlying signal. In this paper, we revisit the quasi-harmonicity approach through the extended adaptive Quasi-Harmonic Model (eaQHM), and we show that applying a continuous f0 estimation method together with an adaptivity scheme can yield high-resolution quasi-harmonic analysis and perceptually indistinguishable resynthesized speech. The method starts from an initial harmonic model, which successively converges to quasi-harmonicity. Formal listening tests showed that eaQHM is robust against f0 estimation artefacts and can provide higher-quality resynthesized speech than a recently developed model, the adaptive Harmonic Model (aHM), and the standard Sinusoidal Model (SM).

Pages: 5

