ROBUST FULL-BAND ADAPTIVE SINUSOIDAL ANALYSIS AND SYNTHESIS OF SPEECH

被引:0
|
作者
Kafentzis, George P. [1 ,3 ]
Rosec, Olivier [2 ]
Stylianou, Yannis [3 ]
机构
[1] Orange Labs, TECH ACTS MAS, Lannion, France
[2] Voxygen S A, Pole Phonix, Pleumeur Bodou, France
[3] Univ Crete, Multimedia Informat Lab, Dept Comp Sci, Iraklion, Greece
关键词
Extended adaptive quasi-harmonic model; Speech modelling; Speech analysis; Sinusoidal modelling;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recent advances in speech analysis have shown that voiced speech can be very well represented using quasi-harmonic frequency tracks and local parameter adaptivity to the underlying signal. In this paper, we revisit the quasi-harmonicity approach through the extended adaptive Quasi-Harmonic Model-eaQHM, and we show that the application of a continuous integral(0) estimation method plus an adaptivity scheme can yield high resolution quasi-harmonic analysis and perceptually indistinguishable resynthesized speech. This method assumes an initial harmonic model which successively converges to quasi-harmonicity. Formal listening tests showed that eaQHM is robust against integral(0) estimation artefacts and can provide a higher quality in resynthesizing speech, compared to a recently developed model, called the adaptive Harmonic Model (aHM), and the standard Sinusoidal Model (SM).
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Analysis and Synthesis of Speech Using an Adaptive Full-Band Harmonic Model
    Degottex, Gilles
    Stylianou, Yannis
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (10): : 2085 - 2095
  • [2] A Full-Band Adaptive Harmonic Representation of Speech
    Degottex, Gilles
    Stylianou, Yannis
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 382 - 385
  • [3] A robust full-band image watermarking scheme
    Liu, Jung-Chun
    Lin, Chu-Hsing
    Kuo, Li-Ching
    [J]. 2006 10TH IEEE SINGAPORE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS, VOLS 1 AND 2, 2006, : 250 - +
  • [4] Full-Band Quasi-Harmonic Analysis and Synthesis of Musical Instrument Sounds with Adaptive Sinusoids
    Caetano, Marcelo
    Kafentzis, George P.
    Mouchtaris, Athanasios
    Stylianou, Yannis
    [J]. APPLIED SCIENCES-BASEL, 2016, 6 (05):
  • [5] ADAPTIVE-FSN: INTEGRATING FULL-BAND EXTRACTION AND ADAPTIVE SUB-BAND ENCODING FOR MONAURAL SPEECH ENHANCEMENT
    Tsao, Yu-Sheng
    Ho, Kuan-Hsun
    Hung, Jeih-Weih
    Chen, Berlin
    [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 458 - 464
  • [6] Local spectral attention for full-band speech enhancement
    Hou, Zhongshu
    Hu, Qinwen
    Chen, Kai
    Cao, Zhanzhong
    Lu, Jing
    [J]. JASA EXPRESS LETTERS, 2023, 3 (11):
  • [7] Low-dimensional representation of spectral envelope without deterioration for full-band speech analysis/synthesis system
    Morise, Masanori
    Miyashita, Genta
    Ozawa, Kenji
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 409 - 413
  • [8] Speech analysis and synthesis with a refined adaptive sinusoidal representation
    Tabet, Youcef
    Boughazi, Mohamed
    Afifi, Saddek
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (03) : 581 - 588
  • [9] DNN-Based Full-Band Speech Synthesis Using GMM Approximation of Spectral Envelope
    Koguchi, Junya
    Takamichi, Shinnosuke
    Morise, Masanori
    Saruwatari, Hiroshi
    Sagayama, Shigeki
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (12): : 2673 - 2681
  • [10] Learnable spectral dimension compression mapping for full-band speech enhancement
    Hu, Qinwen
    Hou, Zhongshu
    Chen, Kai
    Lu, Jing
    [J]. JASA EXPRESS LETTERS, 2023, 3 (02):