Real-Time Vibration Control of An Electrolarynx based on Statistical F0 Contour Prediction

Authors
Tanaka, Kou [1]
Toda, Tomoki [2]
Neubig, Graham [1]
Nakamura, Satoshi [1]
Affiliations
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, 8916-5 Takayama Cho, Ikoma, Nara, Japan
[2] Nagoya Univ, Informat Technol Ctr, Chikusa Ku, Furo Cho, Nagoya, Aichi, Japan
Keywords
VOICE CONVERSION;
DOI
Not available
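Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809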
Abstract
An electrolarynx is a speaking aid that artificially generates excitation sounds to help laryngectomees produce electrolaryngeal (EL) speech. Although EL speech is quite intelligible, its naturalness suffers significantly from the unnatural fundamental frequency (F0) patterns of the mechanical excitation sounds. To make it possible to produce more natural-sounding EL speech, we have proposed a method that automatically controls the F0 patterns of the excitation sounds generated by the electrolarynx based on statistical F0 prediction, which predicts F0 patterns from the produced EL speech in real time. In our previous work, we developed a prototype system by implementing the proposed real-time prediction method in an actual, physical electrolarynx, and through the use of this prototype we found that the improvements in the naturalness of EL speech that it yields tend to be smaller than those yielded by batch-type prediction. In this paper, we examine the negative impact of the latency of real-time prediction on F0 prediction accuracy, and to alleviate it we propose two methods: 1) modeling of segmented continuous F0 (CF0) patterns and 2) prediction of forthcoming F0 values. The experimental results demonstrate that 1) the conventional real-time prediction method needs a large delay to predict CF0 patterns and 2) the proposed methods have a positive impact on real-time prediction.
Pages: 1333-1337
Page count: 5