Real-Time Vibration Control of An Electrolarynx based on Statistical F0 Contour Prediction

被引:0
|
作者
Tanaka, Kou [1 ]
Toda, Tomoki [2 ]
Neubig, Graham [1 ]
Nakamura, Satoshi [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, 8916-5 Takayama Cho, Ikoma, Nara, Japan
[2] Nagoya Univ, Informat Technol Ctr, Chikusa Ku, Furo Cho, Nagoya, Aichi, Japan
关键词
VOICE CONVERSION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An electrolarynx is a speaking aid device to artificially generate excitation sounds to help laryngectomees produce electrolaryngeal (EL) speech. Although EL speech is quite intelligible, its naturalness significantly suffers from the unnatural fundamental frequency (F-0) patterns of the mechanical excitation sounds. To make it possible to produce more naturally sounding EL speech, we have proposed a method to automatically control F-0 patterns of the excitation sounds generated from the electrolarynx based on the statistical F-0 prediction, which predicts F-0 patterns from the produced EL speech in real-time. In our previous work, we have developed a prototype system by implementing the proposed real-time prediction method in an actual, physical electrolarynx, and through the use of the prototype system, we have found that improvements of the naturalness of EL speech yielded by the prototype system tend to be lower than that yielded by the batch-type prediction. In this paper, we examine negative impacts caused by latency of the real-time prediction on the F-0 prediction accuracy, and to alleviate them, we also propose two methods, 1) modeling of segmented continuous F-0 (CF0) patterns and 2) prediction of forthcoming F-0 values. The experimental results demonstrate that 1) the conventional real-time prediction method needs a large delay to predict CF0 patterns and 2) the proposed methods have positive impacts on the real-time prediction.
引用
收藏
页码:1333 / 1337
页数:5
相关论文
共 50 条
  • [31] Totally Data-driven Intonation Prediction Model Using a Novel F0 Contour Parametric Representation
    Yi, Lifu
    Li, Jian
    Lou, Xiaoyan
    Hao, Jie
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 313 - 316
  • [32] Accurate and Real-time Lip Contour Extraction Based on Constrained Contour Growing
    Yang, Ying
    Wang, Xiangdong
    Qian, Yueliang
    Lin, Shouxun
    JCPC: 2009 JOINT CONFERENCE ON PERVASIVE COMPUTING, 2009, : 589 - +
  • [33] Real-time exact contour error calculation of NURBS tool path for contour control
    Liu, Zhe
    Dong, Jingchuan
    Wang, Taiyong
    Ren, Chengzu
    Guo, Jianxin
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2020, 108 (9-10): : 2803 - 2821
  • [34] Real-time exact contour error calculation of NURBS tool path for contour control
    Zhe Liu
    Jingchuan Dong
    Taiyong Wang
    Chengzu Ren
    Jianxin Guo
    The International Journal of Advanced Manufacturing Technology, 2020, 108 : 2803 - 2821
  • [35] Probabilistic speech F0 contour model incorporating statistical vocabulary model of phrase-accent command sequence
    Ishihara, Tatsuma
    Kameoka, Hirokazu
    Yoshizato, Kota
    Saito, Daisuke
    Sagayama, Shigeki
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1016 - 1020
  • [36] Real-time exact contour error calculation of NURBS tool path for contour control
    Liu, Zhe
    Dong, Jingchuan
    Wang, Taiyong
    Ren, Chengzu
    Guo, Jianxin
    International Journal of Advanced Manufacturing Technology, 2020, 108 (9-10): : 2803 - 2821
  • [37] IMPLEMENTATION OF F0 TRANSFORMATION FOR STATISTICAL SINGING VOICE CONVERSION BASED ON DIRECTWAVEFORM MODIFICATION
    Kobayashi, Kazuhiro
    Toda, Tomoki
    Nakamura, Satoshi
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5670 - 5674
  • [38] Identification and synthesis of Cantonese tones based on the command-response model for F0 contour generation
    Gu, WT
    Hirose, K
    Fujisaki, H
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 289 - 292
  • [39] Real-time Assistance Control of Hip Exoskeleton Based on Motion Prediction
    Xu L.
    Yang W.
    Yang C.
    Zhang J.
    Wang T.
    Jiqiren/Robot, 2021, 43 (04): : 473 - 483
  • [40] Real-Time Predictive Control of Structural Vibration Based on Reduced Order Model
    Liu Jianjun
    Xia Kaiquan
    Zhu Caixia
    PROCEEDINGS OF 2009 CONFERENCE ON COMMUNICATION FACULTY, 2009, : 434 - +