Voice conversion by prosody and vocal tract modification

被引:0
|
作者
Rao, K. Sreenivasa [1 ]
Yegnanarayana, B. [2 ]
机构
[1] Indian Inst Technol, Dept Elect Commun Engn, Gauhati 781039, India
[2] Indian Inst Technol, Dept Comp Sci & Engn, Madras 600036, Tamil Nadu, India
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we proposed some flexible methods, which are useful in the process of voice conversion. The proposed methods modify the shape of the vocal tract system and the characteristics of the prosody according to the desired requirement. The shape of the vocal tract system is modified by shifting the major resonant frequencies (formants) of the short term spectrum, and altering their bandwidths accordingly. In the case of prosody modification, the required durational and intonational characteristics are imposed on the given speech signal. In the proposed method, the prosodic characteristics are manipulated using instants of significant excitation. The instants of significant excitation correspond to the instants of glottal closure (epochs) in the case of voiced speech, and to some random excitations like onset of burst in the case of nonvoiced speech. Instants of significant excitation are computed from the Linear Prediction (LP) residual of the speech signals by using the property of average group delay of minimum phase signals. The manipulations of durational characteristics and pitch contour (intonation pattern) are achieved by manipulating the LP residual with the help of the knowledge of the instants of significant excitation, The modified LP residual is used to excite the time varying filter. The filter parameters are updated according to the desired vocal tract characteristics. The proposed methods are evaluated using listening tests.
引用
收藏
页码:111 / +
页数:2
相关论文
共 50 条
  • [1] Voice Conversion System using SVM for Vocal Tract Modification and Codebook based Model for Pitch Contour Modification
    Laskar, R. H.
    Talukdar, F. A.
    Bhattacharjee, Rajib
    Das, Saugat
    [J]. 2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4, 2008, : 2205 - 2210
  • [2] Transformation of Prosody in Voice Conversion
    Sisman, Berrak
    Li, Haizhou
    Tan, Kay Chen
    [J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1588 - 1597
  • [3] Vocal Tract Spectrum Transformation Based on Clustering in Voice Conversion System
    Xie Weichao
    Zhang Linghua
    [J]. PROCEEDING OF THE IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2012, : 236 - 240
  • [4] Mapping Articulatory-Features to Vocal-Tract Parameters for Voice Conversion
    Ariwardhani, Narpendyah Wisjnu
    Kimura, Masashi
    Iribe, Yurie
    Katsurada, Kouichi
    Nitta, Tsuneo
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (04): : 911 - 918
  • [5] A novel method for prosody prediction in voice conversion
    Helander, Elina E.
    Nurminen, Jani
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 509 - +
  • [6] Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN
    Du, Zongyang
    Zhou, Kun
    Sisman, Barrak
    Li, Haizhou
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 507 - 513
  • [7] Unsupervised Vocal Tract Length Warped Posterior Features for Non-Parallel Voice Conversion
    Shah, Nirmesh J.
    Madhavi, Maulik C.
    Patil, Hemant A.
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1968 - 1972
  • [8] Glottal and Vocal Tract Characteristics of Voice Impersonators
    Bin Amin, Talal
    Marziliano, Pina
    German, James Sneed
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (03) : 668 - 678
  • [9] Vocal tract resonances in singing: The soprano voice
    Joliveau, E
    Smith, J
    Wolfe, J
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (04): : 2434 - 2439
  • [10] Voice quality enhancement for vocal tract rehabilitation
    Sutcliffe, Bianca
    Wiggins, Lindzi
    Rubin, David
    Aharonson, Vered
    [J]. 2018 3RD BIENNIAL SOUTH AFRICAN BIOMEDICAL ENGINEERING CONFERENCE (SAIBMEC), 2018,