Frequency-domain techniques for high-quality voice modification

Cited by: 0
Authors
Laroche, J [1]
Affiliation
[1] Creat Adv Technol Ctr, Scotts Valley, CA 95067 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403
Abstract
This paper presents new frequency-domain voice modification techniques that combine the high quality usually obtained by time-domain techniques such as TD-PSOLA with the flexibility provided by the frequency-domain representation. The technique works only for monophonic (single-speaker) sources and relies on (possibly online) pitch detection. Based on the pitch, and according to the desired pitch and formant modifications, individual harmonics are selected and shifted to new locations in the spectrum. The harmonic phases are updated according to a pitch-based method that aims to achieve time-domain shape invariance, thereby reducing or eliminating the usual artifacts associated with frequency-domain and sinusoidal-based voice modification techniques. The result is a fairly inexpensive, flexible algorithm that can match the quality of time-domain techniques while offering a much wider range of available modifications.
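The harmonic selection step described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the function name `map_harmonics`, its parameters, and the nearest-harmonic selection rule are all assumptions made for illustration, and the pitch-based phase update is not covered. Given a detected pitch `f0`, a pitch scaling factor, and a formant scaling factor, the sketch picks, for each output harmonic, the input harmonic that lies closest to the point where the original spectral envelope should be read, and reports how far that harmonic would need to be shifted.

```python
import math


def map_harmonics(f0, pitch_ratio, formant_ratio=1.0, max_freq=8000.0):
    """Hypothetical helper: choose a source harmonic for each output harmonic.

    f0            -- detected pitch of the input frame, in Hz
    pitch_ratio   -- desired output pitch divided by input pitch
    formant_ratio -- desired formant scaling (1.0 keeps formants in place)
    max_freq      -- highest frequency considered, in Hz

    Returns a list of (source_harmonic_index, target_freq_hz, shift_hz).
    """
    if f0 <= 0 or pitch_ratio <= 0 or formant_ratio <= 0:
        raise ValueError("f0 and both ratios must be positive")

    new_f0 = pitch_ratio * f0
    n_out = int(max_freq // new_f0)  # output harmonics below max_freq
    n_in = int(max_freq // f0)       # input harmonics below max_freq

    mapping = []
    for k in range(1, n_out + 1):
        target = k * new_f0
        # Assumption: the output envelope at `target` is read from the
        # input envelope at target / formant_ratio, so take the input
        # harmonic nearest to that frequency (ties round upward).
        src = math.floor(target / (formant_ratio * f0) + 0.5)
        src = min(max(src, 1), n_in)
        mapping.append((src, target, target - src * f0))
    return mapping
```

For example, `map_harmonics(100.0, 2.0, max_freq=1000.0)` raises the pitch by an octave with formants unchanged: every output harmonic coincides with an even input harmonic, so each shift is zero. With `pitch_ratio=1.0` and `formant_ratio=2.0`, each input harmonic is reused for two neighbouring output harmonics, which stretches the envelope upward without changing the pitch.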
Pages: 328-332
Number of pages: 5