An Investigation of Acoustic Features for Singing Voice Conversion based on Perceptual Age

被引:0
|
作者
Kobayashi, Kazuhiro [1 ]
Doi, Hironori [1 ]
Toda, Tomoki [1 ]
Nakano, Tomoyasu [2 ]
Goto, Masataka [2 ]
Neubig, Graham [1 ]
Sakti, Sakriani [1 ]
Nakamura, Satoshi [1 ]
机构
[1] Nara Inst Sci & Technol NAIST, Grad Sch Informat Sci, Ikoma, Nara, Japan
[2] Natl Inst Adv Ind Sci & Technol, Tsukuba, Ibaraki, Japan
关键词
singing voice; voice conversion; perceptual age; spectral and prosodic features; subjective evaluations;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the acoustic features that can be modified to control the perceptual age of a singing voice. Singers can sing expressively by controlling prosody and vocal timbre, but the varieties of voices that singers can produce are limited by physical constraints. Previous work has attempted to overcome this limitation through the use of statistical voice conversion. This technique makes it possible to convert singing voice characteristics of an arbitrary source singer into those of an arbitrary target singer. However, it is still difficult to intuitively control singing voice characteristics by manipulating parameters corresponding to specific physical traits, such as gender and age. In this paper, we focus on controlling the perceived age of the singer and, as a first step, perform an investigation of the factors that play a part in the listener's perception of the singer's age. The experimental results demonstrate that 1) the perceptual age of singing voices corresponds relatively well to the actual age of the singer, 2) speech analysis/synthesis processing and statistical voice conversion processing don't cause adverse effects on the perceptual age of singing voices, and 3) prosodic features have a larger effect on the perceptual age than spectral features.
引用
收藏
页码:1056 / 1060
页数:5
相关论文
共 50 条
  • [1] Perceptual (but not acoustic) features predict singing voice preferences
    Bruder, Camila
    Poeppel, David
    Larrouy-Maestri, Pauline
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
  • [2] REGRESSION APPROACHES TO PERCEPTUAL AGE CONTROL IN SINGING VOICE CONVERSION
    Kobayashi, Kazuhiro
    Toda, Tomoki
    Nakano, Tomoyasu
    Goto, Masataka
    Neubig, Graham
    Sakti, Sakriani
    Nakamura, Satoshi
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [3] Voice Timbre Control Based on Perceived Age in Singing Voice Conversion
    Kobayashi, Kazuhiro
    Toda, Tomoki
    Doi, Hironori
    Nakano, Tomoyasu
    Goto, Masataka
    Neubig, Graham
    Sakti, Sakriani
    Nakamura, Satoshi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (06): : 1419 - 1428
  • [4] Improvements of Voice Timbre Control Based on Perceived Age in Singing Voice Conversion
    Kobayashi, Kazuhiro
    Toda, Tomoki
    Nakano, Tomoyasu
    Goto, Masataka
    Nakamura, Satoshi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (11): : 2767 - 2777
  • [5] How Face Masks Affect Acoustic and Auditory Perceptual Characteristics of the Singing Voice
    Oren, Liran
    Rollins, Michael
    Gutmark, Ephraim
    Howell, Rebecca
    [J]. JOURNAL OF VOICE, 2023, 37 (04) : 515 - 521
  • [6] Emotion in the singing voice—a deeperlook at acoustic features in the light ofautomatic classification
    Florian Eyben
    Gláucia L Salomão
    Johan Sundberg
    Klaus R Scherer
    Björn W Schuller
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2015
  • [7] Voice-over: Perceptual and acoustic analysis of vocal features
    Medrado, R
    Ferreira, LP
    Behlau, M
    [J]. JOURNAL OF VOICE, 2005, 19 (03) : 340 - 349
  • [8] Unsupervised Singing Voice Conversion
    Nachmani, Eliya
    Wolf, Lior
    [J]. INTERSPEECH 2019, 2019, : 2583 - 2587
  • [9] Acoustic analysis of the singing and speaking voice in singing students
    Lundy, DS
    Roy, S
    Casiano, RR
    Xue, JW
    Evans, J
    [J]. JOURNAL OF VOICE, 2000, 14 (04) : 490 - 493
  • [10] Evaluation of a Singing Voice Conversion Method Based on Many-to-Many Eigenvoice Conversion
    Doi, Hironori
    Toda, Tomoki
    Nakano, Tomoyasu
    Goto, Masataka
    Nakamura, Satoshi
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1066 - 1070