An Investigation of Acoustic Features for Singing Voice Conversion based on Perceptual Age

被引:0
|
作者
Kobayashi, Kazuhiro [1 ]
Doi, Hironori [1 ]
Toda, Tomoki [1 ]
Nakano, Tomoyasu [2 ]
Goto, Masataka [2 ]
Neubig, Graham [1 ]
Sakti, Sakriani [1 ]
Nakamura, Satoshi [1 ]
机构
[1] Nara Inst Sci & Technol NAIST, Grad Sch Informat Sci, Ikoma, Nara, Japan
[2] Natl Inst Adv Ind Sci & Technol, Tsukuba, Ibaraki, Japan
关键词
singing voice; voice conversion; perceptual age; spectral and prosodic features; subjective evaluations;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the acoustic features that can be modified to control the perceptual age of a singing voice. Singers can sing expressively by controlling prosody and vocal timbre, but the varieties of voices that singers can produce are limited by physical constraints. Previous work has attempted to overcome this limitation through the use of statistical voice conversion. This technique makes it possible to convert singing voice characteristics of an arbitrary source singer into those of an arbitrary target singer. However, it is still difficult to intuitively control singing voice characteristics by manipulating parameters corresponding to specific physical traits, such as gender and age. In this paper, we focus on controlling the perceived age of the singer and, as a first step, perform an investigation of the factors that play a part in the listener's perception of the singer's age. The experimental results demonstrate that 1) the perceptual age of singing voices corresponds relatively well to the actual age of the singer, 2) speech analysis/synthesis processing and statistical voice conversion processing don't cause adverse effects on the perceptual age of singing voices, and 3) prosodic features have a larger effect on the perceptual age than spectral features.
引用
收藏
页码:1056 / 1060
页数:5
相关论文
共 50 条
  • [21] Singing with Gesture: Acoustic and Perceptual Measures of Solo Singers
    Brunkan, Melissa C.
    Bowers, Jason
    [J]. JOURNAL OF VOICE, 2021, 35 (02) : 325.e17 - 325.e22
  • [22] Acoustic and perceptual assessment of vibrato quality of singing students
    Amir, Noam
    Michaeli, Orit
    Amir, Ofer
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2006, 1 (02) : 144 - 150
  • [23] Statistical Singing Voice Conversion based on Direct Waveform Modification with Global Variance
    Kobayashi, Kazuhiro
    Toda, Tomoki
    Neubig, Graham
    Sakti, Sakriani
    Nakamura, Satoshi
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2754 - 2758
  • [24] IMPROVING ADVERSARIAL WAVEFORM GENERATION BASED SINGING VOICE CONVERSION WITH HARMONIC SIGNALS
    Guo, Haohan
    Zhou, Zhiping
    Meng, Fanbo
    Liu, Kai
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6657 - 6661
  • [25] Correlation of acoustic features of pitch/rhythm/power and perceptual impressions after singing training for people with dysarthria
    Nanahara , Maki
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    [J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2022, 43 (01) : 22 - 31
  • [26] On the Perception of Affect in the Singing Voice: A Study of Acoustic Cues
    Mouawad, Pauline
    Desainte-Catherine, Myriam
    Gegout-Petit, Anne
    Semal, Catherine
    [J]. SOUND, MUSIC, AND MOTION, 2014, 8905 : 105 - 121
  • [27] Effects of vocal training on the acoustic parameters of the singing voice
    Mendes, AP
    Rothman, HB
    Sapienza, C
    Brown, WS
    [J]. JOURNAL OF VOICE, 2003, 17 (04) : 529 - 543
  • [28] ACOUSTIC COMPARISON OF VOICE USE IN SOLO AND CHOIR SINGING
    ROSSING, TD
    SUNDBERG, J
    TERNSTROM, S
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1986, 79 (06): : 1975 - 1981
  • [29] Aesthetic Perception of the Singing Voice in Relation to the Acoustic Conditions
    Potocan, Zoran
    [J]. MUZIKOLOSKI ZBORNIK, 2020, 56 (01): : 282 - 284
  • [30] Comparing the acoustic expression of emotion in the speaking and the singing voice
    Scherer, Klaus R.
    Sundberg, Johan
    Tamarit, Lucas
    Salomao, Glaucia L.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2015, 29 (01): : 218 - 235