The analysis of voice quality in speech processing

被引:0
|
作者
Keller, E [1 ]
机构
[1] Univ Lausanne, Fac Lettres, IMM, CH-1015 Lausanne, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Voice quality has been defined as the characteristic auditory colouring of an individual's voice, derived from a variety of laryngeal and supralaryngeal features and running continuously through the individual's speech. The distinctive tone of speech sounds produced by a particular person yields a particular voice. Voice quality is at the centre of several speech processing issues. In speech recognition, voice differences, particularly extreme divergences from the norm, are responsible for known performance degradations. In speech synthesis on the other hand, voice quality is a desirable modelling parameter, with millions of voice types that can be distinguished theoretically. This article reviews the experimental derivation of voice quality markers. Specifically, the use of perceptual judgements, the long-term averaged spectrum (LTAS) and prosodic markers is examined, as well as inverse filtering for the extraction of the glottal source waveform. This review suggests that voice quality is best investigated as a multi-dimensional parameter space involving a combination of factors involving individual prosody, temporally structured speech characteristics, spectral divergence and voice source features, and that it could profitably complement simple linguistic prosodic model processing in speech synthesis.
引用
收藏
页码:54 / 73
页数:20
相关论文
共 50 条
  • [1] EVALUATION OF SPEECH PROCESSING INNOVATIONS TO IMPROVE VOICE QUALITY AND INTELLIGIBILITY IN NARROW-BAND VOICE COMMUNICATION
    SMITH, CP
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1973, 53 (01): : 321 - &
  • [2] Voice signal processing for speech synthesis
    Buza, Ovidiu
    Toderean, Gavril
    Nica, Alina
    Caruntu, Alexandru
    [J]. 2006 IEEE-TTTC INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS, VOL 2, PROCEEDINGS, 2006, : 360 - 364
  • [3] Factors affecting the quality of sound recording for speech and voice analysis
    Vogel, Adam P.
    Morgan, Angela T.
    [J]. INTERNATIONAL JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2009, 11 (06) : 431 - 437
  • [4] PERCEPTUAL ANALYSIS OF VOICE QUALITY IN SUSTAINED VOWELS AND CONNECTED SPEECH
    SODERSTEN, M
    HAMMARBERG, B
    [J]. FOLIA PHONIATRICA, 1989, 41 (4-5): : 214 - 214
  • [5] The effect of speech melody on voice quality
    Swerts, M
    Veldhuis, R
    [J]. SPEECH COMMUNICATION, 2001, 33 (04) : 297 - 303
  • [6] The emotional quality of speech in voice services
    Maffiolo, V
    Chateau, N
    [J]. ERGONOMICS, 2003, 46 (13-14) : 1375 - 1385
  • [7] Before Speech: Cerebral Voice Processing in Infants
    Belin, Pascal
    Grosbras, Marie-Helene
    [J]. NEURON, 2010, 65 (06) : 733 - 735
  • [8] Acoustic analysis and digital signal processing for the assessment of voice quality
    Jalali-najafabadi, Farideh
    Gadepalli, Chaitanya
    Jarchi, Delaram
    Cheetham, Barry M. G.
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 70
  • [9] Effects of age on speech and voice quality ratings
    Goy, Huiwen
    Pichora-Fuller, M. Kathleen
    van Lieshout, Pascal
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 139 (04): : 1648 - 1659
  • [10] Voice Quality of European Portuguese Emotional Speech
    Nunes, Ana
    Coimbra, Rosa Lidia
    Teixeira, Antonio
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, 2010, 6001 : 142 - 151