Physiological Quality-of-Experience Assessment of Text-to-Speech Systems

被引:0
|
作者
Gupta, Rishabh [1 ]
Falk, Tiago H. [1 ]
机构
[1] Univ Quebec, INRS EMT, Montreal, PQ, Canada
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the emergence of various text-to-speech (TTS) systems, developers have to provide superior user experience in order to remain competitive. To this end, quality-of-experience (QoE) perception modelling and measurement has become a key priority. QoE models rely on three influence factors: technological, contextual and human. Existing solutions have typically relied on using individual physiological modalities, such as electroen-cephalography (EEG), to model human influence factors (HIFs). In this paper, we show that fusion of physiological modalities, such as EEG, functional near infrared spectroscopy (fNIRS) and heart rate, provide gains of up to 18.4% relative to utilizing only technological factors and 4% relative to using the best performing individual physiological modality.
引用
收藏
页数:2
相关论文
共 50 条
  • [1] Multimodal Physiological Quality-of-Experience Assessment of Text-to-Speech Systems
    Gupta, Rishabh
    Banville, Hubert J.
    Falk, Tiago H.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (01) : 22 - 36
  • [2] Perceptual Quality Dimensions of Text-to-Speech Systems
    Hinterleitner, Florian
    Moeller, Sebastian
    Norrenbrock, Christoph
    Heute, Ulrich
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2188 - 2191
  • [3] Enhancing the Quality of Nepali Text-to-Speech Systems
    Ghimire, Rupak Raj
    Bal, Bal Krishna
    [J]. CREATIVITY IN INTELLIGENT TECHNOLOGIES AND DATA SCIENCE, (CIT&DS), 2017, 754 : 187 - 197
  • [4] PHYSYQX: A DATABASE FOR PHYSIOLOGICAL EVALUATION OF SYNTHESISED SPEECH QUALITY-OF-EXPERIENCE
    Gupta, Rishabh
    Banville, Hubert J.
    Falk, Tiago H.
    [J]. 2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [5] Instrumental Assessment of Prosodic Quality for Text-to-Speech Signals
    Norrenbrock, Christoph R.
    Hinterleitner, Florian
    Heute, Ulrich
    Moeller, Sebastian
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (05) : 255 - 258
  • [6] Latent factor analysis for synthesized speech quality-of-experience assessment
    Rishabh Gupta
    Tiago H. Falk
    [J]. Quality and User Experience, 2017, 2 (1)
  • [7] Comparison of measures of speech quality for listening tests of text-to-speech systems
    Viswanathan, M
    Viswanathan, M
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 11 - 14
  • [8] Automatic Speech Recognition Used for Intelligibility Assessment of Text-to-Speech Systems
    Vich, Robert
    Nouza, Jan
    Vondra, Martin
    [J]. VERBAL AND NONVERBAL FEATURES OF HUMAN-HUMAN AND HUMAN-MACHINE INTERACTIONS, 2008, 5042 : 136 - +
  • [9] Comparison of Approaches for Instrumentally Predicting the Quality of Text-To-Speech Systems
    Moeller, Sebastian
    Hinterleitner, Florian
    Falk, Tiago H.
    Polzehl, Tim
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1325 - +
  • [10] A text analyzer for Korean text-to-speech systems
    Lee, SH
    Oh, YH
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1692 - 1695