Experiment with GMM-Based Artefact Localization in Czech Synthetic Speech

被引:7
|
作者
Pribil, Jiri [1 ,2 ]
Pribilova, Anna [3 ]
Matousek, Jindrich [1 ]
机构
[1] Univ W Bohemia, Fac Sci Appl, Dept Cybernet, Plzen 30614, Czech Republic
[2] SAS, Inst Measurement Sci, Bratislava 84104, Slovakia
[3] Slovak Univ Technol Bratislava, Inst Elect & Photon, Fac Elect Engn & Informat Technol, Bratislava 81219, Slovakia
来源
关键词
Quality of synthetic speech; Text-to-speech system; GMM classification; Statistical analysis; MODELS;
D O I
10.1007/978-3-319-24033-6_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper describes an experiment with using the statistical approach based on the Gaussian mixture models (GMM) for localization of artefacts in the synthetic speech produced by the Czech text-to-speech system employing the unit selection principle. In addition, the paper analyzes influence of different number of used GMM mixtures, and the influence of setting of the frame shift during the spectral feature analysis on the resulting artefact position accuracy. Obtained results of performed experiments confirm proper function of the chosen concept and the presented artefact position localizer can be used as an alternative to the standardly applied manual localization method.
引用
收藏
页码:23 / 31
页数:9
相关论文
共 50 条
  • [1] Artefact Determination by GMM-Based Continuous Detection of Emotional Changes in Synthetic Speech
    Pribil, Jiri
    Pribilova, Anna
    Matousek, Jindrich
    [J]. 2019 42ND INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2019, : 45 - 48
  • [2] Evaluation of Synthetic Speech by GMM-Based Continuous Detection of Emotional States
    Pribil, Jiri
    Pribilova, Anna
    Matousek, Jindrich
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2019), 2019, 11697 : 264 - 273
  • [3] Comparing GMM-based speech transformation systems
    Mesbahi, Larbi
    Barreaud, Vincent
    Boeffard, Olivier
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2852 - 2855
  • [4] GMM-BASED ACOUSTIC MODELING FOR EMBEDDED SPEECH RECOGNITION
    Levy, Christophe
    Linares, Georges
    Bonastre, Jean-Francois
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1726 - 1729
  • [5] GMM-based a priori SNR estimation in speech enhancement
    Lei, Jianjun
    Wang, Jian
    Liu, Gang
    Guo, Jun
    [J]. WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 4293 - +
  • [6] GMM-based localization algorithm under NLOS conditions
    Cui, Wei
    Wu, Cheng-Dong
    Zhang, Yun-Zhou
    Jia, Zi-Xi
    Cheng, Long
    [J]. Tongxin Xuebao/Journal on Communications, 2014, 35 (01): : 99 - 106
  • [7] GMM-based speaker age and gender classification in Czech and Slovak
    Pribil, Jiri
    Pribilova, Anna
    Matousek, Jindrich
    [J]. JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS, 2017, 68 (01): : 3 - 12
  • [8] GMM-Based Evaluation of Emotional Style Transformation in Czech and Slovak
    Pribil, Jiri
    Pribilova, Anna
    [J]. COGNITIVE COMPUTATION, 2014, 6 (04) : 928 - 939
  • [9] GMM-Based Evaluation of Emotional Style Transformation in Czech and Slovak
    Jiří Přibil
    Anna Přibilová
    [J]. Cognitive Computation, 2014, 6 : 928 - 939
  • [10] A GMM-based telephone channel classification for Mandarin speech recognition
    Xu, W
    Peng, X
    Wang, BX
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 642 - 645