Experiment with GMM-Based Artefact Localization in Czech Synthetic Speech

被引：7

作者：

Pribil, Jiri ^{[1
,2
]}

Pribilova, Anna ^{[3
]}

Matousek, Jindrich ^{[1
]}

机构：

[1] Univ W Bohemia, Fac Sci Appl, Dept Cybernet, Plzen 30614, Czech Republic

[2] SAS, Inst Measurement Sci, Bratislava 84104, Slovakia

[3] Slovak Univ Technol Bratislava, Inst Elect & Photon, Fac Elect Engn & Informat Technol, Bratislava 81219, Slovakia

来源：

TEXT, SPEECH, AND DIALOGUE (TSD 2015) | 2015年 / 9302卷

关键词：

Quality of synthetic speech; Text-to-speech system; GMM classification; Statistical analysis; MODELS;

D O I：

10.1007/978-3-319-24033-6_3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The paper describes an experiment with using the statistical approach based on the Gaussian mixture models (GMM) for localization of artefacts in the synthetic speech produced by the Czech text-to-speech system employing the unit selection principle. In addition, the paper analyzes influence of different number of used GMM mixtures, and the influence of setting of the frame shift during the spectral feature analysis on the resulting artefact position accuracy. Obtained results of performed experiments confirm proper function of the chosen concept and the presented artefact position localizer can be used as an alternative to the standardly applied manual localization method.

引用

页码：23 / 31

页数：9

共 50 条

[1] Artefact Determination by GMM-Based Continuous Detection of Emotional Changes in Synthetic Speech
Pribil, Jiri
Pribilova, Anna
Matousek, Jindrich
[J]. 2019 42ND INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2019, : 45 - 48
[2] Evaluation of Synthetic Speech by GMM-Based Continuous Detection of Emotional States
Pribil, Jiri
Pribilova, Anna
Matousek, Jindrich
[J]. TEXT, SPEECH, AND DIALOGUE (TSD 2019), 2019, 11697 : 264 - 273
[3] Comparing GMM-based speech transformation systems
Mesbahi, Larbi
Barreaud, Vincent
Boeffard, Olivier
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2852 - 2855
[4] GMM-BASED ACOUSTIC MODELING FOR EMBEDDED SPEECH RECOGNITION
Levy, Christophe
Linares, Georges
Bonastre, Jean-Francois
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1726 - 1729
[5] GMM-based a priori SNR estimation in speech enhancement
Lei, Jianjun
Wang, Jian
Liu, Gang
Guo, Jun
[J]. WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 4293 - +
[6] GMM-based localization algorithm under NLOS conditions
Cui, Wei
Wu, Cheng-Dong
Zhang, Yun-Zhou
Jia, Zi-Xi
Cheng, Long
[J]. Tongxin Xuebao/Journal on Communications, 2014, 35 (01): : 99 - 106
[7] GMM-based speaker age and gender classification in Czech and Slovak
Pribil, Jiri
Pribilova, Anna
Matousek, Jindrich
[J]. JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS, 2017, 68 (01): : 3 - 12
[8] GMM-Based Evaluation of Emotional Style Transformation in Czech and Slovak
Pribil, Jiri
Pribilova, Anna
[J]. COGNITIVE COMPUTATION, 2014, 6 (04) : 928 - 939
[9] GMM-Based Evaluation of Emotional Style Transformation in Czech and Slovak
Jiří Přibil
Anna Přibilová
[J]. Cognitive Computation, 2014, 6 : 928 - 939
[10] A GMM-based telephone channel classification for Mandarin speech recognition
Xu, W
Peng, X
Wang, BX
[J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 642 - 645

← 1 2 3 4 5 →