Context Features Based Pre-Selection and Weight Prediction in Concatenation Speech Synthesis System

Cited by: 0
Authors
Liu, Shanfeng [1]
Wen, Zhengqi [1]
Li, Ya [1]
Tao, Jianghua [1]
Liu, Bin [1]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100864, Peoples R China
Keywords
concatenation speech synthesis; hierarchical pre-selection; weight prediction
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Generating natural-sounding synthesized speech remains a challenge for researchers in speech synthesis. Experiments show that speech concatenated from units selected from a large speech corpus performs better. However, limiting the search space and predicting the weights used in the target cost calculation remain important problems. This paper presents a detailed hierarchical pre-selection method to limit the search space. After three layers of pre-selection, a set of units is retained as the candidate units. To ensure continuity in duration, a prediction model is used within the hierarchical pre-selection. In addition, the M5P algorithm, which combines decision trees with regression, is used to predict the weights needed in the target cost calculation. Experimental results show that these two approaches can generate high-quality speech.
Pages: 506-510
Number of pages: 5