Automatic phonetic segmentation by score predictive model for the corpora of Mandarin singing voices

Cited by: 13
Authors
Lin, Cheng-Yuan [1]
Jang, Jyh-Shing Roger [1]
Affiliation
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 300, Taiwan
Keywords
automatic phonetic segmentation; boundary refinement; score predictive model (SPM); singing voice synthesis
DOI
10.1109/TASL.2007.902051
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
This paper proposes the concept of a score predictive model (SPM) that can refine the phoneme boundaries obtained by a hidden Markov model (HMM) and dynamic time warping (DTW) for a Mandarin singing voice corpus. An SPM is constructed by using support vector regression. It predicts the score of a phoneme boundary according to the boundary's 58-dimensional feature vector. The correctly identified boundaries of a singing corpus can then be used for corpus-based singing voice synthesis. Several experiments with different settings, including the use of different initial estimates, different acoustic features, and various regression approaches, were designed to verify the feasibility of the proposed approach. Experimental results demonstrate that the proposed SPM is able to effectively refine the results of the HMM and DTW.
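The refinement idea in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state how candidate boundaries are searched, so the fixed search window, the names `refine_boundary`, `features_at`, `predict_score`, and `search_radius`, and the toy one-dimensional features are all assumptions. In the paper the scorer is a support vector regression model over a 58-dimensional boundary feature vector; here any callable stands in for it.

```python
from typing import Callable, Optional, Sequence


def refine_boundary(
    initial_frame: int,
    features_at: Callable[[int], Sequence[float]],
    predict_score: Callable[[Sequence[float]], float],
    search_radius: int = 5,
    num_frames: Optional[int] = None,
) -> int:
    """Return the frame near initial_frame with the highest predicted score.

    initial_frame : boundary estimate from a first pass (e.g. HMM or DTW).
    features_at   : hypothetical helper returning the feature vector at a frame.
    predict_score : hypothetical stand-in for the trained score predictor.
    search_radius : assumed half-width of the search window, in frames.
    """
    lo = max(0, initial_frame - search_radius)
    hi = initial_frame + search_radius
    if num_frames is not None:
        hi = min(hi, num_frames - 1)

    best_frame = initial_frame
    best_score = float("-inf")
    for frame in range(lo, hi + 1):
        score = predict_score(features_at(frame))
        closer = abs(frame - initial_frame) < abs(best_frame - initial_frame)
        # Keep the highest score; on ties, prefer the frame nearer the
        # initial estimate.
        if score > best_score or (score == best_score and closer):
            best_frame, best_score = frame, score
    return best_frame


# Toy stand-ins: a 1-D "feature" and a scorer that peaks at frame 42.
def toy_features(frame: int) -> Sequence[float]:
    return [float(frame)]


def toy_score(features: Sequence[float]) -> float:
    return -abs(features[0] - 42.0)


refined = refine_boundary(40, toy_features, toy_score, search_radius=5)
```

With the toy scorer, an initial estimate at frame 40 moves to the frame where the predicted score is highest within the window. A real system would replace `toy_features` and `toy_score` with acoustic feature extraction and a trained regressor.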
Pages: 2151-2159
Page count: 9
Related papers
8 in total
  • [1] Automatic Phonetic Segmentation by Using a SPM-based Approach for a Mandarin Singing Voice Corpus
    Lin, Cheng-Yuan
    Jang, J-S. Roger
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2294 - 2297
  • [2] AUTOMATIC PHONETIC SEGMENTATION IN MANDARIN CHINESE: BOUNDARY MODELS, GLOTTAL FEATURES AND TONE
    Yuan, Jiahong
    Ryant, Neville
    Liberman, Mark
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [3] Automatic Phonetic Segmentation Using HMM Model in Uyghur Language
    Eli, Gulnar
    Hamdulla, Askar
    [J]. MULTIMEDIA AND SIGNAL PROCESSING, 2012, 346 : 615 - +
  • [4] Hybrid model method for automatic segmentation of mandarin TTS corpus
    Yuan, Xiaoliang
    Dong, Yuan
    Huang, Dezhi
    Guo, Jun
    Wang, Haila
    [J]. INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 906 - 912
  • [5] Automatic phonetic segmentation of Hindi speech using hidden Markov model
    Archana Balyan
    S. S. Agrawal
    Amita Dev
    [J]. AI & SOCIETY, 2012, 27 (4) : 543 - 549
  • [6] Automatic phonetic segmentation of Hindi speech using hidden Markov model
    Balyan, Archana
    Agrawal, S.
    Dev, Amita
    [J]. AI & SOCIETY, 2012, 27 (04) : 543 - 549
  • [7] Detection of Large Segmentation Errors with Score Predictive Model
    Matura, Martin
    Matousek, Jindrich
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 524 - 532
  • [8] THE VANCOUVER OUTPATIENT ILEOSTOMY CLOSURE SUITABILITY (VOICES) SCORE: A PREDICTIVE MODEL TO FACILITATE OUTPATIENT CLOSURE ILEOSTOMY SURGERY.
    Letarte, F.
    Raval, M.
    Karimuddin, A.
    Phang, T.
    Brown, C.
    [J]. DISEASES OF THE COLON & RECTUM, 2016, 59 (05) : E169 - E169