Segment Specific Concatenation Cost for Syllable Based Bengali TTS

被引:0
|
作者
Narendra, N. P. [1 ]
Rao, K. Sreenivasa [1 ]
机构
[1] Indian Inst Technol, Sch Informat Technol, Kharagpur 721302, W Bengal, India
来源
CONTEMPORARY COMPUTING | 2011年 / 168卷
关键词
Concatenation cost calculation; unit selection; Bengali TTS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper proposes a new method of concatenation cost calculation for enhancing the optimality in unit selection. Instead of defining same set of concatenation costs for all types of speech unit transitions, costs are defined based on the type of unit transitions. Different types of unit transitions that can occur mainly in an utterance are voiced to voiced, voiced to unvoiced and unvoiced to unvoiced transitions. Natural measure of continuity is identified for each of these transitions, and costs are defined accordingly. For voiced to voiced transitions, in addition to spectral continuity, pitch and energy continuity metrics are proposed. In case of voiced to unvoiced and unvoiced to unvoiced transitions, silence duration embedded in the unvoiced region is proposed as the continuity metric. This approach of segment specific concatenation cost calculation improves the quality of syllable based text to speech synthesis. Listening tests provide a proof on the effectiveness of proposed methodology which has clearly shown the decrease in perceptual discontinuity at joins, and improvement in the overall quality of the synthesised speech.
引用
收藏
页码:371 / 382
页数:12
相关论文
共 50 条
  • [1] Concatenation cost calculation and optimisation for unit selection in TTS
    Blouin, C
    Rosec, O
    Bagshaw, PC
    d'Alessandro, C
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 231 - 234
  • [2] Syllable HMM based Mandarin TTS and Comparison with Concatenative TTS
    Shuang, Zhiwei
    Kang, Shiyin
    Shi, Qin
    Qin, Yong
    Cai, Lianhong
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1755 - +
  • [3] Voice synthesis application based on syllable concatenation
    Buza, O.
    Toderean, G. L.
    Domokos, J.
    Bodo, A. Zs.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR 2008), THETA 16TH EDITION, VOL III, PROCEEDINGS, 2008, : 473 - 478
  • [4] Syllable clustering and spectral discontinuity in syllable-based TTS systems
    Chen, FX
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 688 - 691
  • [5] A TtS system for the Greek language based on concatenation of formant coded segments
    Yiourgalis, N
    Kokkinakis, G
    [J]. SPEECH COMMUNICATION, 1996, 19 (01) : 21 - 38
  • [6] Context Based Speech Analysis of Bengali Language as a Part of TTS Conversion
    Mukherjee, Nabanita
    Mukherjee, Imon
    Bhattacharyya, Debnath
    Kim, Tai-hoon
    [J]. SIGNAL PROCESSING, IMAGE PROCESSING AND PATTERN RECOGNITION, 2011, 260 : 204 - +
  • [7] Improvement of Syllable based TTS System in Assamese using Prosody Modification
    Sharma, Bidisha
    Prasanna, S. R. Mahadeva
    [J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [8] Improvement of syllable based TTS system in assamese using prosody modification
    Department of Electronics and Electrical Engineering, Indian Institute of Technology Guwahati, Guwahati
    781039, India
    [J]. IEEE Int. Conf. Electron., Energy, Environ., Commun., Comput., Control: (E3-C3), INDICON, 1600,
  • [9] Development of syllable-based text to speech synthesis system in Bengali
    Narendra, N.
    Rao, K.
    Ghosh, Krishnendu
    Vempada, Ramu
    Maity, Sudhamay
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2011, 14 (03) : 167 - 181
  • [10] A TDPSOLA Based Concatenation Technique for Bengali Text to Speech Synthesis System Subachan
    Swarna, Kamrunnahar
    Naser, Abu
    [J]. 2016 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (ICECE), 2016, : 102 - 105