A Flexible Architecture for Urdu Phonemes-Based Concatenative Speech Synthesis

被引:0
|
作者
Ahmad, Muhammad Rizwan [1 ]
Arshad, Muhammad Junaid [1 ]
机构
[1] Univ Engn & Technol, Dept Comp Sci, Lahore, Pakistan
关键词
Articulatory; Text-to-Speech; Formant; Concatenative; Natural Language Processing; Waveforms; Speech Units; Phonemes; Speech Synthesis;
D O I
10.22581/muet1982.1603.07
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
TTS (Text-to-Speech) synthesis systems are extensively used across the world to intensify the accessibility of information and to make it possible for the handicapped to be involved directly with computers to get the benefits from this high technology revolution. Various TTS synthesis techniques have been used with their own advantages and limitations. There is not a concatenative synthesis strategy based architecture for Urdu TTS synthesis system for handling the homographs and to avoid the unnatural robot sounding speech produced due the use of di-phones. In this paper, we propose a flexible architecture for Urdu TTS synthesis system that uses concatenative synthesis strategy because this approach has the ability to join together the small corpus of speech to generate natural and intelligible sound. The main aspiration of this research is to disambiguate the homographs in the Urdu language and to avoid the unnatural robot sounding speech. Finally, the effectiveness of the system is tested in terms of intelligibility and acceptability on word and sentence level. The intelligibility rate is near to 80% and 65% while acceptability rate for the naturalness is 95% (75% natural, 20% acceptable).
引用
收藏
页码:373 / 380
页数:8
相关论文
共 50 条
  • [1] A Concatenative Synthesis Based Speech Synthesiser for Hindi
    Gupta, Kshitij
    ADVANCES IN COMPUTER AND INFORMATIOM SCIENCES AND ENGINEERING, 2008, : 261 - 264
  • [2] Syllable Based Concatenative Synthesis for Text to Speech Conversion
    Ananthi, S.
    Dhanalakshmi, P.
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 3, 2015, 33
  • [3] VLSI architecture design for concatenative speech synthesizer
    Chu, Li-Ping
    Wang, Jia-Ching
    Wang, Jhing-Fa
    TENCON 2005 - 2005 IEEE REGION 10 CONFERENCE, VOLS 1-5, 2006, : 723 - +
  • [4] Introduction to Multilingual Corpus-Based Concatenative Speech Synthesis
    Deprez, Filip
    Odijk, Jan
    De Moortel, Jan
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 357 - 360
  • [5] SET OF CONCATENATIVE UNITS FOR SPEECH SYNTHESIS
    OLIVE, J
    LIBERMAN, M
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 : S130 - S130
  • [6] Syllable-Based Concatenative Speech Synthesis for Marathi Language
    Ghate, Pravin M.
    Shirbahadurkar, Suresh D.
    INFORMATION AND COMMUNICATION TECHNOLOGY FOR COMPETITIVE STRATEGIES, 2019, 40 : 615 - 624
  • [7] On the detection of discontinuities in concatenative speech synthesis
    Pantazis, Yannis
    Stylianou, Yannis
    PROGRESS IN NONLINEAR SPEECH PROCESSING, 2007, 4391 : 89 - +
  • [8] Diphone-based concatenative speech synthesis system for Mongolian
    Davaatsagaan, Munkhtuya
    Paliwal, Kuldip K.
    IMECS 2008: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2008, : 276 - 279
  • [9] Triphone based unit selection for concatenative visual speech synthesis
    Huang, FJ
    Cosatto, E
    Graf, HP
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 2037 - 2040
  • [10] LSM-based unit pruning for concatenative speech synthesis
    Bellegarda, Jerome R.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 521 - 524