DNN-based grapheme-to-phoneme conversion for Arabic text-to-speech synthesis

被引:8
|
作者
Ali, Ikbel Hadj [1 ]
Mnasri, Zied [1 ]
Lachiri, Zied [1 ]
机构
[1] Univ Tunis El Manar, Elect Engn Dept, Signal Image & Technol Informat Lab, Ecole Natl Ingenieurs Tunis, Tunis, Tunisia
关键词
Arabic text-to-speech synthesis; Deep neural networks (DNN); Grapheme-to-phoneme conversion; Diacritic signs; Gemination; SYNTHESIS SYSTEM; SELECTION;
D O I
10.1007/s10772-020-09750-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Arabic text-to-speech synthesis from non-diacritized text is still a big challenge, because of unique Arabic language rules and characteristics. Indeed, the diacritic and gemination signs, which are special characters representing respectively short vowels and consonant doubling, have a major effect on accurate pronunciation of Arabic. However these signs are often not mentioned in written texts, since most of Arab readers are used to guess them from the context. To tackle this issue, this paper presents a grapheme-to-phoneme conversion system for Arabic, which constitutes the text processing module of a deep neural networks (DNN)-based Arabic TTS systems. In the case of Arabic text, this step starts with predicting the diacritic and gemination signs. In this work, this step was fully realized based on DNN. Finally, the grapheme-to-phoneme conversion of the diacritized text was achieved using the Buckwalter code. In comparison to state-of-the-art approaches, the proposed system gives a higher accuracy rate either for all phonemes or for each class, and high precision, recall and F1 score for each class of diacritic signs.
引用
收藏
页码:569 / 584
页数:16
相关论文
共 50 条
  • [21] An encoder-decoder based grapheme-to-phoneme converter for Bangla speech synthesis
    Ahmad, Arif
    Selim, Mohammad Reza
    Iqbal, Muhammed Zafar
    Rahman, Mohammad Shahidur
    [J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2019, 40 (06) : 374 - 381
  • [22] Frustratingly Easy Multilingual Grapheme-to-Phoneme Conversion
    Prabhu, Nikhil
    Kann, Katharina
    [J]. 17TH SIGMORPHON WORKSHOP ON COMPUTATIONAL RESEARCH IN PHONETICS PHONOLOGY, AND MORPHOLOGY (SIGMORPHON 2020), 2020, : 123 - 127
  • [23] Grapheme-to-Phoneme Conversion with Convolutional Neural Networks
    Yolchuyeva, Sevinj
    Nemeth, Geza
    Gyires-Toth, Balint
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (06):
  • [24] Grapheme-to-phoneme conversion in Chinese TTS system
    Dong, HH
    Tao, JH
    Xu, B
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 165 - 168
  • [25] Label Embedding for Chinese Grapheme-to-Phoneme Conversion
    Choi, Eunbi
    Kim, Hwa-Yeon
    Kim, Jong-Hwan
    Kim, Jae-Min
    [J]. INTERSPEECH 2021, 2021, : 4094 - 4098
  • [26] Pre-Training of DNN-Based Speech Synthesis Based on Bidirectional Conversion between Text and Speech
    Sone, Kentaro
    Nakashika, Toru
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (08) : 1546 - 1553
  • [27] Learning from Errors in Grapheme-to-Phoneme Conversion
    Polyakova, Tatyana
    Bonafonte, Antonio
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2442 - 2445
  • [28] NARROWADAPTIVE REGULARIZATION OF WEIGHTS FOR GRAPHEME-TO-PHONEME CONVERSION
    Kubo, Keigo
    Sakti, Sakriani
    Neubig, Graham
    Toda, Tomoki
    Nakamura, Satoshi
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [29] Online Discriminative Training for Grapheme-to-Phoneme Conversion
    Jiampojamarn, Sittichai
    Kondrak, Grzegorz
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1307 - 1310
  • [30] Grapheme-to-phoneme Conversion based on Adaptive Regularization of Weight Vectors
    Kubo, Keigo
    Sakti, Sakriani
    Neubig, Graham
    Toda, Tomoki
    Nakamura, Satoshi
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1945 - 1949