DNN-based grapheme-to-phoneme conversion for Arabic text-to-speech synthesis

被引:8
|
作者
Ali, Ikbel Hadj [1 ]
Mnasri, Zied [1 ]
Lachiri, Zied [1 ]
机构
[1] Univ Tunis El Manar, Elect Engn Dept, Signal Image & Technol Informat Lab, Ecole Natl Ingenieurs Tunis, Tunis, Tunisia
关键词
Arabic text-to-speech synthesis; Deep neural networks (DNN); Grapheme-to-phoneme conversion; Diacritic signs; Gemination; SYNTHESIS SYSTEM; SELECTION;
D O I
10.1007/s10772-020-09750-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Arabic text-to-speech synthesis from non-diacritized text is still a big challenge, because of unique Arabic language rules and characteristics. Indeed, the diacritic and gemination signs, which are special characters representing respectively short vowels and consonant doubling, have a major effect on accurate pronunciation of Arabic. However these signs are often not mentioned in written texts, since most of Arab readers are used to guess them from the context. To tackle this issue, this paper presents a grapheme-to-phoneme conversion system for Arabic, which constitutes the text processing module of a deep neural networks (DNN)-based Arabic TTS systems. In the case of Arabic text, this step starts with predicting the diacritic and gemination signs. In this work, this step was fully realized based on DNN. Finally, the grapheme-to-phoneme conversion of the diacritized text was achieved using the Buckwalter code. In comparison to state-of-the-art approaches, the proposed system gives a higher accuracy rate either for all phonemes or for each class, and high precision, recall and F1 score for each class of diacritic signs.
引用
收藏
页码:569 / 584
页数:16
相关论文
共 50 条
  • [41] Neural Machine Translation for Multilingual Grapheme-to-Phoneme Conversion
    Sokolov, Alex
    Rohlin, Tracy
    Rastrow, Ariya
    [J]. INTERSPEECH 2019, 2019, : 2065 - 2069
  • [42] Joint-sequence models for grapheme-to-phoneme conversion
    Bisani, Maximilian
    Ney, Hermann
    [J]. SPEECH COMMUNICATION, 2008, 50 (05) : 434 - 451
  • [43] A linguistically motivated approach to grapheme-to-phoneme conversion for Korean
    Yoon, Kyuchul
    Brew, Chris
    [J]. COMPUTER SPEECH AND LANGUAGE, 2006, 20 (04): : 357 - 381
  • [44] Automated grapheme-to-phoneme conversion for Central Kurdish based on optimality theory
    Mahmudi, Aso
    Veisi, Hadi
    [J]. COMPUTER SPEECH AND LANGUAGE, 2021, 70
  • [45] T5G2P: Text-to-Text Transfer Transformer Based Grapheme-to-Phoneme Conversion
    Rezackova, Marketa
    Tihelka, Daniel
    Matousek, Jindrich
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3466 - 3476
  • [46] NEURAL GRAPHEME-TO-PHONEME CONVERSION WITH PRE-TRAINED GRAPHEME MODELS
    Dong, Lu
    Guo, Zhi-Qiang
    Tan, Chao-Hong
    Hu, Ya-Jun
    Jiang, Yuan
    Ling, Zhen-Hua
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6202 - 6206
  • [47] Grapheme-to-Phoneme Conversion using Conditional Random Fields
    Illina, Irina
    Fohr, Dominique
    Jouvet, Denis
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2324 - 2327
  • [48] PART-OF-SPEECH MODELS COMPRESSION METHODS FOR ON-DEVICE GRAPHEME-TO-PHONEME CONVERSION
    Kubis, Marek
    Meloux, Maxime
    Skorzewski, Pawel
    Lewandowski, Marcin
    Jho, Gunu
    Park, Hyoungmin
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7117 - 7121
  • [49] Incorporating syllabification points into a model of grapheme-to-phoneme conversion
    Suyanto Suyanto
    [J]. International Journal of Speech Technology, 2019, 22 : 459 - 470
  • [50] Incorporating syllabification points into a model of grapheme-to-phoneme conversion
    Suyanto, Suyanto
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (02) : 459 - 470