Predicting Pronunciations with Syllabification and Stress with Recurrent Neural Networks

被引:11
|
作者
van Esch, Daan [1 ]
Chua, Mason [1 ]
Rao, Kanishka [1 ]
机构
[1] Google Inc, Mountain View, CA 94043 USA
来源
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年
关键词
LSTM; pronunciation; syllabification; stress;
D O I
10.21437/Interspeech.2016-1419
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Word pronunciations, consisting of phoneme sequences and the associated syllabification and stress patterns, are vital for both speech recognition and text-to-speech (TTS) systems. For speech recognition phoneme sequences for words may be learned from audio data. We train recurrent neural network (RNN) based models to predict the syllabification and stress pattern for such pronunciations making them usable for TTS. We find these RNN models significantly outperform naive rule based models for almost all languages we tested. Further, we find additional improvements to the stress prediction model by using the spelling as features in addition to the phoneme sequence. Finally, we train a single RNN model to predict the phoneme sequence, syllabification and stress for a given word. For several languages, this single RNN outperforms similar models trained specifically for either phoneme sequence or stress prediction. We report an exhaustive comparison of these approaches for twenty languages.
引用
收藏
页码:2841 / 2845
页数:5
相关论文
共 50 条
  • [11] Predicting Commentaries on a Financial Report with Recurrent Neural Networks
    El Mokhtari, Karim
    Maidens, John
    Bener, Ayse
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 531 - 542
  • [12] STRESS AND SYLLABIFICATION
    MCCARTHY, JJ
    LINGUISTIC INQUIRY, 1979, 10 (03) : 443 - 465
  • [13] Predicting Outcomes of the Court of Cassation of Turkey with Recurrent Neural Networks
    Ozturk, Ceyhun E.
    Ozcelik, S. Bari
    Koc, Aykut
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [14] Online Predicting Conformance of Business Process with Recurrent Neural Networks
    Wang, Jiaojiao
    Yu, Dingguo
    Ma, Xiaoyu
    Liu, Chang
    Chang, Victor
    Shen, Xuewen
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS), 2020, : 88 - 100
  • [15] Predicting Activities in Business Processes with LSTM Recurrent Neural Networks
    Tello-Leal, Edgar
    Roa, Jorge
    Rubiolo, Mariano
    Ramirez-Alcocer, Ulises M.
    2018 ITU KALEIDOSCOPE: MACHINE LEARNING FOR A 5G FUTURE (ITU K), 2018,
  • [16] Predicting ALICE Grid throughput using recurrent neural networks
    Popa, Mircea
    Grigoras, Costin
    Vallecorsa, Sofia
    20TH INTERNATIONAL WORKSHOP ON ADVANCED COMPUTING AND ANALYSIS TECHNIQUES IN PHYSICS RESEARCH, 2023, 2438
  • [17] Predicting bike sharing demand using recurrent neural networks
    Pan, Yan
    Zheng, Ray Chen
    Zhang, Jiaxi
    Yao, Xin
    2018 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS, 2019, 147 : 562 - 566
  • [18] Predicting Stock Market Trends by Recurrent Deep Neural Networks
    Yoshihara, Akira
    Fujikawa, Kazuki
    Seki, Kazuhiro
    Uehara, Kuniaki
    PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 759 - 769
  • [19] Predicting Temporal Activation Patterns via Recurrent Neural Networks
    Manco, Giuseppe
    Pirro, Giuseppe
    Ritacco, Ettore
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2018), 2018, 11177 : 347 - 356
  • [20] Predicting chaotic time series by boosted recurrent neural networks
    Assaad, Mohammad
    Bone, Romuald
    Cardot, Hubert
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 831 - 840