Predicting Pronunciations with Syllabification and Stress with Recurrent Neural Networks

被引:11
|
作者
van Esch, Daan [1 ]
Chua, Mason [1 ]
Rao, Kanishka [1 ]
机构
[1] Google Inc, Mountain View, CA 94043 USA
来源
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年
关键词
LSTM; pronunciation; syllabification; stress;
D O I
10.21437/Interspeech.2016-1419
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Word pronunciations, consisting of phoneme sequences and the associated syllabification and stress patterns, are vital for both speech recognition and text-to-speech (TTS) systems. For speech recognition phoneme sequences for words may be learned from audio data. We train recurrent neural network (RNN) based models to predict the syllabification and stress pattern for such pronunciations making them usable for TTS. We find these RNN models significantly outperform naive rule based models for almost all languages we tested. Further, we find additional improvements to the stress prediction model by using the spelling as features in addition to the phoneme sequence. Finally, we train a single RNN model to predict the phoneme sequence, syllabification and stress for a given word. For several languages, this single RNN outperforms similar models trained specifically for either phoneme sequence or stress prediction. We report an exhaustive comparison of these approaches for twenty languages.
引用
收藏
页码:2841 / 2845
页数:5
相关论文
共 50 条
  • [21] Predicting Subjective Sleep Quality Using Recurrent Neural Networks
    Boussard, Julien
    Kochenderfer, Mykel J.
    Zeitzer, Jamie M.
    2019 IEEE SIGNAL PROCESSING IN MEDICINE AND BIOLOGY SYMPOSIUM (SPMB), 2019,
  • [22] Recurrent Neural Networks based on LSTM for Predicting Geomagnetic Field
    Liu, Tong
    Wu, Tailin
    Wang, Meiling
    Fu, Mengyin
    Kang, Jiapeng
    Zhang, Haoyuan
    PROCEEDINGSS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON AEROSPACE ELECTRONICS AND REMOTE SENSING TECHNOLOGY (ICARES 2018), 2018,
  • [23] Predicting Drug-Target Affinity Based on Recurrent Neural Networks and Graph Convolutional Neural Networks
    Tian, Qingyu
    Ding, Mao
    Yang, Hui
    Yue, Caibin
    Zhong, Yue
    Du, Zhenzhen
    Liu, Dayan
    Liu, Jiali
    Deng, Yufeng
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2022, 25 (04) : 634 - 641
  • [24] Neural ordinary differential equations and recurrent neural networks for predicting the state of health of batteries
    Pepe, Simona
    Liu, Jiapeng
    Quattrocchi, Emanuele
    Ciucci, Francesco
    JOURNAL OF ENERGY STORAGE, 2022, 50
  • [25] Predicting human decision making in psychological tasks with recurrent neural networks
    Lin, Baihan
    Bouneffouf, Djallel
    Cecchi, Guillermo
    PLOS ONE, 2022, 17 (05):
  • [26] Predicting Substance Misuse Admission Rates via Recurrent Neural Networks
    Howard, Matthew J.
    Agrawal, Rakshit
    2019 IEEE GLOBAL HUMANITARIAN TECHNOLOGY CONFERENCE (GHTC), 2019, : 691 - 698
  • [27] Quantum Recurrent Neural Networks: Predicting the Dynamics of Oscillatory and Chaotic Systems
    Chen, Yuan
    Khaliq, Abdul
    ALGORITHMS, 2024, 17 (04)
  • [28] Analyzing and Predicting the US Midterm Elections on Twitter with Recurrent Neural Networks
    Lopardo, Antonio
    Brambilla, Marco
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 5389 - 5391
  • [29] Predicting Popularity of Open Source Projects Using Recurrent Neural Networks
    Sahin, Sefa Eren
    Karpat, Kubilay
    Tosun, Ayse
    OPEN SOURCE SYSTEMS, OSS 2019, 2019, 556 : 80 - 90
  • [30] PRNN: Piecewise Recurrent Neural Networks for Predicting the Tendency of Services Invocation
    Lin, Haozhe
    Fan, Yushun
    Zhang, Jia
    Bai, Bing
    2018 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2018), 2018, : 42 - 49