Articulatory Movement Prediction Using Deep Bidirectional Long Short-Term Memory Based Recurrent Neural Networks and Word/Phone Embeddings

被引:0
|
作者
Zhu, Pengcheng [1 ]
Xie, Lei [1 ,2 ]
Chen, Yunlin [1 ]
机构
[1] Northwestern Polytech Univ, Sch Software & Microelect, Xian, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
关键词
articulatory movement predictions; articulatory inversion; long short term memory (LSTM); word2vec; recurrent neural network (RNN); SPEECH; ACOUSTICS;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic prediction of articulatory movements from speech or text can be beneficial for many applications such as speech recognition and synthesis. A recent approach has reported state-of-the-art performance in speech-to-articulatory prediction using feed forward neural networks. In this paper, we investigate the feasibility of using bidirectional long short-term memory based recurrent neural networks (BLSTM-RNNs) in articulatory movement prediction because they have long-context trajectory modeling ability. We show on the MNGU0 dataset that BLSTM-RNN apparently outperforms feed forward networks and pushes the state-of-the-art RMSE from 0.885 mm to 0.565 mm. On the other hand, predicting articulatory information from text heavily relies on handcrafted linguistic and prosodic features, e.g., POS and TOBI labels. In this paper, we propose to use word and phone embeddings to substitute these manual features. Word/phone embedding features are automatically learned from unlabeled text data by a neural network language model. We show that word and phone embeddings can achieve comparable performance without using POS and TOBI features. More promisingly, combining the conventional full feature set with phone embedding, the lowest RMSE is achieved.
引用
收藏
页码:2192 / 2196
页数:5
相关论文
共 50 条
  • [1] Twitter Bot Detection Using Bidirectional Long Short-term Memory Neural Networks and Word Embeddings
    Wei, Feng
    Uyen Trang Nguyen
    [J]. 2019 FIRST IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS (TPS-ISA 2019), 2019, : 101 - 109
  • [2] VOICE CONVERSION USING DEEP BIDIRECTIONAL LONG SHORT-TERM MEMORY BASED RECURRENT NEURAL NETWORKS
    Sun, Lifa
    Kang, Shiyin
    Li, Kun
    Meng, Helen
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4869 - 4873
  • [3] Improving protein disorder prediction by deep bidirectional long short-term memory recurrent neural networks
    Hanson, Jack
    Yang, Yuedong
    Paliwal, Kuldip
    Zhou, Yaoqi
    [J]. BIOINFORMATICS, 2017, 33 (05) : 685 - 692
  • [4] Multimodal Dimensional Affect Recognition Using Deep Bidirectional Long Short-Term Memory Recurrent Neural Networks
    Pei, Ercheng
    Yang, Le
    Jiang, Dongmei
    Sahli, Hichem
    [J]. 2015 INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2015, : 208 - 214
  • [5] A Novel Word Spotting Algorithm Using Bidirectional Long Short-Term Memory Neural Networks
    Frinken, Volkmar
    Fischer, Andreas
    Bunke, Horst
    [J]. ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, PROCEEDINGS, 2010, 5998 : 185 - 196
  • [6] BIDIRECTIONAL QUATERNION LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORKS FOR SPEECH RECOGNITION
    Parcollet, Titouan
    Morchid, Mohamed
    Linares, Georges
    De Mori, Renato
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8519 - 8523
  • [7] Short-Term Traffic Prediction Using Long Short-Term Memory Neural Networks
    Abbas, Zainab
    Al-Shishtawy, Ahmad
    Girdzijauskas, Sarunas
    Vlassov, Vladimir
    [J]. 2018 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS), 2018, : 57 - 65
  • [8] Long short-term memory-based deep recurrent neural networks for target tracking
    Gao, Chang
    Yan, Junkun
    Zhou, Shenghua
    Varshney, Pramod K.
    Liu, Hongwei
    [J]. INFORMATION SCIENCES, 2019, 502 : 279 - 296
  • [9] Session Based Recommendations Using Recurrent Neural Networks - Long Short-Term Memory
    Dobrovolny, Michal
    Selamat, Ali
    Krejcar, Ondrej
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2021, 2021, 12672 : 53 - 65
  • [10] Comparing Long Short-Term Memory (LSTM) and bidirectional LSTM deep neural networks for power consumption prediction
    da Silva, Davi Guimaraes
    Meneses, Anderson Alvarenga de Moura
    [J]. ENERGY REPORTS, 2023, 10 : 3315 - 3334