Articulatory Movement Prediction Using Deep Bidirectional Long Short-Term Memory Based Recurrent Neural Networks and Word/Phone Embeddings

被引:0
|
作者
Zhu, Pengcheng [1 ]
Xie, Lei [1 ,2 ]
Chen, Yunlin [1 ]
机构
[1] Northwestern Polytech Univ, Sch Software & Microelect, Xian, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
关键词
articulatory movement predictions; articulatory inversion; long short term memory (LSTM); word2vec; recurrent neural network (RNN); SPEECH; ACOUSTICS;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic prediction of articulatory movements from speech or text can be beneficial for many applications such as speech recognition and synthesis. A recent approach has reported state-of-the-art performance in speech-to-articulatory prediction using feed forward neural networks. In this paper, we investigate the feasibility of using bidirectional long short-term memory based recurrent neural networks (BLSTM-RNNs) in articulatory movement prediction because they have long-context trajectory modeling ability. We show on the MNGU0 dataset that BLSTM-RNN apparently outperforms feed forward networks and pushes the state-of-the-art RMSE from 0.885 mm to 0.565 mm. On the other hand, predicting articulatory information from text heavily relies on handcrafted linguistic and prosodic features, e.g., POS and TOBI labels. In this paper, we propose to use word and phone embeddings to substitute these manual features. Word/phone embedding features are automatically learned from unlabeled text data by a neural network language model. We show that word and phone embeddings can achieve comparable performance without using POS and TOBI features. More promisingly, combining the conventional full feature set with phone embedding, the lowest RMSE is achieved.
引用
收藏
页码:2192 / 2196
页数:5
相关论文
共 50 条
  • [41] Accurate estimation of tidal level using bidirectional long short-term memory recurrent neural network
    Bai, Long-Hu
    Xu, Hang
    [J]. OCEAN ENGINEERING, 2021, 235
  • [42] Short-Term Prediction of Wind Power Based on Deep Long Short-Term Memory
    Qu Xiaoyun
    Kang Xiaoning
    Zhang Chao
    Jiang Shuai
    Ma Xiuda
    [J]. 2016 IEEE PES ASIA-PACIFIC POWER AND ENERGY ENGINEERING CONFERENCE (APPEEC), 2016, : 1148 - 1152
  • [43] Prediction of Pathological Tremor Signals Using Long Short-Term Memory Neural Networks
    Pascual-Valdunciel, Alejandro
    Lopo-Martinez, Victor
    Sendra-Arranz, Rafael
    Gonzalez-Sanchez, Miguel
    Perez-Sanchez, Javier Ricardo
    Grandas, Francisco
    Torricelli, Diego
    Moreno, Juan C.
    Oliveira Barroso, Filipe
    Pons, Jose L.
    Gutierrez, Alvaro
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (12) : 5930 - 5941
  • [44] Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks
    Zazo, Ruben
    Lozano-Diez, Alicia
    Gonzalez-Dominguez, Javier
    Toledano, Doroteo T.
    Gonzalez-Rodriguez, Joaquin
    [J]. PLOS ONE, 2016, 11 (01):
  • [45] Chaotic time series prediction based on long short-term memory neural networks
    Xiong YouCheng
    Zhao Hong
    [J]. SCIENTIA SINICA-PHYSICA MECHANICA & ASTRONOMICA, 2019, 49 (12)
  • [46] Prediction Model of User Physical Activity using Data Characteristics-based Long Short-term Memory Recurrent Neural Networks
    Kim, Joo-Chang
    Chung, Kyungyong
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (04) : 2060 - 2077
  • [47] Ego-Vehicle Speed Prediction Using a Long Short-Term Memory Based Recurrent Neural Network
    Yeon, Kyuhwan
    Min, Kyunghan
    Shin, Jaewook
    Sunwoo, Myoungho
    Han, Manbae
    [J]. INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2019, 20 (04) : 713 - 722
  • [48] Ego-Vehicle Speed Prediction Using a Long Short-Term Memory Based Recurrent Neural Network
    Kyuhwan Yeon
    Kyunghan Min
    Jaewook Shin
    Myoungho Sunwoo
    Manbae Han
    [J]. International Journal of Automotive Technology, 2019, 20 : 713 - 722
  • [49] Performance prediction of fuel cells using long short-term memory recurrent neural network
    Zheng, Lu
    Hou, Yongping
    Zhang, Tao
    Pan, Xiangmin
    [J]. INTERNATIONAL JOURNAL OF ENERGY RESEARCH, 2021, 45 (06) : 9141 - 9161
  • [50] Resource Usage Prediction of Cloud Workloads using Deep Bidirectional Long Short Term Memory Networks
    Gupta, Shaifu
    Dinesh, Dileep Aroor
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ADVANCED NETWORKS AND TELECOMMUNICATIONS SYSTEMS (ANTS), 2017,