ACOUSTIC MODELING WITH NEURAL GRAPH EMBEDDINGS

被引:0
|
作者
Liu, Yuzong [1 ]
Kirchhoff, Katrin [1 ]
机构
[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
关键词
Acoustic modeling; deep neural networks; graph-based learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph-based learning (GBL) is a form of semi-supervised learning that has been successfully exploited in acoustic modeling in the past. It utilizes manifold information in speech data that is represented as a joint similarity graph over training and test samples. Typically, GBL is used at the output level of an acoustic classifier; however, this setup is difficult to scale to large data sets, and the graph-based learner is not optimized jointly with other components of the speech recognition system. In this paper we explore a different approach where the similarity graph is first embedded into continuous space using a neural autoencoder. Features derived from this encoding are then used at the input level to a standard DNN-based speech recognizer. We demonstrate improved scalability and performance compared to the standard GBL approach as well as significant improvements in word error rate on a medium-vocabulary Switchboard task.
引用
收藏
页码:581 / 588
页数:8
相关论文
共 50 条
  • [1] Heterogeneous graph neural networks with denoising for graph embeddings
    Dong, Xinrui
    Zhang, Yijia
    Pang, Kuo
    Chen, Fei
    Lu, Mingyu
    KNOWLEDGE-BASED SYSTEMS, 2022, 238
  • [2] Novel Front-End Features Based on Neural Graph Embeddings for DNN-HMM and LSTM-CTC Acoustic Modeling
    Liu, Yuzong
    Kirchhoff, Katrin
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 793 - 797
  • [3] Improving Neural Entity Disambiguation with Graph Embeddings
    Sevgili, Oezge
    Panchenko, Alexander
    Biemann, Chris
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, 2019, : 315 - 322
  • [4] Incorporating Knowledge Graph Embeddings into Topic Modeling
    Yao, Liang
    Zhang, Yin
    Wei, Baogang
    Jin, Zhe
    Zhang, Rui
    Zhang, Yangyang
    Chen, Qinfei
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3119 - 3126
  • [5] EXPLORING THE USE OF ACOUSTIC EMBEDDINGS IN NEURAL MACHINE TRANSLATION
    Deena, Salil
    Ng, Raymond W. M.
    Madhyastha, Pranava
    Specia, Lucia
    Hain, Thomas
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 450 - 457
  • [6] Modeling Order in Neural Word Embeddings at Scale
    Trask, Andrew
    Gilmore, David
    Russell, Matthew
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 2266 - 2275
  • [7] GROVE: Ownership Verification of Graph Neural Networks using Embeddings
    Waheed, Asim
    Duddu, Vasisht
    Asokan, N.
    45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024, 2024, : 2460 - 2477
  • [8] Smooth Variational Graph Embeddings for Efficient Neural Architecture Search
    Lukasik, Jovita
    Friede, David
    Zela, Arber
    Hutter, Frank
    Keuper, Margret
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] Learning semantic program embeddings with graph interval neural network
    Wang, Yu
    Wang, Ke
    Gao, Fengjuan
    Wang, Linzhang
    1600, Association for Computing Machinery (04):
  • [10] Learning Semantic Program Embeddings with Graph Interval Neural Network
    Wang, Yu
    Wang, Ke
    Gao, Fengjuan
    Wang, Linzhang
    PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2020, 4