An Analysis of the RNN-Based Spoken Term Detection Training

Cited by: 0
Authors
Svec, Jan [1]
Smidl, Lubos [2]
Psutka, Josef V. [2]
Affiliations
[1] SpeechTech Sro, Plzen, Czech Republic
[2] Univ West Bohemia, Dept Cybernet, Plzen, Czech Republic
Source
Keywords
Spoken term detection; Recurrent neural networks; Siamese neural networks
DOI
10.1007/978-3-319-66429-3_11
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper studies the training process of the recurrent neural networks used in the spoken term detection (STD) task. The method employs two Siamese networks trained jointly on unsupervised data. These networks project the grapheme representation of a searched term and the phoneme realization of a putative hit into a pronunciation embedding space. The score is estimated as the relative distance between these embeddings. The paper studies the influence of different loss functions, the amount of unsupervised data, and the meta-parameters on the performance of the STD system.
Pages: 119-129
Number of pages: 11