Exploiting foreign resources for DNN-based ASR

被引:9
|
作者
Motlicek, Petr [1 ]
Imseng, David [1 ]
Potard, Blaise [1 ]
Garner, Philip N. [1 ]
Himawan, Ivan [1 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
关键词
Automatic speech recognition; Deep learning for speech; Acoustic model adaptation; Semi-supervised training; SPEECH; ALGORITHM; FEATURES;
D O I
10.1186/s13636-015-0058-5
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Manual transcription of audio databases for the development of automatic speech recognition (ASR) systems is a costly and time-consuming process. In the context of deriving acoustic models adapted to a specific application, or in low-resource scenarios, it is therefore essential to explore alternatives capable of improving speech recognition results. In this paper, we investigate the relevance of foreign data characteristics, in particular domain and language, when using this data as an auxiliary data source for training ASR acoustic models based on deep neural networks (DNNs). The acoustic models are evaluated on a challenging bilingual database within the scope of the MediaParl project. Experimental results suggest that in-language (but out-of-domain) data is more beneficial than in-domain (but out-of-language) data when employed in either supervised or semi-supervised training of DNNs. The best performing ASR system, an HMM/GMM acoustic model that exploits DNN as a discriminatively trained feature extractor outperforms the best performing HMM/DNN hybrid by about 5 % relative (in terms of WER). An accumulated relative gain with respect to the MFCC-HMM/GMM baseline is about 30 % WER.
引用
下载
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [21] Attacking DNN-based Intrusion Detection Models
    Zhang, Xingwei
    Zheng, Xiaolong
    Wu, Desheng Dash
    IFAC PAPERSONLINE, 2020, 53 (05): : 415 - 419
  • [22] Integration of DNN based Speech Enhancement and ASR
    Astudillo, Ramon F.
    Correia, Joana
    Trancoso, Isabel
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3576 - 3580
  • [23] Threats of Adversarial Attacks in DNN-Based Modulation Recognition
    Lin, Yun
    Zhao, Haojun
    Tu, Ya
    Mao, Shiwen
    Dou, Zheng
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2020, : 2469 - 2478
  • [24] A DNN-based Post Filter for Geometric Source Separation
    Chen, Chenghao
    Zhou, Yi
    Liu, Hongqing
    2018 INTERNATIONAL SEMINAR ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (SCSET 2018), 2019, 1176
  • [25] Unsupervised Domain Adaptation for DNN-based Automated Harvesting
    Shkanaev, Aleksandr Yu
    Sholomov, Dmitry L.
    Nikolaev, Dmitry P.
    TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [26] A DNN-based emotional speech synthesis by speaker adaptation
    Yang, Hongwu
    Zhang, Weizhao
    Zhi, Pengpeng
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 633 - 637
  • [27] A DNN-based semantic segmentation for detecting weed and crop
    You, Jie
    Liu, Wei
    Lee, Joonwhoan
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 178
  • [28] DNN-based Indoor Fingerprinting Localization with WiFi FTM
    Eberechukwu, Paulson
    Park, Hyunwoo
    Laoudias, Christos
    Horsmanheimo, Seppo
    Kim, Sunwoo
    2022 23RD IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2022), 2022, : 367 - 371
  • [29] Analyzing Decision Polygons of DNN-based Classification Methods
    Kim, Jongyoung
    Woo, Seongyoun
    Lee, Wonjun
    Kim, Donghwan
    Lee, Chulhee
    ICINCO: PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, 2020, : 346 - 351
  • [30] DNN-Based PolSAR Image Classification on Noisy Labels
    Ni, Jun
    Xiang, Deliang
    Lin, Zhiyuan
    Lopez-Martinez, Carlos
    Hu, Wei
    Zhang, Fan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 3697 - 3713