Multi-lingual phoneme recognition exploiting acoustic-phonetic similarities of sounds

Authors
Kohler, J
Affiliation
Keywords
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
The aim of this work is to exploit the acoustic-phonetic similarities between several languages. In recent work, cross-language HMM-based phoneme models have been used only for bootstrapping the language-dependent models, and the multi-lingual approach has been investigated only on very small speech corpora. In this paper, we introduce a statistical distance measure to determine the similarities of sounds. Further, we present a new technique to model multi-lingual phonemes. The experiments are conducted with the OGI Multi-Language Telephone Speech Corpus for the languages American English, German and Spanish. In the first experiment, phoneme recognition rates between 39.0% and 53.9% are achieved using language-dependent models. Using cross-language models yields an improvement for some phonemes, but on average a degradation of recognition performance is observed. However, cross-language models speed up the cross-language transfer and reduce the size of the phoneme inventory of multi-lingual speech recognition systems. Finally, a new method of modelling multi-lingual phonemes, which can be used for a variety of languages, is presented. This technique reduces the number of phoneme-based units in a multi-lingual speech recognition system.
Pages: 2195-2198
Page count: 4