Multi-lingual phoneme recognition exploiting acoustic-phonetic similarities of sounds

Author
Kohler, J
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
The aim of this work is to exploit the acoustic-phonetic similarities between several languages. In recent work, cross-language HMM-based phoneme models have been used only for bootstrapping the language-dependent models, and the multi-lingual approach has been investigated only on very small speech corpora. In this paper, we introduce a statistical distance measure to determine the similarities of sounds. Further, we present a new technique to model multi-lingual phonemes. The experiments are conducted with the OGI Multi-Language Telephone Speech Corpus for the languages American English, German and Spanish. In the first experiment, phoneme recognition rates between 39.0% and 53.9% are achieved using language-dependent models. Using cross-language models yields an improvement for some phonemes, but on average a degradation of recognition performance is observed. However, cross-language models speed up the cross-language transfer and reduce the size of the phoneme inventory of multi-lingual speech recognition systems. Finally, a new method of modelling multi-lingual phonemes, which can be used for a variety of languages, is presented. This technique reduces the number of phoneme-based units in a multi-lingual speech recognition system.
Pages: 2195 - 2198
Number of pages: 4