An efficient phonotactic-acoustic system for language identification

Authors
Navratil, J [1]
Zuhlke, W [1]
Affiliations
[1] Tech Univ Ilmenau, Dept Commun & Measurement, D-98684 Ilmenau, Germany
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper presents a combined two-component system for language identification based on phonotactic and acoustic features. The phonotactic part, consisting of a multilingual phone recognizer with a double bigram-decoding architecture and a phonetic-context mapping, is supported by a second part that models the pronunciation of the recognized phone sequence using Gaussian density models. The outputs of both parts are post-processed by a neural-network-based final classifier. Measured on the NIST'95 evaluation set, the described system outperforms state-of-the-art components and, at the same time, requires considerably less computation than implicit phonotactic-acoustic modeling and parallel recognizer architectures.
Pages: 781-784
Page count: 4
Related papers
  • [1] Integrating acoustic, prosodic and phonotactic features for spoken language identification
    Tong, Rong
    Ma, Bin
    Zhu, Donglai
    Li, Haizhou
    Chng, Eng Siong
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006: 205-208
  • [2] Fusion of Contrastive Acoustic Models for Parallel Phonotactic Spoken Language Identification
    Sim, Khe Chai
    Li, Haizhou
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007: 541-544
  • [3] Phonotactic language identification for singing
    Kruspe, Anna M.
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016: 3319-3323
  • [4] Improving Phonotactic Language Recognition with Acoustic Adaptation
    Shen, Wade
    Reynolds, Douglas
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007: 2105-2108
  • [5] LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification
    Liu, Hexin
    Perera, Leibny Paola Garcia
    Khong, Andy W. H.
    Styles, Suzy J.
    Khudanpur, Sanjeev
    INTERSPEECH 2022, 2022: 2233-2237
  • [6] BAYESIAN PHONOTACTIC LANGUAGE MODEL FOR ACOUSTIC UNIT DISCOVERY
    Ondel, Lucas
    Burget, Lukas
    Cernocky, Jan
    Kesiraju, Santosh
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017: 5750-5754
  • [7] FRAME-BASED PHONOTACTIC LANGUAGE IDENTIFICATION
    Han, Kyu
    Pelecanos, Jason
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012: 303-306
  • [8] Fusion of phonotactic and prosodic knowledge for language identification
    Lin, Chi-Yueh
    Wang, Hsiao-Chuan
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006: 425-428
  • [9] PARALLEL ACOUSTIC MODEL ADAPTATION FOR IMPROVING PHONOTACTIC LANGUAGE RECOGNITION
    Leung, Cheung-Chi
    Ma, Bin
    Li, Haizhou
    ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010: 246-250
  • [10] Improved phonotactic language identification using random forest language models
    Wang, XiaoRui
    Wang, ShiJin
    Liang, JiaEn
    Xu, Bo
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008: 4237-4240