Language-identification using language-dependent phonemes and language-independent speech units

Authors
Dalsgaard, P
Andersen, O
Hesselager, H
Petek, B
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
This paper reports results from ongoing research on language identification (LID) for three languages: American English, German and Spanish. The speech material is taken from the Oregon Graduate Institute Spontaneous Telephone Speech Corpus, OGI_TS. The baseline LID system consists of three parallel phoneme recognisers, each of which is followed by three language modelling modules, each characterising bigram probabilities. The phoneme models are derived from the combined speech corpus comprising the three languages. The phonemes are handled differently in two experiments. In the first experiment they are trained and tested language-specifically. In the second, they are separated into a number of groups: one group contains the language-independent speech units that are similar enough to be equated across the training languages, and the remaining groups contain the non-combinable language-dependent phonemes for each of the languages. A data-driven technique has been devised to separate the speech sounds in the training corpus into these groups. To prepare for an optimal separation between the input classes, a linear discriminant analysis is performed on the training speech material. Results from a number of experiments show that average language-identification scores of close to 90% can be retained by the LID system presented here, even for a high number of language-independent speech units.
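The bigram language-modelling step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single already-decoded phoneme sequence rather than three parallel recognisers, uses add-one smoothing (the abstract does not state a smoothing scheme), and the toy phoneme strings and the names `train_bigram`, `score` and `identify` are hypothetical.

```python
import math
from collections import defaultdict


def train_bigram(sequences, vocab):
    """Estimate add-one-smoothed bigram log-probabilities (assumed smoothing)."""
    counts = defaultdict(lambda: defaultdict(int))
    for seq in sequences:
        for prev, cur in zip(seq, seq[1:]):
            counts[prev][cur] += 1
    size = len(vocab)
    logp = {}
    for prev in vocab:
        total = sum(counts[prev].values())
        for cur in vocab:
            logp[(prev, cur)] = math.log((counts[prev][cur] + 1) / (total + size))
    return logp


def score(seq, logp):
    """Log-likelihood of a phoneme sequence under one bigram model."""
    return sum(logp[(prev, cur)] for prev, cur in zip(seq, seq[1:]))


def identify(seq, models):
    """Return the language whose bigram model scores the sequence highest."""
    return max(models, key=lambda lang: score(seq, models[lang]))


# Hypothetical toy training data, not taken from OGI_TS.
training = {
    "english": [["dh", "ax", "k", "ae", "t"], ["dh", "ax", "d", "ao", "g"]],
    "spanish": [["e", "l", "g", "a", "t", "o"], ["e", "l", "p", "e", "r", "o"]],
}
# Shared symbol inventory, so every model can score every sequence.
vocab = sorted({p for seqs in training.values() for s in seqs for p in s})
models = {lang: train_bigram(seqs, vocab) for lang, seqs in training.items()}

guess = identify(["dh", "ax", "k", "ao", "g"], models)
```

Sharing one symbol inventory across languages loosely mirrors the idea of language-independent speech units: the models differ only in their bigram statistics, not in the symbols they can score.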
Pages: 1808 / 1811
Page count: 4