Language-identification using language-dependent phonemes and language-independent speech units

被引:0
|
作者
Dalsgaard, P
Andersen, O
Hesselager, H
Petek, B
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper reports on results from ongoing research on language-identification (LID) performed on the three languages: American-English, German and Spanish. The speech material used is from the Oregon Graduate Institute Spontaneous Telephone Speech Corpus, OGI_TS. The baseline LID-system consists of three parallel phoneme recognisers each of which are followed by three language modelling modules each characterising the bigram probabilities. The phoneme models used are derived on the basis of the combined speech corpus comprising the three languages. The phonemes are handled differently in analysis performed in two experiments. In the first experiment they are trained and tested language-specifically. In the second, they are separated into a number of groups, one of which contains those language-independent speech units which are similar enough to be equated across the training languages, the remaining containing the non-combinable language-dependent phonemes for each of the languages. A data-driven technique has been devised to separate the speech sounds contained within the training corpus into these groups. In order to prepare for an optimal separation between the input classes, a linear discriminant analysis is performed on the training speech material. Results from a number of experiments show that average language-identification scores of close to 90% fan be retained by the LID-system presented hem even for a high number of language-independent speech units.
引用
收藏
页码:1808 / 1811
页数:4
相关论文
共 50 条
  • [31] The Development of Language-Specific and Language-Independent Talker Processing
    Levi, Susannah V.
    Schwartz, Richard G.
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2013, 56 (03): : 913 - 920
  • [32] Co-occurrence statistics as a language-dependent cue for speech segmentation
    Saksida, Amanda
    Langus, Alan
    Nespor, Marina
    [J]. DEVELOPMENTAL SCIENCE, 2017, 20 (03)
  • [33] The development of the Language-Independent Speech in Noise and Reverberation test (LISiNaR) and evaluation in listeners with English as a second language
    Cameron, Sharon
    Boyle, Christian
    Dillon, Harvey
    [J]. INTERNATIONAL JOURNAL OF AUDIOLOGY, 2023, 62 (08) : 756 - 766
  • [34] DegExt: a language-independent keyphrase extractor
    Marina Litvak
    Mark Last
    Abraham Kandel
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2013, 4 : 377 - 387
  • [35] Language-dependent recall of autobiographical memories
    Marian, V
    Neisser, U
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2000, 129 (03) : 361 - 368
  • [36] Speaker-and language-independent speech recognition in mobile communication systems
    Viikki, I
    Kiss, I
    Tian, J
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 5 - 8
  • [37] Language-independent talker-specificity in bilingual speech intelligibility: Individual traits persist across first-language and second-language speech
    Bradlow, Ann R.
    Blasingame, Michael
    Lee, Kyounghee
    [J]. LABORATORY PHONOLOGY, 2018, 9 (01):
  • [38] Types and classes: A language-independent view
    DSouza, D
    [J]. JOURNAL OF OBJECT-ORIENTED PROGRAMMING, 1997, 10 (01): : 10 - 13
  • [39] DegExt: a language-independent keyphrase extractor
    Litvak, Marina
    Last, Mark
    Kandel, Abraham
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2013, 4 (03) : 377 - 387
  • [40] Language-independent hyperparameter optimization based speech emotion recognition system
    Thakur A.
    Dhull S.K.
    [J]. International Journal of Information Technology, 2022, 14 (7) : 3691 - 3699