A Feature Normalisation Technique for PLLR based Language Identification Systems

被引:5
|
作者
Fernando, Sarith [1 ,2 ]
Sethu, Vidhyasaharan [1 ]
Ambikairajah, Eliathamby [1 ,2 ]
机构
[1] UNSW Australia, Sch Elect Engn & Telecommun, Kensington, NSW, Australia
[2] NICTA, ATP Res Lab, Sydney, NSW, Australia
关键词
Spoken Language Recognition; Phone Log Likelihood Ratios; Feature Transformation; Statistical Normalisation; Gaussian PLDA; i-Vectors;
D O I
10.21437/Interspeech.2016-560
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Phone log-likelihood ratio (PLLR) features have been shown to be effective in language identification systems. However, PLLR feature distributions are bounded and this may contradict assumptions of Gaussianity and consequently lead to reduced language recognition rates. In this paper, we propose a feature normalisation technique for the PLLR feature space and demonstrate that it can outperform conventional normalisation and decorrelation techniques such as mean-variance normalisation, feature warping, discrete cosine transform and principal component analysis. Experimental results on the NIST LRE 2007 and the NIST LRE 2015 databases show that the proposed method outperforms other normalisation methods by at least 9.3% in terms of %Cavg. Finally, unlike PCA which needs to be estimated from all the training data, the proposed technique can be applied on each utterance independently.
引用
收藏
页码:2925 / 2929
页数:5
相关论文
共 50 条
  • [1] Phonemes Frequency based PLLR Dimensionality Reduction for Language Recognition
    Irtza, Saad
    Sethu, Vidhyasaharan
    Phu Ngoc Le
    Ambikairajah, Eliathamby
    Li, Haizhou
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 997 - 1001
  • [2] Introducing a FM based Feature to Hierarchical Language Identification
    Yin, Bo
    Thiruvaran, Tharmarajah
    Ambikairajah, Eliathamby
    Chen, Fang
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 731 - 734
  • [3] Language Identification Method Based on Fusion Feature MGCC
    Wang, Yankai
    Long, Hua
    Shao, Yubin
    Du, Qingzhi
    Wang, Yao
    [J]. Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (02): : 116 - 121
  • [4] Human identification technique based on iris feature watermarking
    Fan, KF
    Mo, W
    Wang, MH
    Zhao, XH
    Shen, J
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (02) : 251 - 256
  • [5] Normalisation design for delayed singular Markovian jump systems based on system transformation technique
    Zhuang, Guangming
    Xia, Jianwei
    Zhang, Weihai
    Sun, Wei
    Sun, Qun
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2018, 49 (08) : 1603 - 1614
  • [6] Fractional Fourier Transform Based Auditory Feature for Language Identification
    Zhang, Wei-Qiang
    He, Liang
    Hou, Tao
    Liu, Jia
    [J]. 2008 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2008), VOLS 1-4, 2008, : 209 - 212
  • [7] A novel weighting technique for fusing Language Identification systems based on pair-wise performances
    Yin, Bo
    Ambikairajah, Eliathamby
    Chen, Fang
    [J]. 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 408 - 412
  • [8] Centroid-based language identification using letter feature set
    Takci, H
    Sogukpinar, I
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2004, 2945 : 640 - 648
  • [9] A novel weighting technique for combining likelihood scores in language identification systems
    Yin, Bo
    Ambikairajah, Eliathamby
    Chen, Fang
    [J]. 2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 448 - +
  • [10] Feature Analysis for Native Language Identification
    Nisioi, Sergiu
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT I, 2015, 9041 : 644 - 657