Minority Language;
Voice Activity Detection;
GMM-UBM;
Language Identification;
Chinese Loan Words;
D O I:
暂无
中图分类号:
TP39 [计算机的应用];
学科分类号:
081203 ;
0835 ;
摘要:
An approach to language identification of minority language based on GMM-UBM model is described in this paper. In the training stage, a new method of double threshold for voice activity detection is used to effectively remove noise and extract useful voice frames. Then we extract the MFCC feature parameters, and train UBM model and the GMM model of 6 languages; In the testing stage, utterances with different durations and Chinese loan words of six minority languages are selected. We analyze each language identification rate and the results with different duration testing data, and then we give some explanations of error identification in terms of phonetics. We also analyze the impact of Chinese loan words on the results.