Language accent classification in American English

被引:85
|
作者
Arslan, LM [1 ]
Hansen, JHL [1 ]
机构
[1] DUKE UNIV, DEPT ELECT ENGN, ROBUST SPEECH PROC LAB, DURHAM, NC 27708 USA
关键词
D O I
10.1016/0167-6393(96)00024-6
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
It is well known that speaker variability caused by accent is one factor that degrades performance of speech recognition algorithms. If knowledge of speaker accent can be estimated accurately, then a modified set of recognition models which addresses speaker accent could be employed to increase recognition accuracy. In this study, the problem of language accent classification in American English is considered. A database of foreign language accent is established that consists of words and phrases that are known to be sensitive to accent. Next, isolated word and phoneme based accent classification algorithms are developed. The feature set under consideration includes Mel-cepstrum coefficients and energy, and their first order differences. It is shown that as test utterance length increases, higher classification accuracy is achieved, Isolated word strings of 7-8 words uttered by the speaker results in an accent classification rate of 93% among four different language accents, A subjective listening test is also conducted in order to compare human performance with computer algorithm performance in accent discrimination. The results show that computer based accent classification consistently achieves superior performance over human listener responses for classification. It is shown, however, that some listeners are able to match algorithm performance for accent detection. Finally, an experimental study is performed to investigate the influence of foreign accent on speech recognition algorithms. It is shown that training separate models for each accent rather than using a single model for each word can improve recognition accuracy dramatically.
引用
收藏
页码:353 / 367
页数:15
相关论文
共 50 条