Speech Clarity Index (Ψ): A Distance-Based Speech Quality Indicator and Recognition Rate Prediction for Dysarthric Speakers with Cerebral Palsy

被引:1
|
作者
Kayasith, Prakasith [1 ]
Theeramunkong, Thanaruk [1 ]
机构
[1] Thammasat Univ, Informat & Comp Technol Sch, Sirindhorn Int Inst Technol, Bangkok, Thailand
关键词
speech disorder; dysarthric speech recognition; speech assessment; speech quality index; recognition rate prediction;
D O I
10.1587/transinf.E92.D.460
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is a tedious and subjective task to measure severity of a dysarthria by manually evaluating his/her speech using available standard assessment methods based on human perception. This paper presents an automated approach to assess speech quality of a dysarthric speaker with cerebral palsy. With the consideration of two complementary factors, speech consistency and speech distinction, a speech quality indicator called speech clarity index (Psi) is proposed as a measure of the speaker's ability to produce consistent speech signal for a certain word and distinguished speech signal for different words. As an application, it can be used to assess speech quality and forecast speech recognition rate of speech made by an individual dysarthric speaker before actual exhaustive implementation of an automatic speech recognition system for the speaker. The effectiveness of Psi as a speech recognition rate predictor is evaluated by rank-order inconsistency, correlation coefficient, and root-mean-square of difference. The evaluations had been done by comparing its predicted recognition rates with one predicted by the standard methods called the articulatory and intelligibility tests based on the two recognition systems (HMM and ANN). The results show that Psi is a promising indicator for predicting recognition rate of dysarthric speech. All experiments had been done on speech corpus composed of speech data from eight normal speakers and eight dysarthric speakers.
引用
收藏
页码:460 / 468
页数:9
相关论文
共 15 条