Vocalization Recognition of People with Profound Intellectual and Multiple Disabilities (PIMD) Using Machine Learning Algorithms

被引:3
|
作者
Jesko, Waldemar [1 ]
机构
[1] Poznan Supercomp & Networking Ctr, Poznan, Poland
来源
基金
欧盟地平线“2020”;
关键词
speech recognition; vocalization recognition; PIMD; Profound Intellectual and Multiple Disabilities; AI; CLASSIFICATION;
D O I
10.21437/Interspeech.2021-1239
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
We investigate vocalization recognition for people with Profound Intellectual and Multiple Disabilities using various machine learning algorithms. The amount of training data available for people with PIMD is typically significantly limited. Due to this fact, data augmentation process was used. Various types of Machine Learning algorithms were tested: k-NN, NB, DT, RDF, MLP and LSTM. During research we also tested various regularization techniques to improve recognition performance. The best results were obtained in case of MLP network with dropout and batch normalization: 90%.
引用
收藏
页码:2921 / 2925
页数:5
相关论文
共 50 条