Neural Network Control Interface of the Speaker Dependent Computer System "Deep Interactive Voice Assistant DIVA" to Help People with Speech Impairments

被引:1
|
作者
Khorosheva, Tatiana [1 ]
Novoseltseva, Marina [1 ]
Geidarov, Nazim [1 ]
Krivosheev, Nikolay [1 ]
Chernenko, Sergey [1 ]
机构
[1] Kemerovo State Univ, Kemerovo, Russia
关键词
Voice interface technology; Speech recognition technology; Assistive technologies; Neural network; Multilayer perceptron; Pattern recognition; Associative memory;
D O I
10.1007/978-3-030-01818-4_44
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of modern informational communication systems, voice control interface and speech recognition systems find application in various fields of activity. One application of such systems is for people with special needs who have speech impairments, and thus find using speech-dependent voice interfaces challenging. Our research team is developing a speaker dependent computer system "Deep Interactive Voice Assistant" (DIVA), which allows recognizing an arbitrary set of commands to control the computing system. The article presents the results of testing various artificial neural networks to train the machine to recognize vocal inputs. We examine such architectures as associative memory, multilayer perceptron and convolutional network. The research justifies the use of multilayer perceptron for the speaker dependent computer system DIVA as a training solution that demonstrated high results on a small selection. DIVA will be implemented in voice-user interface of such systems as "Smart House", mobile applications and IT-based assistive systems.
引用
收藏
页码:444 / 452
页数:9
相关论文
共 7 条
  • [1] A Deep Neural Network Speaker Verification System Targeting Microphone Speech
    Lei, Yun
    Ferrer, Luciana
    McLaren, Mitchell
    Scheffer, Nicolas
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 681 - 685
  • [2] ADAPTATION OF AN EXPRESSIVE SINGLE SPEAKER DEEP NEURAL NETWORK SPEECH SYNTHESIS SYSTEM
    Parker, Jonathan
    Stylianou, Yannis
    Cipolla, Roberto
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5309 - 5313
  • [3] A UNIFIED SPEAKER-DEPENDENT SPEECH SEPARATION AND ENHANCEMENT SYSTEM BASED ON DEEP NEURAL NETWORKS
    Gao, Tian
    Du, Jun
    Xu, Li
    Liu, Cong
    Dai, Li-Rong
    Lee, Chin-Hui
    [J]. 2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 687 - 691
  • [4] Neural network speaker dependent isolated malay speech recognition system: Handcrafted vs genetic algorithm
    Salam, MSH
    Mohamad, D
    Salleh, SHS
    [J]. ISSPA 2001: SIXTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2001, : 731 - 734
  • [5] Dual-input Control Interface for Deep Neural Network Based on Image/Speech Recognition
    Pai, Neng-Sheng
    Chen, Yi-Hsun
    Hung, Chin-Pao
    Chen, Pi-Yun
    Kuo, Ying-Che
    Chen, Jun-Yu
    [J]. SENSORS AND MATERIALS, 2019, 31 (11) : 3451 - 3463
  • [6] KL-divergence Regularized Deep Neural Network Adaptation for Low-resource Speaker-dependent Speech Enhancement
    Chai, Li
    Du, Jun
    Lee, Chin-Hui
    [J]. INTERSPEECH 2019, 2019, : 1806 - 1810
  • [7] Brain-Computer Interface Using Deep Neural Network and Its Application to Mobile Robot Control
    Huve, Gauvain
    Takahashi, Kazuhiko
    Hashimoto, Masafumi
    [J]. 2018 IEEE 15TH INTERNATIONAL WORKSHOP ON ADVANCED MOTION CONTROL (AMC), 2018, : 169 - 174