Voice command recognition in intelligent systems using deep neural networks

被引:0
|
作者
Sokolov, Artem [1 ]
Savchenko, Andrey V. [2 ]
机构
[1] Natl Res Univ Higher Sch Econ, Nizhnii Novgorod, Russia
[2] Natl Res Univ Higher Sch Econ, Lab Algorithms & Technol Network Anal, Nizhnii Novgorod, Russia
关键词
Automatic speech recognition; autonomous man-machine systems; deep neural networks; voice command recognition; non-native speech;
D O I
10.1109/sami.2019.8782755
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, we focus on the isolated voice command recognition for autonomous man-machine and intelligent robotic systems. We propose to create a grammar model for a small testing command set with self-loops for each state to return blank symbols for noise and out-of-vocabulary words. In addition, we use single arc connected beginning and ending of the grammar in order to filter unknown commands. As a result, the grammar is resistant to distortions and unexpected words near or inside of command. We implemented the proposed approach using Finite State Transducers in the Kaldi framework and examined it using self-recorded noised data with various level of signal-to-noise ratio. We compared recognition accuracy and average decision-making time of our approach with the state-of-the-art continuous speech recognition engines based on language models. It was experimentally shown that our approach is characterized by up to 60% higher accuracy than conventional offline speech recognition methods based on language models. The speed of utterance recognition is 3 times higher than speed of traditional continuous speech recognition algorithms.
引用
收藏
页码:113 / 116
页数:4
相关论文
共 50 条
  • [41] Speech Command Recognition Using Deep Learning
    Ayache, Mohammad
    Kanaan, Hussien
    Kassir, Kawthar
    Kassir, Yasser
    2021 SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN BIOMEDICAL ENGINEERING (ICABME), 2021, : 24 - 29
  • [42] Speaker adaptation based on judge neural networks for real world implementations of Voice-Command systems
    Jeong, JH
    Kim, H
    Kim, DS
    Lee, SY
    INFORMATION SCIENCES, 2000, 123 (1-2) : 13 - 24
  • [43] Intelligent Image Recognition System for Marine Fouling Using Softmax Transfer Learning and Deep Convolutional Neural Networks
    Chin, C. S.
    Si, JianTing
    Clare, A. S.
    Ma, Maode
    COMPLEXITY, 2017,
  • [44] Voice Command Recognition for Fighter Pilots Using Grammar Tree
    Kim, Hangyu
    Park, Jeongsik
    Oh, Yunghwan
    Kim, Seongwoo
    Kirn, Bonggyu
    COMPUTER APPLICATIONS FOR DATABASE, EDUCATION, AND UBIQUITOUS COMPUTING, 2012, 352 : 116 - +
  • [45] Voice Command Recognition Using Statistical Signal Processing and SVM
    Osowska, Aleksandra
    Osowski, Stanislaw
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2019, PT I, 2019, 11506 : 65 - 73
  • [46] Deep Neural Networks for Voice Activity Detection
    Mihalache, Serban
    Ivanov, Ioan-Alexandru
    Burileanu, Dragos
    2021 44TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2021, : 191 - 194
  • [47] Multiview fusion for activity recognition using deep neural networks
    Kavi, Rahul
    Kulathumani, Vinod
    Rohit, Fnu
    Kecojevic, Vlad
    JOURNAL OF ELECTRONIC IMAGING, 2016, 25 (04)
  • [48] Human Facial Emotion Recognition using Deep Neural Networks
    Benisha, S.
    Mirnalinee, T. T.
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (03) : 303 - 309
  • [49] EFFICIENT ARABIC EMOTION RECOGNITION USING DEEP NEURAL NETWORKS
    Hifny, Yasser
    Ali, Ahmed
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6710 - 6714
  • [50] Forest Species Recognition using Deep Convolutional Neural Networks
    Hafemann, Luiz G.
    Oliveira, Luiz S.
    Cavalin, Paulo
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1103 - 1107