A Speech to Machine Interface Based on the Frequency Domain Command Recognition

Cited by: 0
Authors
Almayouf, Nojood [1]
Qaisar, S. M. [1]
Alharbi, Lojain [1]
Madani, Raghdah [1]
Affiliations
[1] Effat Univ, Elect & Comp Engn Dept, Jeddah, Saudi Arabia
Keywords
speech recognition; machine interface; MATLAB; microcontroller; spectrum analysis
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recent technological advancements make it possible to develop speech-to-machine interfaces. These systems can be employed in a variety of potential applications, such as security, smart homes, and centers for people with disabilities. In this context, a speech-to-machine interface is devised. The proposed system is based on the principle of frequency-domain speech recognition. Spectral analysis of the speech is performed to extract its features. These extracted features can then be employed to perform actions such as issuing commands for actuation, granting access to secure services, voice dialing, telephone banking, and accessing confidential databases. A simplified system prototype is designed and developed. It accepts commands in the form of speech, extracts speech signal features, and employs these features to pilot the actuators. The actuators are piloted according to the speaker's wishes. The functionality of the developed system is verified, and the results show proper system operation.
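The abstract does not specify which spectral features or matching rule the prototype uses, so the following is only a minimal sketch of the general idea of frequency-domain command recognition, not the authors' implementation. It assumes normalized band energies as features and nearest-template matching as the decision rule; the helper names (`band_energies`, `recognize`, `tone`), the synthetic tones standing in for spoken commands, and the choice of Python rather than the MATLAB and microcontroller setup named in the keywords are all illustrative assumptions.

```python
import cmath
import math


def magnitude_spectrum(signal):
    """Naive DFT magnitudes for bins 0..N/2 (O(N^2), adequate for short frames)."""
    n = len(signal)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(signal)))
        for k in range(n // 2 + 1)
    ]


def band_energies(signal, num_bands=4):
    """Hypothetical feature vector: share of spectral energy in equal-width bands."""
    spectrum = magnitude_spectrum(signal)
    width = len(spectrum) // num_bands
    energies = [
        sum(m * m for m in spectrum[b * width:(b + 1) * width])
        for b in range(num_bands)
    ]
    total = sum(energies) or 1.0  # avoid division by zero on silence
    return [e / total for e in energies]


def recognize(signal, templates):
    """Return the command whose stored feature vector is nearest (Euclidean distance)."""
    features = band_energies(signal)
    return min(templates, key=lambda name: math.dist(features, templates[name]))


def tone(freq_hz, sample_rate=800, n=64):
    """Synthetic stand-in for a recorded command (assumption: real input would be speech)."""
    return [math.sin(2 * math.pi * freq_hz * t / sample_rate) for t in range(n)]


# Enroll two hypothetical commands, then classify a slightly different input.
templates = {"on": band_energies(tone(50)), "off": band_energies(tone(300))}
command = recognize(tone(60), templates)
```

In a setup like the one the abstract describes, the recognized label would then be mapped to an actuator action; that step is hardware-specific and is left out here.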
Pages: 356-360
Page count: 5
Related papers
50 items in total
  • [31] Joint frequency domain and reconstructed phase space features for speech recognition
    Lindgren, AC
    Johnson, MT
    Povinelli, RJ
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 533 - 536
  • [32] Frequency domain microphone array calibration and beamforming for automatic speech recognition
    Hu, JS
    Cheng, CC
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (09): : 2401 - 2411
  • [33] Speech emotion recognition based on time domain feature
    Zhao, Lasheng
    Wei, Xiaopeng
    Zhang, Qiang
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 1319 - 1321
  • [34] Indonesian Automatic Speech Recognition For Command Speech Controller Multimedia Player
    Wardhany, Vivien Arief
    Sukaridhoto, Sritrusta
    Sudarsono, Amang
    EMITTER-INTERNATIONAL JOURNAL OF ENGINEERING TECHNOLOGY, 2014, 2 (02) : 39 - 48
  • [35] Speech Recognition for Voice-Based Machine Translation
    Duarte, Tiago
    Prikladnicki, Rafael
    Calefato, Fabio
    Lanubile, Filippo
    IEEE SOFTWARE, 2014, 31 (01) : 26 - 31
  • [36] Relevance Vector Machine Based Speech Emotion Recognition
    Wang, Fengna
    Verhelst, Werner
    Sahli, Hichem
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PT II, 2011, 6975 : 111 - 120
  • [37] Speech based Emotion Recognition using Machine Learning
    Deshmukh, Girija
    Gaonkar, Apurva
    Golwalkar, Gauri
    Kulkarni, Sukanya
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 812 - 817
  • [38] Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction
    Ganapathy, Sriram
    Thomas, Samuel
    Hermansky, Hynek
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 984 - +
  • [39] A SOFTWARE INTERFACE FOR SPEECH RECOGNITION
    IVERSON, RD
    ARNOTT, PJ
    PFEIFFER, GW
    COMPUTER DESIGN, 1982, 21 (03): : 147 - &
  • [40] Classical and Deep Learning Methods for Speech Command Recognition
    Xie, Jie
    Li, Qijing
    Hu, Kai
    Zhu, Mingying
    2021 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2021), 2021, : 41 - 45