A Speech to Machine Interface Based on the Frequency Domain Command Recognition

被引：0

作者：

Almayouf, Nojood ^{[1
]}

Qaisar, S. M. ^{[1
]}

Alharbi, Lojain ^{[1
]}

Madani, Raghdah ^{[1
]}

机构：

[1] Effat Univ, Elect & Comp Engn Dept, Jeddah, Saudi Arabia

来源：

2017 IEEE 2ND INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP) | 2017年

关键词：

speech recognition; machine interface; matlab; microcontroller; spectrum analysis;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The recent technological advancements allow to develop speech to machine interfaces. These systems can be employed in a variety of potential applications like security, smart homes, centers for people with disabilities, etc. In this context a speech to machine interface is devised. The proposed system is based on the principle of frequency domain speech recognition. The spectral analysis of speech is performed in order to extract its features. Later on these extracted features could be employed to perform actions like issuing commands for actuations, granting access to the secure services, dialing with voice, banking via telephone, accessing confidential databases, etc. A simplified system prototype is designed and developed. It accepts commands in the form of speech, extract speech signal features and employ these features for piloting the actuators. The actuators are piloted according to the speaker desire. The developed system functionality is verified. Results show a proper system operation.

引用

页码：356 / 360

页数：5

共 50 条

[31] Joint frequency domain and reconstructed phase space features for speech recognition
Lindgren, AC
Johnson, MT
Povinelli, RJ
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 533 - 536
[32] Frequency domain microphone array calibration and beamforming for automatic speech recognition
Hu, JS
Cheng, CC
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (09): : 2401 - 2411
[33] Speech emotion recognition based on time domain feature
Zhao, Lasheng
Wei, Xiaopeng
Zhang, Qiang
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 1319 - 1321
[34] Indonesian Automatic Speech Recognition For Command Speech Controller Multimedia Player
Wardhany, Vivien Arief
Sukaridhoto, Sritrusta
Sudarsono, Amang
EMITTER-INTERNATIONAL JOURNAL OF ENGINEERING TECHNOLOGY, 2014, 2 (02) : 39 - 48
[35] Speech Recognition for Voice-Based Machine Translation
Duarte, Tiago
Prikladnicki, Rafael
Calefato, Fabio
Lanubile, Filippo
IEEE SOFTWARE, 2014, 31 (01) : 26 - 31
[36] Relevance Vector Machine Based Speech Emotion Recognition
Wang, Fengna
Verhelst, Werner
Sahli, Hichem
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PT II, 2011, 6975 : 111 - 120
[37] Speech based Emotion Recognition using Machine Learning
Deshmukh, Girija
Gaonkar, Apurva
Golwalkar, Gauri
Kulkarni, Sukanya
PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 812 - 817
[38] Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction
Ganapathy, Sriram
Thomas, Samuel
Hermansky, Hynek
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 984 - +
[39] A SOFTWARE INTERFACE FOR SPEECH RECOGNITION
IVERSON, RD
ARNOTT, PJ
PFEIFFER, GW
COMPUTER DESIGN, 1982, 21 (03): : 147 - &
[40] Classical and Deep Learning Methods for Speech Command Recognition
Xie, Jie
Li, Qijing
Hu, Kai
Zhu, Mingying
2021 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2021), 2021, : 41 - 45

← 1 2 3 4 5 →