Recognition of human speech phonemes using a novel fuzzy approach

被引：11

作者：

Halavati, Ramin ^{[1
]}

Shouraki, Saeed Bagheri ^{[1
]}

Zadeh, Saman Harati ^{[1
]}

机构：

[1] Sharif Univ Technol, Dept Comp Engn, Artificial Intelligence Lab 308, Tehran, Iran

来源：

APPLIED SOFT COMPUTING | 2007年 / 7卷 / 03期

关键词：

fuzzy modeling; speech recognition; genetic algorithms;

D O I：

10.1016/j.asoc.2006.02.007

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recognition of human speech has long been a hot topic among artificial intelligence and signal processing researches. Most of current policies for this subject are based on extraction of precise features of voice signal and trying to make most out of them by heavy computations. But this focus on signal details has resulted in too much sensitivity to noise and as a result, the necessity of complex noise detection and removal algorithms, which composes a trade-off between fast or noise robust recognition. This paper presents a novel approach to speech recognition using fuzzy modeling and decision making that ignores noise instead of its detection and removal. To do so, the speech spectrogram is converted into a fuzzy linguistic description and this description is used instead of precise acoustic features. During the training period, a genetic algorithm finds appropriate definitions for phonemes, and when these definitions are ready, a simple novel operator consisting of low cost functions such as Max, Min, and Average makes the recognition. The approach is tested on a standard speech database and is compared with Hidden Markov model recognition system with MFCC features as a widely used speech recognition approach. (c) 2006 Elsevier B. V. All rights reserved.

引用

页码：828 / 839

页数：12

共 50 条

[1] A novel fuzzy approach to speech recognition
Halavati, R
Shouraki, SB
Eshraghi, M
Alemzadeh, M
Ziaie, P
[J]. HIS'04: Fourth International Conference on Hybrid Intelligent Systems, Proceedings, 2005, : 340 - 345
[2] Speech emotion recognition using a fuzzy approach
Ton-That, An H.
Cao, Nhan T.
[J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (02) : 1587 - 1597
[3] Speech Recognition System Based On Phonemes Using Neural Networks
Maheswari, N. Uma
Kabilan, A. P.
Venkatesh, R.
[J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (07): : 148 - 153
[4] A Novel Approach for Vietnamese Speech Recognition Using Conformer
Tuan, Nguyen Van Anh
Hoa, Nguyen Thi Thanh
Dat, Nguyen Thanh
Tuan, Pham Minh
Truong, Dao Duy
Phuc, Dang Thi
[J]. FUTURE DATA AND SECURITY ENGINEERING. BIG DATA, SECURITY AND PRIVACY, SMART CITY AND INDUSTRY 4.0 APPLICATIONS, FDSE 2022, 2022, 1688 : 723 - 730
[5] Arabic phonemes recognition using hybrid LVQ/HMM model for continuous speech recognition
Nahar, Khalid M. O.
Abu Shquier, Mohammed
Al-Khatib, Wasfi G.
Al-Muhtaseb, Husni
Elshafei, Moustafa
[J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (03) : 495 - 508
[6] Automatic speech recognition of Portuguese phonemes using neural networks ensemble
Nedjah, Nadia
Bonilla, Alejandra D.
Mourelle, Luiza de Macedo
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
[7] Effect of speech-intrinsic variations on human and automatic recognition of spoken phonemes
Meyer, Bernd T.
Brand, Thomas
Kollmeier, Birger
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 129 (01): : 388 - 403
[8] HUMAN RECOGNITION OF SUSTAINED PHONEMES
FOCHT, LR
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1963, 35 (11): : 1890 - &
[9] SEGMENTATION AND RECOGNITION OF PHONEMES USING GROSS FEATURES OF SPEECH SPECTRUM AND THEIR DYNAMIC PROPERTIES
MIWA, J
MAKINO, S
KIDO, K
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 : S179 - S179
[10] A Novel Fuzzy HMM Approach for Human Action Recognition in Video
Mozafari, Kourosh
Charkari, Nasrollah Moghadam
Boroujeni, Hamidreza Shayegh
Behrouzifar, Mohammad
[J]. KNOWLEDGE TECHNOLOGY, 2012, 295 : 184 - 193

← 1 2 3 4 5 →