Recognition of human speech phonemes using a novel fuzzy approach

被引:11
|
作者
Halavati, Ramin [1 ]
Shouraki, Saeed Bagheri [1 ]
Zadeh, Saman Harati [1 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Artificial Intelligence Lab 308, Tehran, Iran
关键词
fuzzy modeling; speech recognition; genetic algorithms;
D O I
10.1016/j.asoc.2006.02.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognition of human speech has long been a hot topic among artificial intelligence and signal processing researches. Most of current policies for this subject are based on extraction of precise features of voice signal and trying to make most out of them by heavy computations. But this focus on signal details has resulted in too much sensitivity to noise and as a result, the necessity of complex noise detection and removal algorithms, which composes a trade-off between fast or noise robust recognition. This paper presents a novel approach to speech recognition using fuzzy modeling and decision making that ignores noise instead of its detection and removal. To do so, the speech spectrogram is converted into a fuzzy linguistic description and this description is used instead of precise acoustic features. During the training period, a genetic algorithm finds appropriate definitions for phonemes, and when these definitions are ready, a simple novel operator consisting of low cost functions such as Max, Min, and Average makes the recognition. The approach is tested on a standard speech database and is compared with Hidden Markov model recognition system with MFCC features as a widely used speech recognition approach. (c) 2006 Elsevier B. V. All rights reserved.
引用
收藏
页码:828 / 839
页数:12
相关论文
共 50 条
  • [1] A novel fuzzy approach to speech recognition
    Halavati, R
    Shouraki, SB
    Eshraghi, M
    Alemzadeh, M
    Ziaie, P
    [J]. HIS'04: Fourth International Conference on Hybrid Intelligent Systems, Proceedings, 2005, : 340 - 345
  • [2] Speech emotion recognition using a fuzzy approach
    Ton-That, An H.
    Cao, Nhan T.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (02) : 1587 - 1597
  • [3] Speech Recognition System Based On Phonemes Using Neural Networks
    Maheswari, N. Uma
    Kabilan, A. P.
    Venkatesh, R.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (07): : 148 - 153
  • [4] A Novel Approach for Vietnamese Speech Recognition Using Conformer
    Tuan, Nguyen Van Anh
    Hoa, Nguyen Thi Thanh
    Dat, Nguyen Thanh
    Tuan, Pham Minh
    Truong, Dao Duy
    Phuc, Dang Thi
    [J]. FUTURE DATA AND SECURITY ENGINEERING. BIG DATA, SECURITY AND PRIVACY, SMART CITY AND INDUSTRY 4.0 APPLICATIONS, FDSE 2022, 2022, 1688 : 723 - 730
  • [5] Arabic phonemes recognition using hybrid LVQ/HMM model for continuous speech recognition
    Nahar, Khalid M. O.
    Abu Shquier, Mohammed
    Al-Khatib, Wasfi G.
    Al-Muhtaseb, Husni
    Elshafei, Moustafa
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (03) : 495 - 508
  • [6] Automatic speech recognition of Portuguese phonemes using neural networks ensemble
    Nedjah, Nadia
    Bonilla, Alejandra D.
    Mourelle, Luiza de Macedo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
  • [7] Effect of speech-intrinsic variations on human and automatic recognition of spoken phonemes
    Meyer, Bernd T.
    Brand, Thomas
    Kollmeier, Birger
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 129 (01): : 388 - 403
  • [8] HUMAN RECOGNITION OF SUSTAINED PHONEMES
    FOCHT, LR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1963, 35 (11): : 1890 - &
  • [9] SEGMENTATION AND RECOGNITION OF PHONEMES USING GROSS FEATURES OF SPEECH SPECTRUM AND THEIR DYNAMIC PROPERTIES
    MIWA, J
    MAKINO, S
    KIDO, K
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 : S179 - S179
  • [10] A Novel Fuzzy HMM Approach for Human Action Recognition in Video
    Mozafari, Kourosh
    Charkari, Nasrollah Moghadam
    Boroujeni, Hamidreza Shayegh
    Behrouzifar, Mohammad
    [J]. KNOWLEDGE TECHNOLOGY, 2012, 295 : 184 - 193