Hierarchical Speech Recognition System Using MFCC Feature Extraction and Dynamic Spiking RSOM

被引:0
|
作者
Tarek, Behi [1 ]
Najet, Arous [1 ]
Noureddine, Ellouze [1 ]
机构
[1] Enit Univ Tunis El Manar, Natl Engn Sch Tunis, Lab Signal Image & Informat Technol, Tunis, Tunisia
关键词
Kohonen map; Temporal self organizing map; hierarchical self-organizing model; Spiking neural network; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose new variants of unsupervised and competitive learning algorithms designed to deal with temporal sequences. These algorithms combine features from Spiking Neural Networks (SNNs) and the advantages of the hierarchical self organizing map (HSOM). The first variant named Hierarchical Dynamic recurrent spiking self-organizing map (HD-RSSOM) is characterized by the integration of a temporal controller component to regulate the firing activity of the spiking neurons. The second variant is a hierarchical model which represents a multi-layer extension of HD-RSSOM model. The case study of the proposed HSOM variants is phonemes and words recognition in continuous speech. The applied HSOM variants serve as tools for developing intelligent systems and pursuing artificial intelligence applications.
引用
收藏
页码:41 / 46
页数:6
相关论文
共 50 条
  • [1] Denoising Speech for MFCC Feature Extraction Using Wavelet Transformation in Speech Recognition System
    Hidayat, Risanuri
    Bejo, Agus
    Sumaryono, Sujoko
    Winursito, Anggun
    [J]. PROCEEDINGS OF 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE), 2018, : 280 - 284
  • [2] Proposed combination of PCA and MFCC feature extraction in speech recognition system
    Hoang Trang
    Tran Hoang Loc
    Huynh Bui Hoang Nam
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2014, : 697 - 702
  • [3] Feature Extraction Using Fusion MFCC For Continuous Marathi Speech Recognition
    Gaikwad, Santosh
    Gawali, Bharti
    Yannawar, Pravin
    Mehrotra, Suresh
    [J]. 2011 ANNUAL IEEE INDIA CONFERENCE (INDICON-2011): ENGINEERING SUSTAINABLE SOLUTIONS, 2011,
  • [4] Arabic Speech Recognition Using MFCC Feature Extraction and ANN Classification
    Wahyuni, Elvira Sukma
    [J]. 2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 22 - 25
  • [5] Hardware Implementation of MFCC Feature Extraction for Speech Recognition on FPGA
    Van-Lan Dao
    Van-Danh Nguyen
    Hai-Duong Nguyen
    Van-Phuc Hoang
    [J]. ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 538 : 248 - 254
  • [6] Feature Data Reduction of MFCC Using PCA and SVD in Speech Recognition System
    Winursito, Anggun
    Hidayat, Risanuri
    Bejo, Agus
    Utomo, Muhammad Nur Yasir
    [J]. 2018 INTERNATIONAL CONFERENCE ON SMART COMPUTING AND ELECTRONIC ENTERPRISE (ICSCEE), 2018,
  • [7] Filterbank Analysis of MFCC Feature Extraction in Robust Children Speech Recognition
    Naing, Hay Mar Soe
    Miyanaga, Yoshikazu
    Hidayat, Risanuri
    Winduratna, Bondhan
    [J]. 2019 INTERNATIONAL SYMPOSIUM ON MULTIMEDIA AND COMMUNICATION TECHNOLOGY (ISMAC), 2019,
  • [8] Improved MFCC feature extraction by PCA-optimized filterbank for speech recognition
    Lee, SM
    Fang, SH
    Hung, JW
    Lee, LS
    [J]. ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 49 - 52
  • [9] Speech emotion recognition using MFCC-based entropy feature
    Siba Prasad Mishra
    Pankaj Warule
    Suman Deb
    [J]. Signal, Image and Video Processing, 2024, 18 : 153 - 161
  • [10] Chip design of MFCC extraction for speech recognition
    Wang, JC
    Wang, JF
    Weng, YS
    [J]. INTEGRATION-THE VLSI JOURNAL, 2002, 32 (1-2) : 111 - 131