Hierarchical Speech Recognition System Using MFCC Feature Extraction and Dynamic Spiking RSOM

Cited by: 0

Authors:

Tarek, Behi ^{[1]}

Najet, Arous ^{[1]}

Noureddine, Ellouze ^{[1]}

Affiliations:

[1] Enit Univ Tunis El Manar, Natl Engn Sch Tunis, Lab Signal Image & Informat Technol, Tunis, Tunisia

Source:

2014 15TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD) | 2014

Keywords:

Kohonen map; temporal self-organizing map; hierarchical self-organizing model; spiking neural network; speech recognition

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In this paper, we propose new variants of unsupervised, competitive learning algorithms designed to handle temporal sequences. These algorithms combine features of spiking neural networks (SNNs) with the advantages of the hierarchical self-organizing map (HSOM). The first variant, the Hierarchical Dynamic Recurrent Spiking Self-Organizing Map (HD-RSSOM), integrates a temporal controller component that regulates the firing activity of the spiking neurons. The second variant is a hierarchical model that extends HD-RSSOM to multiple layers. The case study for the proposed HSOM variants is phoneme and word recognition in continuous speech. These variants serve as tools for developing intelligent systems and pursuing artificial intelligence applications.
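
For readers unfamiliar with temporal self-organizing maps, the following is a minimal sketch of the classic recurrent SOM idea (a leaky "difference vector" per unit, with the winner chosen by the smallest accumulated difference). It is not the authors' HD-RSSOM: it has no spiking neurons, no temporal controller, and no hierarchy, and the function names, the 1-D map layout, and all parameter values (`alpha`, `lr`, `radius`, `epochs`) are assumptions chosen only for illustration.

```python
import math
import random


def train_rsom(sequences, n_units=4, alpha=0.5, lr=0.1, radius=1.0, epochs=20, seed=0):
    """Train a 1-D recurrent SOM on sequences of feature frames (illustrative only)."""
    rng = random.Random(seed)
    dim = len(sequences[0][0])
    # Small random initial weights, one vector per map unit.
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(n_units)]
    for _ in range(epochs):
        for seq in sequences:
            # Leaky difference vectors carry temporal context; reset per sequence.
            y = [[0.0] * dim for _ in range(n_units)]
            for x in seq:
                for i in range(n_units):
                    y[i] = [(1 - alpha) * y[i][d] + alpha * (x[d] - weights[i][d])
                            for d in range(dim)]
                # Best-matching unit: smallest accumulated difference.
                bmu = min(range(n_units), key=lambda i: sum(v * v for v in y[i]))
                for i in range(n_units):
                    # Gaussian neighborhood over unit indices on the 1-D map.
                    h = math.exp(-((i - bmu) ** 2) / (2 * radius ** 2))
                    weights[i] = [weights[i][d] + lr * h * y[i][d] for d in range(dim)]
    return weights


def best_unit(weights, seq, alpha=0.5):
    """Return the index of the unit that best matches a whole sequence."""
    dim = len(weights[0])
    y = [[0.0] * dim for _ in weights]
    for x in seq:
        for i, w in enumerate(weights):
            y[i] = [(1 - alpha) * y[i][d] + alpha * (x[d] - w[d]) for d in range(dim)]
    return min(range(len(weights)), key=lambda i: sum(v * v for v in y[i]))
```

In a speech setting, each element of a sequence would be one frame of features (for example an MFCC vector, as the title suggests), and `best_unit` would give the map unit associated with a phoneme-length segment. How the paper's variants actually label units or stack layers is not stated in the abstract, so that part is left out here.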

Pages: 41-46

Page count: 6

