Filterbank Analysis of MFCC Feature Extraction in Robust Children Speech Recognition

被引:0
|
作者
Naing, Hay Mar Soe [1 ]
Miyanaga, Yoshikazu [2 ]
Hidayat, Risanuri [1 ]
Winduratna, Bondhan [1 ]
机构
[1] Gadjah Mada Univ, Dept Elect Engn & Informat Technol, Yogyakarta 55281, Indonesia
[2] Hokkaido Univ, GS Informat Sci & Techonol, GI CoRE GSB, Sapporo, Hokkaido 0600814, Japan
关键词
children speech recognition; gammatone frequency integration; MFCC; speaker adaptative model;
D O I
10.1109/ismac.2019.8836181
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper focused on the issue of robustness in children speech recognition system. The shape of filterbank analysis is presented in this study to suppress the additive background noise in acoustic features of children speakers. In addition, the Linear Discriminant Analysis (LDA), the Maximum Likelihood Linear Transform (MLLT) and feature space Maximum Likelihood Linear Regression (fMLLR) features are applied to build the speaker adaptive acoustic model with the help of Kaldi speech recognition toolkit. The performance of Gammatone filterbank and Bark-scale filterbank based Cepstral features were evaluated under contaminated situations using five different types of noise at a range of signal to noise ratio (SNR) 10dB to -10dB. As the detailed analysis shown, the performance of Gammatone frequency integration is superior to Mel Frequency Cepstral Coefficient (MFCC) in different types of additive background noise and various SNR situations.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Improved MFCC feature extraction by PCA-optimized filterbank for speech recognition
    Lee, SM
    Fang, SH
    Hung, JW
    Lee, LS
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 49 - 52
  • [2] Hardware Implementation of MFCC Feature Extraction for Speech Recognition on FPGA
    Van-Lan Dao
    Van-Danh Nguyen
    Hai-Duong Nguyen
    Van-Phuc Hoang
    ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 538 : 248 - 254
  • [3] Feature extraction for robust speech recognition
    Dharanipragada, S
    2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, PROCEEDINGS, 2002, : 855 - 858
  • [4] A Modified MFCC Feature Extraction Technique For Robust Speaker Recognition
    Sharma, Diksha
    Ali, Israj
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 1052 - 1057
  • [5] Feature Extraction Using Fusion MFCC For Continuous Marathi Speech Recognition
    Gaikwad, Santosh
    Gawali, Bharti
    Yannawar, Pravin
    Mehrotra, Suresh
    2011 ANNUAL IEEE INDIA CONFERENCE (INDICON-2011): ENGINEERING SUSTAINABLE SOLUTIONS, 2011,
  • [6] Proposed combination of PCA and MFCC feature extraction in speech recognition system
    Hoang Trang
    Tran Hoang Loc
    Huynh Bui Hoang Nam
    2014 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2014, : 697 - 702
  • [7] Arabic Speech Recognition Using MFCC Feature Extraction and ANN Classification
    Wahyuni, Elvira Sukma
    2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 22 - 25
  • [8] Denoising Speech for MFCC Feature Extraction Using Wavelet Transformation in Speech Recognition System
    Hidayat, Risanuri
    Bejo, Agus
    Sumaryono, Sujoko
    Winursito, Anggun
    PROCEEDINGS OF 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE), 2018, : 280 - 284
  • [9] Multitaper Based MFCC Feature Extraction for Robust Speaker Recognition System
    Bharath, K. P.
    Kumar, Rajesh M.
    2019 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2019,
  • [10] Geometrical feature extraction for robust speech recognition
    Li, Xiaokun
    Kwan, Chiman
    2005 39TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1 AND 2, 2005, : 558 - 562