Voice recognition based on MFCC, SBC and Spectrograms

被引:3
|
作者
Martinez Mascorro, Guillermo Arturo [1 ]
Aguilar Torres, Gualberto [2 ]
机构
[1] Inst Politecn Nacl, Ciencias Ingn Microelect, Mexico City, DF, Mexico
[2] Inst Politecn Nacl, Secc Estudios Posgrad & Invest, ESIME Culhuacan, Mexico City, DF, Mexico
关键词
Speech recognition with voice changes; Mel Frequency Cepstral Coefficients; Subband-Based Cepstral Parameters; Spectrogram; Support Vector Machine;
D O I
10.17163/ings.n10.2013.02
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
One of the problems of the Automatic Speech Recognition systems is the voice's changes. Typically, a person can have voluntary and involuntary voice's changes and the system can get confused in these cases, also the changes could be natural and artificial. This paper proposes and recognition system with a parallel identification, using three different algorithms: MFCC, SBC and Spectrogram. Using a Support Vector Machine as a classifier, every algorithm gives a group of persons with the highest likelihood and, after an evaluation, the result is obtained. The aim of this paper is to take advantage of the three algorithms.
引用
收藏
页码:12 / 20
页数:9
相关论文
共 50 条
  • [31] The Research of Feature Extraction Based on MFCC for Speaker Recognition
    Zhang Wanli
    Li Guoxin
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 1074 - 1077
  • [32] Improved Speaker Recognition for Degraded Human Voice using Modified-MFCC and LPC with CNN
    Moondra, Amit
    Chahal, Poonam
    International Journal of Advanced Computer Science and Applications, 2023, 14 (04): : 143 - 151
  • [33] Improved Speaker Recognition for Degraded Human Voice using Modified-MFCC and LPC with CNN
    Moondra, Amit
    Chahal, Dr Poonam
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (04) : 143 - 151
  • [34] Speaker Recognition Method Based on Statistical Features of Spectrograms and CNN
    Chen, Xi
    Wang, Yonghui
    Wang, Lianming
    Yu, Jieqiong
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,
  • [35] EXPERIMENT ON VOICE IDENTIFICATION BY VISUAL INSPECTION OF SPECTROGRAMS
    TOSI, O
    OYER, H
    PEDREY, C
    LASHBROO.B
    NICOL, J
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 49 (01): : 138 - &
  • [36] Classification and recognition of diastolic heart murmurs based on EMD and MFCC
    Li H.
    Guo X.
    Zheng Y.
    Zhendong yu Chongji/Journal of Vibration and Shock, 2017, 36 (11): : 8 - 13
  • [37] Birdsong Recognition Based on MFCC combined with Vocal Tract Properties
    Lv, Danju
    Zhang, Yan
    Fu, Qinjian
    Xu, Haifeng
    Liu, Jiang
    Zi, Jiali
    Huang, Xing
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1523 - 1526
  • [38] Classification and Recognition of Underwater Target Based on MFCC Feature Extraction
    Tong, Yuze
    Zhang, Xin
    Ge, Yizhou
    2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020), 2020,
  • [39] Neural Network Based Recognition of Speech Using MFCC Features
    Barua, Pialy
    Ahmad, Kanij
    Khan, Ainul Anam Shahjamal
    Sanaullah, Muhammad
    2014 INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV), 2014,
  • [40] Implementation and Evaluation of DWT and MFCC Based ISL Gesture Recognition
    Singh, Neha
    Baranwal, Neha
    Nandi, G. C.
    2014 9TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2014, : 652 - +