Mandarin Digits Speech Recognition Using Support Vector Machines

被引:2
|
作者
谢湘
匡镜明
机构
[1] Beijing Institute of Technology
[2] Beijing100081
[3] China
[4] School of Information Science and Technology
关键词
speech recognition; support vector machine (SVM); kernel function;
D O I
10.15918/j.jbit1004-0579.2005.01.003
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speech feature sequence to make up time-aligned input patterns for SVM, and the decisions of several 2-class SVM classifiers were employed for constructing an N-class classifier. Four kinds of SVM kernel functions were compared in the experiments of speaker-independent speech recognition of mandarin digits. And the kernel of radial basis function has the highest accurate rate of 99.33%, which is better than that of the baseline system based on hidden Markov models (HMM) (97.08%). And the experiments also show that SVM can outperform HMM especially when the samples for learning were very limited.
引用
收藏
页码:9 / 12
页数:4
相关论文
共 50 条
  • [1] Speech Recognition using Support Vector Machines
    Aida-zade, Kamil
    Xocayev, Anar
    Rustamov, Samir
    [J]. 2016 IEEE 10TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2016, : 108 - 111
  • [2] Speech Emotion Recognition Using Support Vector Machines
    Yu, Caiming
    Tian, Qingxi
    Cheng, Fang
    Zhang, Shiqing
    [J]. ADVANCED RESEARCH ON COMPUTER SCIENCE AND INFORMATION ENGINEERING, PT I, 2011, 152 : 215 - 220
  • [3] Visual speech recognition using support vector machines
    Gordan, M
    Kotropoulos, C
    Pitas, I
    [J]. DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 1093 - 1096
  • [4] Convolutional support vector machines for speech recognition
    Passricha, Vishal
    Aggarwal, Rajesh Kumar
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (03) : 601 - 609
  • [5] RECURRENT SUPPORT VECTOR MACHINES FOR SPEECH RECOGNITION
    Zhang, Shi-Xiong
    Zhao, Rui
    Liu, Chaojun
    Li, Jinyu
    Gong, Yifan
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5885 - 5889
  • [6] An Application of Speech Recognition with Support Vector Machines
    Eray, Osman
    Tokat, Sezai
    Iplikci, Serdar
    [J]. 2018 6TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSIC AND SECURITY (ISDFS), 2018, : 38 - 43
  • [7] Applications of support vector machines to speech recognition
    Ganapathiraju, A
    Hamaker, JE
    Picone, J
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) : 2348 - 2355
  • [8] Infinite Support Vector Machines in Speech Recognition
    Yang, Jingzhou
    van Dalen, Rogier C.
    Gales, Mark
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3302 - 3306
  • [9] Convolutional support vector machines for speech recognition
    Vishal Passricha
    Rajesh Kumar Aggarwal
    [J]. International Journal of Speech Technology, 2019, 22 : 601 - 609
  • [10] Mandarin Connected Digits Recognition for Whispered Speech
    Ru Tingting
    Xie Xiang
    Yin Hui
    Kuang Jingming
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1141 - 1144