Neural network ensemble based on vowel classification for chinese speaker recognition

被引:0
|
作者
Qian, Bo [1 ]
Tang, Zhen-min [1 ]
Li, Yan-ping [1 ]
Xu, Li-min [1 ]
Zhang, Yan [1 ,2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp S&T, Nanjing, Peoples R China
[2] Jinling Inst Techonol, Jiangsu Sheng, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As we known, features of speech signal not only reflect the identity information, but also contain the semantical information. In this paper, we describe a novel neural network ensemble architecture based on the finding that the diphthong and multi-vowel in Chinese can approximately be considered as the complex of mono-vowel and transitional pan in the standpoint of short-term analysis. Several neural networks are trained, each for the eigenspace of one mono-vowel, and their results are combined by another combinational neural network. The architecture can effectively improve the recognition accuracy by eliminating the disturbance of semantical information. Experimental results show that the recognition accuracy of our proposed approach is higher than conventional methods such as a single neural network and other proposed ensemble structures.
引用
收藏
页码:141 / +
页数:3
相关论文
共 50 条
  • [1] Speaker recognition algorithm based on neural network ensemble and its simulation study
    Qian, Bo
    Li, Yan-Ping
    Tang, Zhen-Min
    Xu, Li-Min
    2008, Acta Simulata Systematica Sinica, Beijing, 100854, China (20):
  • [2] Vowel classification based on rough neural network
    Mei, XD
    Sun, SH
    Zhang, ZL
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I, 2002, : 259 - 263
  • [3] Speaker recognition using artificial neural networks based on vowel phonemes
    Badran, EFMF
    Selim, H
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 796 - 802
  • [4] Speaker normalization for Chinese vowel recognition in cochlear implants
    Luo, X
    Fu, QH
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2005, 52 (07) : 1358 - 1361
  • [5] Speaker Recognition Based on Quantum Neural Network
    Wang, Geng
    Wang, Jin Ming
    Sun, Jian
    2ND INTERNATIONAL SYMPOSIUM ON COMPUTER NETWORK AND MULTIMEDIA TECHNOLOGY (CNMT 2010), VOLS 1 AND 2, 2010, : 238 - 241
  • [6] Text independent speaker recognition based on the attack state formants and neural network classification
    Seddik, H
    Rahmouni, ABS
    Sayadi, M
    2004 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), VOLS. 1- 3, 2004, : 1649 - 1653
  • [7] Speaker recognition method based on quantum neural network
    Wang, J.-M. (wjm_ice@163.com), 1600, University of Science and Technology (13):
  • [8] Vowel Based Neural Networks for Speaker Verification
    Xu, Yun-Fei
    Huang, Yu-Fei
    Zhou, Ruo-Hua
    Yan, Yong-Hong
    INTERNATIONAL ACADEMIC CONFERENCE ON THE INFORMATION SCIENCE AND COMMUNICATION ENGINEERING (ISCE 2014), 2014, : 89 - 97
  • [9] Phase space parameters for neural network based vowel recognition
    Prajith, P
    Sreekanth, NS
    Narayanan, NK
    NEURAL INFORMATION PROCESSING, 2004, 3316 : 1204 - 1209
  • [10] Speaker independent speech emotion recognition by ensemble classification
    Schuller, B
    Reiter, S
    Müller, R
    Al-Hames, M
    Lang, M
    Rigoll, G
    2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 865 - 868