Neural network ensemble based on vowel classification for chinese speaker recognition

被引:0
|
作者
Qian, Bo [1 ]
Tang, Zhen-min [1 ]
Li, Yan-ping [1 ]
Xu, Li-min [1 ]
Zhang, Yan [1 ,2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp S&T, Nanjing, Peoples R China
[2] Jinling Inst Techonol, Jiangsu Sheng, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As we known, features of speech signal not only reflect the identity information, but also contain the semantical information. In this paper, we describe a novel neural network ensemble architecture based on the finding that the diphthong and multi-vowel in Chinese can approximately be considered as the complex of mono-vowel and transitional pan in the standpoint of short-term analysis. Several neural networks are trained, each for the eigenspace of one mono-vowel, and their results are combined by another combinational neural network. The architecture can effectively improve the recognition accuracy by eliminating the disturbance of semantical information. Experimental results show that the recognition accuracy of our proposed approach is higher than conventional methods such as a single neural network and other proposed ensemble structures.
引用
收藏
页码:141 / +
页数:3
相关论文
共 50 条
  • [1] Vowel classification based on rough neural network
    Mei, XD
    Sun, SH
    Zhang, ZL
    [J]. 6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I, 2002, : 259 - 263
  • [2] Speaker recognition using artificial neural networks based on vowel phonemes
    Badran, EFMF
    Selim, H
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 796 - 802
  • [3] Speaker normalization for Chinese vowel recognition in cochlear implants
    Luo, X
    Fu, QH
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2005, 52 (07) : 1358 - 1361
  • [4] Speaker Recognition Based on Quantum Neural Network
    Wang, Geng
    Wang, Jin Ming
    Sun, Jian
    [J]. 2ND INTERNATIONAL SYMPOSIUM ON COMPUTER NETWORK AND MULTIMEDIA TECHNOLOGY (CNMT 2010), VOLS 1 AND 2, 2010, : 238 - 241
  • [5] Text independent speaker recognition based on the attack state formants and neural network classification
    Seddik, H
    Rahmouni, ABS
    Sayadi, M
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), VOLS. 1- 3, 2004, : 1649 - 1653
  • [6] Vowel Based Neural Networks for Speaker Verification
    Xu, Yun-Fei
    Huang, Yu-Fei
    Zhou, Ruo-Hua
    Yan, Yong-Hong
    [J]. INTERNATIONAL ACADEMIC CONFERENCE ON THE INFORMATION SCIENCE AND COMMUNICATION ENGINEERING (ISCE 2014), 2014, : 89 - 97
  • [7] Phase space parameters for neural network based vowel recognition
    Prajith, P
    Sreekanth, NS
    Narayanan, NK
    [J]. NEURAL INFORMATION PROCESSING, 2004, 3316 : 1204 - 1209
  • [8] Speaker independent speech emotion recognition by ensemble classification
    Schuller, B
    Reiter, S
    Müller, R
    Al-Hames, M
    Lang, M
    Rigoll, G
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 865 - 868
  • [9] FLEXIBLE VOWEL RECOGNITION BY THE GENERATION OF DYNAMIC COHERENCE IN OSCILLATOR NEURAL NETWORKS - SPEAKER-INDEPENDENT VOWEL RECOGNITION
    LIU, F
    YAMAGUCHI, Y
    SHIMIZU, H
    [J]. BIOLOGICAL CYBERNETICS, 1994, 71 (02) : 105 - 114
  • [10] Speaker indexing using neural network clustering of vowel spectra
    Roy D.K.
    [J]. International Journal of Speech Technology, 1997, 1 (2) : 143 - 149