Neural network ensemble based on vowel classification for chinese speaker recognition

被引:0
|
作者
Qian, Bo [1 ]
Tang, Zhen-min [1 ]
Li, Yan-ping [1 ]
Xu, Li-min [1 ]
Zhang, Yan [1 ,2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp S&T, Nanjing, Peoples R China
[2] Jinling Inst Techonol, Jiangsu Sheng, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As we known, features of speech signal not only reflect the identity information, but also contain the semantical information. In this paper, we describe a novel neural network ensemble architecture based on the finding that the diphthong and multi-vowel in Chinese can approximately be considered as the complex of mono-vowel and transitional pan in the standpoint of short-term analysis. Several neural networks are trained, each for the eigenspace of one mono-vowel, and their results are combined by another combinational neural network. The architecture can effectively improve the recognition accuracy by eliminating the disturbance of semantical information. Experimental results show that the recognition accuracy of our proposed approach is higher than conventional methods such as a single neural network and other proposed ensemble structures.
引用
收藏
页码:141 / +
页数:3
相关论文
共 50 条
  • [21] Distribution Network Connectivity Recognition Based on Ensemble Deep Neural Network
    Jiang W.
    Tang H.
    Qi H.
    Chen H.
    Chen J.
    Jiao H.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2020, 44 (01): : 101 - 108
  • [22] Subspace-based speaker-independent vowel recognition
    Muralishankar, R
    O'Shaughnessy, D
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 549 - 552
  • [23] Speaker-Independent Malay Vowel Recognition of Children using Neural Networks
    Ting, H. N.
    Lam, Y. M.
    WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING, VOL 25, PT 4: IMAGE PROCESSING, BIOSIGNAL PROCESSING, MODELLING AND SIMULATION, BIOMECHANICS, 2010, 25 : 288 - 291
  • [24] Speaker state recognition with neural network-based classification and self-adaptive heuristic feature selection
    Sidorov, Maxim
    Brester, Christina
    Semenkin, Eugene
    Minker, Wolfgang
    ICINCO 2014 - Proceedings of the 11th International Conference on Informatics in Control, Automation and Robotics, 2014, 1 : 699 - 703
  • [25] Convolutional neural network vectors for speaker recognition
    Soufiane Hourri
    Nikola S. Nikolov
    Jamal Kharroubi
    International Journal of Speech Technology, 2021, 24 : 389 - 400
  • [26] Speaker Recognition Based on Principal Component Analysis and Probabilistic Neural Network
    Zhou, Yan
    Shang, Li
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2012, 6839 : 708 - 715
  • [27] Convolutional Neural Network Based Ensemble Approach for Homoglyph Recognition
    Majumder, Md Taksir Hasan
    Rahman, Md Mahabur
    Iqbal, Anindya
    Rahman, M. Sohel
    MATHEMATICAL AND COMPUTATIONAL APPLICATIONS, 2020, 25 (04)
  • [28] Convolutional neural network vectors for speaker recognition
    Hourri, Soufiane
    Nikolov, Nikola S.
    Kharroubi, Jamal
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (02) : 389 - 400
  • [29] Face Recognition Based on Neural Network Ensemble and Feature Fusion
    Dong, Jiwen
    Zhao, Lei
    Zhang, Liang
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 59 - 62
  • [30] Speaker Recognition Based on Lightweight Neural Network for Smart Home Solutions
    Ai, Haojun
    Xia, Wuyang
    Zhang, Quanxin
    CYBERSPACE SAFETY AND SECURITY, PT II, 2019, 11983 : 421 - 431