Neural network ensemble based on vowel classification for Chinese speaker recognition

Cited by: 0

Authors:

Qian, Bo [1]

Tang, Zhen-min [1]

Li, Yan-ping [1]

Xu, Li-min [1]

Zhang, Yan [1,2]

Affiliations:

[1] Nanjing Univ Sci & Technol, Sch Comp S&T, Nanjing, Peoples R China

[2] Jinling Inst Technol, Jiangsu Sheng, Peoples R China

Source:

ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS | 2007

Keywords:

DOI:

Not available

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

It is well known that the features of a speech signal reflect not only the speaker's identity but also semantic information. In this paper, we describe a novel neural network ensemble architecture based on the finding that, from the standpoint of short-term analysis, diphthongs and multi-vowels in Chinese can be approximated as combinations of mono-vowels and transitional parts. Several neural networks are trained, each on the eigenspace of one mono-vowel, and their outputs are combined by a further combining neural network. The architecture improves recognition accuracy by eliminating the disturbance caused by semantic information. Experimental results show that the recognition accuracy of the proposed approach is higher than that of conventional methods such as a single neural network and other previously proposed ensemble structures.
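The ensemble structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the synthetic features, the `TinyMLP` class, the layer sizes, the assumption that each frame already carries a known mono-vowel label, and the choice to average frame posteriors before combining are all hypothetical choices made only to show the shape of the idea (one expert network per mono-vowel, plus a combiner network on top).

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # stabilise the exponentials
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

class TinyMLP:
    """Hypothetical one-hidden-layer softmax classifier (batch gradient descent)."""
    def __init__(self, n_in, n_hidden, n_out):
        self.W1 = rng.normal(0, 0.1, (n_in, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(0, 0.1, (n_hidden, n_out))
        self.b2 = np.zeros(n_out)

    def forward(self, X):
        self.h = np.tanh(X @ self.W1 + self.b1)
        return softmax(self.h @ self.W2 + self.b2)

    def fit(self, X, y, epochs=300, lr=0.1):
        Y = np.eye(self.b2.size)[y]                  # one-hot targets
        for _ in range(epochs):
            P = self.forward(X)
            dZ2 = (P - Y) / len(X)                   # cross-entropy gradient
            dH = (dZ2 @ self.W2.T) * (1 - self.h ** 2)
            self.W2 -= lr * self.h.T @ dZ2
            self.b2 -= lr * dZ2.sum(axis=0)
            self.W1 -= lr * X.T @ dH
            self.b1 -= lr * dH.sum(axis=0)
        return self

# Synthetic stand-in for speech features (assumption): each frame is a
# speaker offset plus a vowel offset plus noise.
N_SPK, N_VOW, DIM, FRAMES = 3, 3, 4, 5
spk_mean = rng.normal(0, 1.0, (N_SPK, DIM))
vow_mean = rng.normal(0, 1.0, (N_VOW, DIM))

def make_utterance(s):
    """Frames of speaker s grouped by vowel: shape (N_VOW, FRAMES, DIM)."""
    noise = rng.normal(0, 0.3, (N_VOW, FRAMES, DIM))
    return spk_mean[s] + vow_mean[:, None, :] + noise

def make_set(n_per_speaker):
    labels = np.repeat(np.arange(N_SPK), n_per_speaker)
    utts = np.stack([make_utterance(s) for s in labels])
    return utts, labels

train_u, train_y = make_set(30)
test_u, test_y = make_set(10)

# One expert per mono-vowel, trained only on frames of that vowel.
experts = []
for v in range(N_VOW):
    X = train_u[:, v].reshape(-1, DIM)
    y = np.repeat(train_y, FRAMES)
    experts.append(TinyMLP(DIM, 8, N_SPK).fit(X, y))

def expert_outputs(utts):
    """Average each expert's frame posteriors per utterance and concatenate."""
    feats = []
    for v, net in enumerate(experts):
        P = net.forward(utts[:, v].reshape(-1, DIM))
        feats.append(P.reshape(len(utts), FRAMES, N_SPK).mean(axis=1))
    return np.hstack(feats)                          # (N, N_VOW * N_SPK)

# Combiner network maps the experts' outputs to a final speaker decision.
combiner = TinyMLP(N_VOW * N_SPK, 8, N_SPK).fit(expert_outputs(train_u), train_y)
probs = combiner.forward(expert_outputs(test_u))
accuracy = (probs.argmax(axis=1) == test_y).mean()
print(f"test accuracy on synthetic data: {accuracy:.2f}")
```

Because each expert sees frames of a single vowel, the vowel-dependent offset is constant within its training set, so the expert only has to model speaker differences. This is one way to read the abstract's claim about removing the disturbance of semantic information; the accuracy printed here reflects only the toy data.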

Pages: 141 / +

Page count: 3
