Robust speaker detection using Neural Networks

被引：0

作者：

Shell, John R. ^{[1
]}

机构：

[1] So Illinois Univ, Dept Elect & Comp Engn, Carbondale, IL 62901 USA

来源：

PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING | 2006年

关键词：

Neural Networks; speech recognition; modeling;

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

The work proposed in this paper utilizes Neural Networks to distinguish speech patterns. A feature extractor is used as a standard Linear Processing Coefficients (LPC) Cepstrum coder, converting the incoming speech signal captured by a Matlab interface into LPC Cepstrum feature space. A Neural Network makes each variable length LPC trajectory of an isolated word into a fixed length LPC trajectory and makes the fixed length feature vector that is fed into a recognizer. The design of the recognizer uses a Feed Forward (FF) and Back Propagation (BP) Network approach tested with variable hidden layers with Transfer functions of hyperbolic tangent and sigmoid to test the signal output for the recognition of the feature vectors of isolated words. The feature vector was normalized and decorrelated by pruning techniques. The training process uses momentum to find the global minima of the error surface avoiding the oscillations in local minima. The goal of the work is to consistently identify a randomly chosen speech pattern from the samples of four different speakers uttering the same phrase 100% of the time and to verify the effectiveness of neural networks as a valid method in pattern recognition.

引用

页码：414 / 419

页数：6

共 50 条

[41] Face Detection based Neural Networks using Robust Skin Color Segmentation
Mohamed, Aamer
Weng, Ying
Jiang, Jianmin
Ipson, Stan
2008 5TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS AND DEVICES, VOLS 1 AND 2, 2008, : 287 - 291
[42] SPEAKER ADAPTIVE TRAINING IN DEEP NEURAL NETWORKS USING SPEAKER DEPENDENT BOTTLENECK FEATURES
Doddipatla, Rama
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5290 - 5294
[43] Deep Neural Networks for joint Voice Activity Detection and Speaker Localization
Vecchiotti, Paolo
Principi, Emanuele
Squartini, Stefano
Piazza, Francesco
2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1567 - 1571
[44] Deep Neural Networks with Batch Speaker Normalization for Intoxicated Speech Detection
Wang, Weiqing
Wu, Haiwei
Li, Ming
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1323 - 1327
[45] TEXT-INDEPENDENT SPEAKER RECOGNITION USING NEURAL NETWORKS
HATTORI, H
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1993, E76D (03) : 345 - 351
[46] Biometric Speaker Recognition Using Neural Networks and Wavelet Transform
Daghbosheh, Mohammed
Hattab, Ezz
Bisher, Ahmad
2011 INTERNATIONAL CONFERENCE ON CIVIL ENGINEERING AND INFORMATION TECHNOLOGY (CEIT 2011), 2011, : 1 - 8
[47] Speaker verification for security systems using artificial neural networks
Vieira, K
Wilamowski, B
Kubichek, R
IECON '97 - PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON INDUSTRIAL ELECTRONICS, CONTROL, AND INSTRUMENTATION, VOLS. 1-4, 1997, : 1102 - 1107
[48] Using neural networks for automatic speaker recognition: A practical approach
Pinto, RGCP
Pinto, HLCP
Caloba, LP
38TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 1078 - 1080
[49] Speaker recognition using dynamic synapse-neural networks
George, S
Dibazar, A
Berger, TW
SECOND JOINT EMBS-BMES CONFERENCE 2002, VOLS 1-3, CONFERENCE PROCEEDINGS: BIOENGINEERING - INTEGRATIVE METHODOLOGIES, NEW TECHNOLOGIES, 2002, : 151 - 152
[50] Speaker identification using multimodal neural networks and wavelet analysis
Almaadeed, Noor
Aggoun, Amar
Amira, Abbes
IET BIOMETRICS, 2015, 4 (01) : 18 - 28

← 1 2 3 4 5 →