Robust speaker detection using Neural Networks

Cited by: 0
Authors
Shell, John R. [1 ]
Affiliation
[1] So Illinois Univ, Dept Elect & Comp Engn, Carbondale, IL 62901 USA
Keywords
Neural Networks; speech recognition; modeling;
DOI
Not available
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
This paper uses neural networks to distinguish speech patterns. A standard Linear Predictive Coding (LPC) cepstrum coder serves as the feature extractor, converting the incoming speech signal, captured through a Matlab interface, into the LPC cepstrum feature space. A neural network maps each variable-length LPC trajectory of an isolated word to a fixed-length trajectory, producing the fixed-length feature vector that is fed into a recognizer. The recognizer is a feed-forward (FF) network trained with back propagation (BP); it was tested with varying hidden-layer configurations and with hyperbolic tangent and sigmoid transfer functions to evaluate recognition of the isolated-word feature vectors. The feature vector was normalized and decorrelated by pruning techniques. Training uses momentum to help the search reach the global minimum of the error surface and to avoid oscillating in local minima. The goal of the work is to identify, 100% of the time, a randomly chosen speech pattern from samples of four different speakers uttering the same phrase, and to verify the effectiveness of neural networks as a valid method for pattern recognition.
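The training scheme named in the abstract (a feed-forward network with a hyperbolic tangent hidden layer and a sigmoid output, trained by back propagation with momentum) can be illustrated with a minimal sketch. This is not the author's implementation: the function name `train`, the layer size, the learning rate, the momentum value, and the four toy two-dimensional samples (standing in for fixed-length LPC cepstrum feature vectors) are all assumptions made for illustration.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def train(samples, n_hidden=4, lr=0.1, momentum=0.9, epochs=2000, seed=0):
    """Batch back propagation with momentum on a 1-hidden-layer network.

    Hypothetical helper: returns (predict, loss_history).
    """
    rng = random.Random(seed)
    n_in = len(samples[0][0])
    # Each weight row carries a trailing bias weight.
    w1 = [[rng.uniform(-1, 1) for _ in range(n_in + 1)] for _ in range(n_hidden)]
    w2 = [rng.uniform(-1, 1) for _ in range(n_hidden + 1)]
    # Velocity terms (one per weight) hold the momentum state.
    v1 = [[0.0] * (n_in + 1) for _ in range(n_hidden)]
    v2 = [0.0] * (n_hidden + 1)

    def forward(x):
        xb = list(x) + [1.0]  # input plus bias
        h = [math.tanh(sum(w * xi for w, xi in zip(row, xb))) for row in w1]
        hb = h + [1.0]        # hidden activations plus bias
        y = sigmoid(sum(w * hi for w, hi in zip(w2, hb)))
        return xb, hb, y

    history = []
    for _ in range(epochs):
        g1 = [[0.0] * (n_in + 1) for _ in range(n_hidden)]
        g2 = [0.0] * (n_hidden + 1)
        loss = 0.0
        for x, target in samples:
            xb, hb, y = forward(x)
            loss += 0.5 * (y - target) ** 2
            d_out = (y - target) * y * (1.0 - y)  # sigmoid derivative
            for j in range(n_hidden + 1):
                g2[j] += d_out * hb[j]
            for j in range(n_hidden):
                d_hid = d_out * w2[j] * (1.0 - hb[j] ** 2)  # tanh derivative
                for i in range(n_in + 1):
                    g1[j][i] += d_hid * xb[i]
        # Momentum update: velocity = momentum * velocity - lr * gradient.
        for j in range(n_hidden + 1):
            v2[j] = momentum * v2[j] - lr * g2[j]
            w2[j] += v2[j]
        for j in range(n_hidden):
            for i in range(n_in + 1):
                v1[j][i] = momentum * v1[j][i] - lr * g1[j][i]
                w1[j][i] += v1[j][i]
        history.append(loss)

    return (lambda x: forward(x)[2]), history


# Toy, linearly separable data (assumed; not from the paper).
toy_samples = [([0.0, 0.0], 0.0), ([0.0, 1.0], 1.0),
               ([1.0, 0.0], 1.0), ([1.0, 1.0], 1.0)]
predict, history = train(toy_samples)
print("first-epoch loss:", history[0], "last-epoch loss:", history[-1])
```

The velocity term carries a fraction of the previous step into the next one, which is what damps oscillation on the error surface. Momentum does not by itself guarantee that training reaches a global minimum.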
Pages: 414-419
Page count: 6