A Near Real-Time Automatic Speaker Recognition Architecture for Voice-Based User Interface

被引:32
|
作者
Dhakal, Parashar [1 ]
Damacharla, Praveen [2 ]
Javaid, Ahmad Y. [1 ]
Devabhaktuni, Vijay [2 ]
机构
[1] Univ Toledo, Elect Engn & Comp Sci Dept, Toledo, OH 43606 USA
[2] Purdue Univ Northwest, ECE Dept, Hammond, IN 46323 USA
来源
关键词
classifiers; convolution neural network; architecture; feature extraction; machine learning; random forest; speaker recognition; voice interface;
D O I
10.3390/make1010031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a novel pipelined near real-time speaker recognition architecture that enhances the performance of speaker recognition by exploiting the advantages of hybrid feature extraction techniques that contain the features of Gabor Filter (GF), Convolution Neural Networks (CNN), and statistical parameters as a single matrix set. This architecture has been developed to enable secure access to a voice-based user interface (UI) by enabling speaker-based authentication and integration with an existing Natural Language Processing (NLP) system. Gaining secure access to existing NLP systems also served as motivation. Initially, we identify challenges related to real-time speaker recognition and highlight the recent research in the field. Further, we analyze the functional requirements of a speaker recognition system and introduce the mechanisms that can address these requirements through our novel architecture. Subsequently, the paper discusses the effect of different techniques such as CNN, GF, and statistical parameters in feature extraction. For the classification, standard classifiers such as Support Vector Machine (SVM), Random Forest (RF) and Deep Neural Network (DNN) are investigated. To verify the validity and effectiveness of the proposed architecture, we compared different parameters including accuracy, sensitivity, and specificity with the standard AlexNet architecture.
引用
收藏
页码:504 / 520
页数:17
相关论文
共 50 条
  • [31] Method of real-time automatic recognition for animal posture
    Zhuang Fei
    Wu Kaihua
    Zhu Feng
    Ye Ting
    INTERNATIONAL SYMPOSIUM ON PHOTOELECTRONIC DETECTION AND IMAGING 2007: RELATED TECHNOLOGIES AND APPLICATIONS, 2008, 6625
  • [32] A real-time optical automatic target recognition system
    Chen, HX
    Nan, JS
    Li, XS
    Wei, HG
    OPTICAL PATTERN RECOGNITION XV, 2004, 5437 : 293 - 300
  • [33] AUTOMATIC RECOGNITION OF DEFECTS IN INDUSTRIAL REAL-TIME RADIOGRAPHY
    GAYER, A
    SAYA, A
    NDT INTERNATIONAL, 1988, 21 (06): : 456 - 456
  • [34] AUTOMATIC RECOGNITION OF WELDING DEFECTS IN REAL-TIME RADIOGRAPHY
    GAYER, A
    SAYA, A
    SHILOH, A
    NDT INTERNATIONAL, 1990, 23 (03): : 131 - 136
  • [35] The impacts on user performance and satisfaction of a voice-based front-end interface for a standard software tool
    Molnar, KK
    Kletke, MG
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 1996, 45 (03) : 287 - 303
  • [36] Lightweight Network Architecture for Real-Time Action Recognition
    Kozlov, Alexander
    Andronov, Vadim
    Gritsenko, Yana
    PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 2074 - 2080
  • [37] Real-time Functional Architecture of Visual Word Recognition
    Whiting, Caroline
    Shtyrov, Yury
    Marslen-Wilson, William
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2015, 27 (02) : 246 - 265
  • [38] Voice recognition: Software solutions in real-time ATC workstations
    Lechner, A
    Mattson, P
    Ecker, K
    IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2002, 17 (11) : 11 - 16
  • [39] Distributed feeder automation based on automatic recognition of real-time feeder topology
    Gao, Mengyou
    Xu, Bingyin
    Fan, Kaijun
    Zhang, Xinhui
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2015, 39 (09): : 127 - 131
  • [40] A Robust Real-Time Automatic License Plate Recognition Based on the YOLO Detector
    Laroca, Rayson
    Severo, Evair
    Zanlorensi, Luiz A.
    Oliveira, Luiz S.
    Goncalves, Gabriel Resende
    Schwartz, William Robson
    Menotti, David
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,