A Near Real-Time Automatic Speaker Recognition Architecture for Voice-Based User Interface

被引:32
|
作者
Dhakal, Parashar [1 ]
Damacharla, Praveen [2 ]
Javaid, Ahmad Y. [1 ]
Devabhaktuni, Vijay [2 ]
机构
[1] Univ Toledo, Elect Engn & Comp Sci Dept, Toledo, OH 43606 USA
[2] Purdue Univ Northwest, ECE Dept, Hammond, IN 46323 USA
来源
关键词
classifiers; convolution neural network; architecture; feature extraction; machine learning; random forest; speaker recognition; voice interface;
D O I
10.3390/make1010031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a novel pipelined near real-time speaker recognition architecture that enhances the performance of speaker recognition by exploiting the advantages of hybrid feature extraction techniques that contain the features of Gabor Filter (GF), Convolution Neural Networks (CNN), and statistical parameters as a single matrix set. This architecture has been developed to enable secure access to a voice-based user interface (UI) by enabling speaker-based authentication and integration with an existing Natural Language Processing (NLP) system. Gaining secure access to existing NLP systems also served as motivation. Initially, we identify challenges related to real-time speaker recognition and highlight the recent research in the field. Further, we analyze the functional requirements of a speaker recognition system and introduce the mechanisms that can address these requirements through our novel architecture. Subsequently, the paper discusses the effect of different techniques such as CNN, GF, and statistical parameters in feature extraction. For the classification, standard classifiers such as Support Vector Machine (SVM), Random Forest (RF) and Deep Neural Network (DNN) are investigated. To verify the validity and effectiveness of the proposed architecture, we compared different parameters including accuracy, sensitivity, and specificity with the standard AlexNet architecture.
引用
收藏
页码:504 / 520
页数:17
相关论文
共 50 条
  • [41] Real-Time Speaker Independent Isolated Word Recognition on Banana Pi
    Disken, Gokay
    Saribulut, Lutfu
    Tufekci, Zekeriya
    Cevik, Ulus
    PROCEEDINGS OF THE 2018 10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI), 2018,
  • [42] Real-time hand shape recognition for human interface
    Horimoto, S
    Arita, D
    Taniguchi, R
    12TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS, 2003, : 20 - 25
  • [43] A flexible and efficient hardware architecture for real-time face recognition based on eigenface
    Ngo, HT
    Gottumukkal, R
    Asari, VK
    IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, PROCEEDINGS: NEW FRONTIERS IN VLSI DESIGN, 2005, : 280 - 281
  • [44] Robust and Real-time Traffic Light Recognition Based on Hierarchical Vision Architecture
    Chen, Quan
    Shi, Zhenwei
    Zou, Zhengxia
    2014 7TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP 2014), 2014, : 114 - 119
  • [45] Multi-lane architecture for eigenface based real-time face recognition
    Gottumukkal, Rajkiran
    Ngo, Hau T.
    Asari, Vijayan K.
    MICROPROCESSORS AND MICROSYSTEMS, 2006, 30 (04) : 216 - 224
  • [46] Real-Time Basic Principles Nuclear Reactor Simulator Based on Client-Server Network Architecture with WebBrowser as User Interface
    Juszczuk, Dymitr
    Tarnawski, Jaroslaw
    Karla, Tomasz
    Duzinkiewicz, Kazimierz
    TRENDS IN ADVANCED INTELLIGENT CONTROL, OPTIMIZATION AND AUTOMATION, 2017, 577 : 344 - 353
  • [47] BIOSTORIES Dynamic Multimedia Interfaces based on Automatic Real-time User Emotion Assessment
    Vinhas, Vasco
    Oliveira, Eugenio
    Reis, Luis Paulo
    ICEIS 2010: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 5: HUMAN-COMPUTER INTERACTION, 2010, : 21 - 29
  • [48] AN EXPLORATION OF USER INTERFACE DESIGNS FOR REAL-TIME PANORAMIC PHOTOGRAPHY
    Baudisch, Patrick
    Tan, Desney
    Steedly, Drew
    Rudolph, Eric
    Uyttendaele, Matt
    Pal, Chris
    Szeliski, Richard
    AUSTRALASIAN JOURNAL OF INFORMATION SYSTEMS, 2006, 13 (02) : 151 - 165
  • [49] IMPROVING THE USER INTERFACE OF REAL-TIME MODELS OF OFFSITE CONSEQUENCES
    JACKSON, RG
    RADIATION PROTECTION - THEORY AND PRACTICE, 1989, : 157 - 160
  • [50] VMEBUS PC COMBINES USER INTERFACE WITH REAL-TIME MULTITASKING
    WILLIAMS, T
    COMPUTER DESIGN, 1990, 29 (07): : 120 - 120