A Near Real-Time Automatic Speaker Recognition Architecture for Voice-Based User Interface

被引:32
|
作者
Dhakal, Parashar [1 ]
Damacharla, Praveen [2 ]
Javaid, Ahmad Y. [1 ]
Devabhaktuni, Vijay [2 ]
机构
[1] Univ Toledo, Elect Engn & Comp Sci Dept, Toledo, OH 43606 USA
[2] Purdue Univ Northwest, ECE Dept, Hammond, IN 46323 USA
来源
关键词
classifiers; convolution neural network; architecture; feature extraction; machine learning; random forest; speaker recognition; voice interface;
D O I
10.3390/make1010031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a novel pipelined near real-time speaker recognition architecture that enhances the performance of speaker recognition by exploiting the advantages of hybrid feature extraction techniques that contain the features of Gabor Filter (GF), Convolution Neural Networks (CNN), and statistical parameters as a single matrix set. This architecture has been developed to enable secure access to a voice-based user interface (UI) by enabling speaker-based authentication and integration with an existing Natural Language Processing (NLP) system. Gaining secure access to existing NLP systems also served as motivation. Initially, we identify challenges related to real-time speaker recognition and highlight the recent research in the field. Further, we analyze the functional requirements of a speaker recognition system and introduce the mechanisms that can address these requirements through our novel architecture. Subsequently, the paper discusses the effect of different techniques such as CNN, GF, and statistical parameters in feature extraction. For the classification, standard classifiers such as Support Vector Machine (SVM), Random Forest (RF) and Deep Neural Network (DNN) are investigated. To verify the validity and effectiveness of the proposed architecture, we compared different parameters including accuracy, sensitivity, and specificity with the standard AlexNet architecture.
引用
收藏
页码:504 / 520
页数:17
相关论文
共 50 条
  • [1] Real-Time Recognition Method of Counting Fingers for Natural User Interface
    Lee, Doyeob
    Shin, Dongkyoo
    Shin, Dongil
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2016, 10 (05): : 2363 - 2374
  • [2] A Real-time Accompaniment System Based on Sung Voice Recognition
    Luo, Li
    Lu, Peng-Fei
    Wang, Zeng-Fu
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 531 - 534
  • [3] Web-Based Real-Time Gesture Recognition with Voice
    Pralhad, Ghadekar Premanand
    Abhishek, S.
    Kachare, Tejas
    Deshpande, Om
    Chounde, Rushikesh
    Tapadiya, Prachi
    INFORMATION, COMMUNICATION AND COMPUTING TECHNOLOGY (ICICCT 2021), 2021, 1417 : 119 - 131
  • [4] Web-Based Real-Time Gesture Recognition with Voice
    Pralhad, Ghadekar Premanand
    Abhishek, S.
    Kachare, Tejas
    Deshpande, Om
    Chounde, Rushikesh
    Tapadiya, Prachi
    Communications in Computer and Information Science, 2021, 1417 CCIS : 119 - 131
  • [5] An Automatic Real Time Speech-Speaker Recognition System: A Real Time Approach
    Kakade, Mandar Nitin
    Salunke, D. B.
    ICCCE 2019: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND CYBER-PHYSICAL ENGINEERING, 2020, 570 : 151 - 158
  • [6] A Graphical User Interface for Real-time Display
    Long Zhongjie
    Nagamune, Kouki
    Xu Xiaoli
    ISTAI 2016: PROCEEDINGS OF THE SIXTH INTERNATIONAL SYMPOSIUM ON TEST AUTOMATION & INSTRUMENTATION, 2016, : 83 - 86
  • [7] Over Head Line real-time tracking for automatic inspection or user interface enhancement
    Gomes-Mota, Joao
    Gusmao, Tiago
    2010 1ST INTERNATIONAL CONFERENCE ON APPLIED ROBOTICS FOR THE POWER INDUSTRY, 2010,
  • [8] Handy eyes: A real-time vision-based user interface
    Akhtar, J
    Khan, MH
    Akram, W
    Hashmi, BM
    Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, Vols 1and 2, 2004, : 666 - 671
  • [9] Presentation of real-time system for automatic speaker identification and verification
    David, P
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IV, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING, 2003, : 372 - 376
  • [10] A Real-Time Recognition System for User Characteristics Based on Deep Learning
    Nunez Fernandez, Dennis
    PROCEEDINGS OF THE 2018 IEEE 25TH INTERNATIONAL CONFERENCE ON ELECTRONICS, ELECTRICAL ENGINEERING AND COMPUTING (INTERCON 2018), 2018,