Speaker Recognition using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions

Cited by: 0
Authors
Wang, Mingshan [1]
Sirlapu, Tejaswini [1]
Kwasniewska, Alicja [2]
Szankin, Maciej [1]
Bartscherer, Marko [1]
Nicolas, Rey [1]
Affiliations
[1] Intel Corp, San Diego, CA 92131 USA
[2] Gdansk Univ Technol, Fac Elect Telecommun & Informat, Gdansk, Poland
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
With technological advances in the smart home sector, voice control and automation have become key components that can make a real difference in people's lives. The voice recognition technology market continues to evolve rapidly, as almost all smart home devices now provide speaker recognition capability. However, most of them rely on cloud-based solutions or use very deep neural networks for the speaker recognition task, and such models are not suitable for running on smart home devices. In this paper, we compare relatively small Convolutional Neural Networks (CNNs) and evaluate the effectiveness of speaker recognition using these models on edge devices. In addition, we apply transfer learning to deal with the problem of limited training data. By developing a solution suitable for running inference locally on edge devices, we avoid well-known cloud computing issues such as data privacy and network latency. The preliminary results show that the chosen model carries the benefits of computer vision techniques over to speaker classification by applying a CNN to spectrograms, achieving precision and recall of about 84% in less than 60 ms on a mobile device with an Atom Cherry Trail processor.
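The abstract mentions feeding spectrograms to a CNN but gives no preprocessing details. As a rough illustration only, the following sketch shows one common way to turn a waveform into a log-magnitude spectrogram with NumPy. The function name `log_spectrogram`, the frame length, hop size, Hann window, and the synthetic 440 Hz test tone are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def log_spectrogram(signal, frame_len=256, hop=128, eps=1e-10):
    """Return a log-magnitude spectrogram of shape (n_frames, frame_len // 2 + 1).

    Hypothetical helper: the frame length, hop size, and window choice
    are illustrative assumptions, not the settings used in the paper.
    """
    signal = np.asarray(signal, dtype=np.float64)
    if signal.size < frame_len:
        raise ValueError("signal is shorter than one frame")

    # Number of complete frames that fit in the signal.
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hanning(frame_len)

    # Slice the signal into overlapping frames and apply the window.
    frames = np.stack([
        signal[i * hop: i * hop + frame_len] * window
        for i in range(n_frames)
    ])

    # Magnitude of the one-sided FFT per frame, then log compression.
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    return np.log(magnitude + eps)


# Synthetic input: one second of a 440 Hz tone sampled at 16 kHz.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440 * t)

spec = log_spectrogram(tone)
print(spec.shape)  # (time frames, frequency bins)
```

The resulting 2-D array can be treated like a single-channel image, which is what allows an image-style CNN to be applied to audio; how the authors actually scaled or cropped their spectrograms is not stated in the abstract.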
Pages: 139-145
Page count: 7