Building a speech recognition system with privacy identification information based on Google Voice for social robots

被引:0
|
作者
Pei-Chun Lin
Benjamin Yankson
Vishal Chauhan
Manabu Tsukada
机构
[1] Feng Chia University,Faculty of Information Engineering and Computer Science
[2] University at Albany,CEHC
[3] State University of New York,Graduate School of Information Science and Technology
[4] The University of Tokyo,undefined
来源
关键词
Google AIY Voice Kit; Speech recognition; Personal identification information; Artificial intelligent; Social robots; Robot computing; Smart speaker; Google assistant;
D O I
暂无
中图分类号
学科分类号
摘要
Currently, many smart speakers, even social robots, appear on the market to help people's lives become more convenient. Usually, people use smart speakers to check their daily schedule or control home appliances in their house. Many social robots also include smart speakers. They have the common property of being used in voice control machines. Regardless of where the smart speaker is installed and used, when people start a conversation with voice equipment, a security or privacy risk is exposed. Hence, we want to build a speech recognition (SR) that contains the privacy identification information (PII) system in this paper. We call this the SR-PII system. We used a Google Artificial-Intelligence-Yourself (AIY) Voice Kit released from Google to build a simple, smart dialog speaker and included our SR-PII system. In our experiments, we test SR accuracy and the reliability of privacy settings in three environments (quiet, noise, and playing music). We also examine the cloud response and speaker response times during our experiments. The results show that the speaker response is approximately 3.74 s in the cloud environment and approximately 9.04 s from the speaker. We also showed the response accuracy of the speaker, which successfully prevented personal information with the SR-PII system in three environments. The speaker has a response mean time of approximately 8.86 s with 93% mean accuracy in a quiet room, approximately 9.18 s with 89% mean accuracy in a noisy environment, and approximately 9.62 s with 90% mean accuracy in an environment that plays music. We conclude that the SR-PII system can secure private information and that the most important factor affecting the response speed of the speaker is the network connection status. We hope that people can, through our experiments, have some guidelines in building social robots and installing the SR-PII system to protect users’ personal identification information.
引用
收藏
页码:15060 / 15088
页数:28
相关论文
共 50 条
  • [1] Building a speech recognition system with privacy identification information based on Google Voice for social robots
    Lin, Pei-Chun
    Yankson, Benjamin
    Chauhan, Vishal
    Tsukada, Manabu
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (13): : 15060 - 15088
  • [2] Bringing Contextual Information to Google Speech Recognition
    Aleksic, Petar
    Ghodsi, Mohammadreza
    Michaely, Assaf
    Allauzen, Cyril
    Hall, Keith
    Roark, Brian
    Rybach, David
    Moreno, Pedro
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 468 - 472
  • [3] Interactive Voice Response System Based on Speech Recognition
    Wang Yutai
    Li Bo
    Wang Lei
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON TEST AUTOMATION & INSTRUMENTATION, VOL. 3, 2008, : 1481 - 1484
  • [4] Speaker Authentication System Based on Voice Biometrics and Speech Recognition
    Dovydaitis, Laurynas
    Rasymas, Tomas
    Rudzionis, Vytautas
    [J]. BUSINESS INFORMATION SYSTEMS WORKSHOPS, BIS 2016, 2017, 263 : 79 - 84
  • [5] A Voice Activity Detector Based on Noise Spectrum Adaptation and Discrimination Information for Automatic Speech Recognition System
    Wang, Zhe
    Bi, Guoan
    [J]. PROCEEDINGS FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION, 2014, : 301 - 305
  • [6] An expandable voice user interface as lab assistant based on an improved version of Google's speech recognition
    Vazquez, Maria Fernanda Avila
    Rupp, Nicole
    Ballardt, Larissa
    Opara, Jeannine
    Zuchner, Thole
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01):
  • [7] An expandable voice user interface as lab assistant based on an improved version of Google’s speech recognition
    Maria Fernanda Avila Vazquez
    Nicole Rupp
    Larissa Ballardt
    Jeannine Opara
    Thole Zuchner
    [J]. Scientific Reports, 13 (1)
  • [8] A voice recognition system for speech impaired people
    Plaza-Aguilar, JL
    Báez-López, D
    Guerrero-Ojeda, L
    Asomoza, JR
    [J]. 14TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS, AND COMPUTERS, PROCEEDINGS, 2004, : 162 - 167
  • [9] Visual privacy behaviour recognition for social robots based on an improved generative adversarial network
    Yang, Guanci
    Lin, Jiacheng
    Su, Zhidong
    Li, Yang
    [J]. IET COMPUTER VISION, 2024, 18 (01) : 110 - 123
  • [10] Information retrieval system based on Romanian continuous speech recognition
    Giurgiu, M
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 1104 - 1109