A Robust Speech Communication into Smart Info-Media System

被引:1
|
作者
Miyanaga, Yoshikazu [1 ]
Takahashi, Wataru [1 ]
Yoshizawa, Shingo [2 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo, Hokkaido 0600814, Japan
[2] Kitami Inst Technol, Dept Elect & Elect Engn, Kitami, Hokkaido 0908507, Japan
基金
日本科学技术振兴机构;
关键词
smart info-media system; robust speech recognition; voice activity detection; speech rejection; ASIC; low power consumption design; WORD RECOGNITION; NOISE; COMPENSATION; SPECTRUM;
D O I
10.1587/transfun.E96.A.2074
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces our developed noise robust speech communication techniques and describes its implementation to a smart info-media system, i.e., a small robot. Our designed speech communication system consists of automatic speech detection, recognition, and rejection. By using automatic speech detection and recognition, an observed speech waveform can be recognized without a manual trigger. In addition, using speech rejection, this system only accepts registered speech phrases and rejects any other words. In other words, although an arbitrary input speech waveform can be fed into this system and recognized, the system responds only to the registered speech phrases. The developed noise robust speech processing can reduce various noises in many environments. In addition to the design of noise robust speech recognition, the LSI design of this system has been introduced. By using the design of speech recognition application specific IC (ASIC), we can simultaneously realize low power consumption and real-time processing. This paper describes the LSI architecture of this system and its performances in some field experiments. In terms of current speech recognition accuracy, the system can realize 85-99% under 0-20 dB SNR and echo environments.
引用
收藏
页码:2074 / 2080
页数:7
相关论文
共 50 条
  • [1] Robust Speech Communication and its Embedded Smart Robot System
    Miyanaga, Yoshikazu
    Takahashi, Wataru
    Sun Xihao
    [J]. 2013 20TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP 2013), 2013, : 151 - 154
  • [2] Robust speech recognition system for communication robots in real environments
    Ishi, Carlos Toshinori
    Matsuda, Shigeki
    Kanda, Takayuki
    Jitsuhiro, Takatoshi
    Ishiguro, Hiroshi
    Nakamura, Satoshi
    Hagita, Norihiro
    [J]. 2006 6TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS, VOLS 1 AND 2, 2006, : 340 - +
  • [3] A robust speech recognition system for communication robots in noisy environments
    Ishi, Carlos Toshinori
    Matsuda, Shigeki
    Kanda, Takayuki
    Jitsuhiro, Takatoshi
    Ishiguro, Hiroshi
    Nakamura, Satoshi
    Hagita, Norihiro
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2008, 24 (03) : 759 - 763
  • [4] Communication System Using Smart Devices for Speech-Impaired Children
    Tsunemi, Nobuhide
    Yamaguchi, Toru
    Fujimoto, Yasunari
    Sato-Shimokawara, Eri
    Nitta, Osamu
    [J]. 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 1950 - 1953
  • [5] Robust secure communication protocol for smart healthcare system with FPGA implementation
    Sureshkumar, Venkatasamy
    Amin, Ruhul
    Vijaykumar, V. R.
    Sekar, S. Raja
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 100 : 938 - 951
  • [6] Session-Oriented Communication System for Truly Reliable and Robust Smart Grid
    Neto, Augusto
    Cerqueira, Eduardo
    Souza, Neuman
    Pirmez, Luci
    Gomes, Danielo
    Aguiar, Rui
    [J]. 2011 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2011, : 1094 - 1099
  • [7] Toward Imagined Speech based Smart Communication System: Potential Applications on Metaverse Conditions
    Lee, Seo-Hyun
    Lee, Young-Eun
    Lee, Seong-Whan
    [J]. 10TH INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE (BCI2022), 2022,
  • [8] Recommendations for the implementation of a practical spread spectrum communication system robust against smart jamming
    des Noes, Mathieu
    [J]. 2017 IEEE VEHICULAR NETWORKING CONFERENCE (VNC), 2017, : 209 - 214
  • [9] Modeling AI Trust for 2050: perspectives from media and info-communication experts
    Feher, Katalin
    Vicsek, Lilla
    Deuze, Mark
    [J]. AI & SOCIETY, 2024, 6 (2933-2946)
  • [10] A robust adaptive speech enhancement system
    Hu, X
    Hu, AQ
    Zhao, L
    [J]. PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS & SIGNAL PROCESSING, PROCEEDINGS, VOLS 1 AND 2, 2003, : 814 - 817