Intentional Voice Command Detection for Completely Hands-Free Speech Interface in Home Environments

被引:0
|
作者
Obuchi, Yasunari [1 ]
Togami, Masahito [1 ]
Sumiyoshi, Takashi [1 ]
机构
[1] Hitachi Ltd, Cent Res Lab, Tokyo, Japan
关键词
IVCD; VAD; speech/non-speech discrimination; GMM; SVM; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a new class of speech processing, called Intentional Voice Command Detection (IVCD). It is necessary to reject not only noises but also unintended voices to achieve completely hands-free speech interface. Conventional VAD framework is not sufficient for such purpose, and we discuss how we should define IVCD and how we can realize it. We investigate implementation of IVCD from the viewpoint of feature extraction and classification, and show that the combination of various features and SVM can achieve IVCD accuracy of 93.2% for a large-scale audio database in real home environments.
引用
收藏
页码:119 / 122
页数:4
相关论文
共 50 条
  • [1] Intentional Voice Command Detection for Trigger-Free Speech Interface
    Obuchi, Yasunari
    Sumiyoshi, Takashi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09) : 2440 - 2450
  • [2] Achieving a hands-free computer interface using voice recognition and speech synthesis
    Evans, JR
    Tjoland, WA
    Allred, LG
    [J]. IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2000, 15 (01) : 14 - 16
  • [3] A noise robust speech activity detection algorithm for voice activated hands-free
    Bagur, H
    [J]. Seventh IASTED International Conference on Signal and Image Processing, 2005, : 1 - 5
  • [4] The Hands-Free speech in post laryngectomy voice rehabilitation with tracheosophageal voice
    Serra, Agostino
    Grillo, Calogero
    Nane, Sebastiano
    Ferlito, Salvatore
    Martines, Anna Maria
    Grillo, Caterina
    Cocuzza, Salvatore
    [J]. ACTA MEDICA MEDITERRANEA, 2010, 26 (02): : 97 - 100
  • [5] HANDS-FREE SPEECH-SOUND INTERACTIONS AT HOME
    Milhorat, P.
    Istrate, D.
    Boudy, J.
    Chollet, G.
    [J]. 2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 1678 - 1682
  • [6] A robust speech detection algorithm for speech activated hands-free applications
    Wu, D
    Tanaka, M
    Chen, R
    Olorenshaw, L
    Amador, M
    Menendez-Pidal, X
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 2407 - 2410
  • [7] Usability of a Hands-Free Voice Input Interface for Ecological Momentary Assessment
    Adaimi, Rebecca
    Ho, Ka Tai
    Thomaz, Edison
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2020,
  • [8] Hands-free Voice Communication with TV
    Papp, Istvan I.
    Saric, Zoran M.
    Teslic, Nikola Dj
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (02) : 606 - 614
  • [9] Compliance, quality of life and quantitative voice quality aspects of hands-free speech
    Op de Coul, BMR
    Ackerstaff, AH
    Van As-Brooks, CJ
    Van den Hoogen, FJA
    Meeuwis, CA
    Manni, JJ
    Hilgers, FJM
    [J]. ACTA OTO-LARYNGOLOGICA, 2005, 125 (06) : 629 - 637
  • [10] Speech enhancement for hands-free terminals
    Grbic, N
    Nordholm, S
    Johansson, A
    [J]. ISPA 2001: PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2001, : 435 - 440