A Word-Based Naive Bayes Classifier for Confidence Estimation in Speech Recognition

被引:25
|
作者
Sanchis, Alberto [1 ]
Juan, Alfons [1 ]
Vidal, Enrique [1 ]
机构
[1] Univ Politecn Valencia, Inst Tecnol Informat, Dept Sistemas Informat & Computac, E-46022 Valencia, Spain
关键词
Automatic speech recognition (ASR); confidence measures; naive Bayes; smoothing; posterior probabilities; word graphs; MODELS;
D O I
10.1109/TASL.2011.2162403
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Confidence estimation has been largely used in speech recognition to detect words in the recognized sentence that have been likely misrecognized. Confidence estimation can be seen as a conventional pattern classification problem in which a set of features is obtained for each hypothesized word in order to classify it as either correct or incorrect. We propose a smoothed naive Bayes classification model to profitably combine these features. The model itself is a combination of word-dependent (specific) and word-independent (generalized) naive Bayes models. As in statistical language modeling, the purpose of the generalized model is to smooth the (class posterior) estimates given by the specific models. Our classification model is empirically compared with confidence estimation based on posterior probabilities computed on word graphs. Empirical results clearly show that the good performance of word graph-based posterior probabilities can be improved by using the naive Bayes combination of features.
引用
收藏
页码:565 / 574
页数:10
相关论文
共 50 条
  • [1] Word-based confidence measures as a guide for stack search in speech recognition
    Neti, CV
    Roukos, S
    Eide, E
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 883 - 886
  • [2] Word-Based Bantu Language Identification using Naive Bayes
    Okgetheng, Boago
    Budu, Emmanuella A. W.
    [J]. 2022 IST-AFRICA CONFERENCE, 2022,
  • [3] Sentiment analysis with word-based Urdu speech recognition
    Shaik, Riyaz
    Venkatramaphanikumar, S.
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (5) : 2511 - 2531
  • [4] Fusion Architectures for Word-based Audiovisual Speech Recognition
    Wand, Michael
    Schmidhuber, Jurgen
    [J]. INTERSPEECH 2020, 2020, : 3491 - 3495
  • [5] Sentiment analysis with word-based Urdu speech recognition
    Riyaz Shaik
    S. Venkatramaphanikumar
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 2511 - 2531
  • [6] A Distributed Chinese Naive Bayes Classifier Based on Word Embedding
    Feng, Mengke
    Wu, Guoshi
    [J]. PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND COMPUTING TECHNOLOGY, 2016, 60 : 1121 - 1127
  • [7] ON NAIVE BAYES IN SPEECH RECOGNITION
    Toth, Laszlo
    Kocsor, Andras
    Csirik, Janos
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2005, 15 (02) : 287 - 294
  • [8] Emotion Recognition Using Prosodic and Spectral Features of Speech and Naive Bayes Classifier
    Khan, Atreyee
    Roy, Uttam Kumar
    [J]. 2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 1017 - 1021
  • [9] A Smoothed Naive Bayes-Based Classifier for Activity Recognition
    Sarkar, A. M. Jehad
    Lee, Young-Koo
    Lee, Sungyoung
    [J]. IETE TECHNICAL REVIEW, 2010, 27 (02) : 107 - 119
  • [10] Bayes-based confidence measure in speech recognition
    Yoma, NB
    Carrasco, J
    Molina, C
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (11) : 745 - 748