Hybrid language models for out of vocabulary word detection in large vocabulary conversational speech recognition

Cited by: 0

Authors:

Yazgan, A [1]

Saraclar, M [1]

Affiliation:

[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA

Source:

2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING | 2004

Keywords:

DOI:

Not available

Chinese Library Classification:

O42 [Acoustics];

Discipline classification codes:

070206; 082403;

Abstract:

In this paper, we propose a method for out-of-vocabulary (OOV) word detection, taking a step toward open-vocabulary automatic speech recognition. The proposed method uses a hybrid language model that combines words with subword units such as phones or syllables. We describe a detection algorithm based on the posterior count of the OOV words given the hybrid model, and compare it with one that uses the posterior probability of the best word string given a conventional word-only model. Experimental results on the Switchboard corpus are presented for different vocabulary sizes. The new method yields a gain of over 10% in OOV word detection. In addition, a modest number of the OOV word pronunciations are found correctly.

Pages: 745-748

Page count: 4
