Japanese Personal Name and Location Search for Spoken Utterances by Using Hierarchical Language Model of Speech Recognition

Cited by: 0

Authors:

Hu, Xinhui [1]

Wu, Youzheng [1]

Kashioka, Hideki [1]

Affiliation:

[1] Natl Inst Informat & Commun Technol, Seika, Kyoto 6190228, Japan

Source:

RECENT ADVANCES OF ASIAN LANGUAGE PROCESSING TECHNOLOGIES | 2008

Keywords:

spoken document retrieval; OOV; hierarchical language model; confusion network

DOI:

Not available

Chinese Library Classification:

TP [Automation Technology, Computer Technology]

Subject classification code:

0812

Abstract:

We propose a new scheme for searching for Japanese personal and location names, the two main sources of out-of-vocabulary (OOV) words, in spoken documents. Recognition and indexing use a hierarchical language model composed of two independently trained layers. Retrieval experiments on a Japanese spontaneous speech corpus show that retrieval performance for OOV words improves significantly, while that for in-vocabulary (IV) words is not greatly affected. Further, retrieval using confusion networks outperforms retrieval using the 1-best recognition results, particularly for OOV words.
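
As a rough illustration of why indexing a confusion network can retrieve more than indexing only the 1-best hypothesis, the following is a minimal Python sketch. It is not the authors' implementation: the toy network, the romanized words, the posterior values, and the function names `one_best`, `build_index`, and `search` are all hypothetical, and the hierarchical language model itself is not modeled here.

```python
from typing import Dict, List, Tuple

# Assumption: a confusion network is represented as a list of slots,
# each slot mapping a candidate word to its posterior probability.
ConfusionNetwork = List[Dict[str, float]]


def one_best(cn: ConfusionNetwork) -> List[str]:
    """Return the highest-posterior word of every slot (the 1-best path)."""
    return [max(slot, key=slot.get) for slot in cn]


def build_index(
    cn: ConfusionNetwork, min_posterior: float = 0.0
) -> Dict[str, List[Tuple[int, float]]]:
    """Index every candidate word with its slot position and posterior."""
    index: Dict[str, List[Tuple[int, float]]] = {}
    for position, slot in enumerate(cn):
        for word, posterior in slot.items():
            if posterior >= min_posterior:
                index.setdefault(word, []).append((position, posterior))
    return index


def search(
    index: Dict[str, List[Tuple[int, float]]], term: str
) -> List[Tuple[int, float]]:
    """Return all (slot position, posterior) hits for a query term."""
    return index.get(term, [])


# Hypothetical toy network for one utterance; values are invented.
cn: ConfusionNetwork = [
    {"watashi": 0.9, "watashi-wa": 0.1},
    {"kyoto": 0.55, "kyodo": 0.45},
    {"ni": 0.8, "e": 0.2},
]

best = one_best(cn)
index = build_index(cn)
hits = search(index, "kyodo")
```

In this toy case the query term `kyodo` does not appear on the 1-best path but is still found through the confusion-network index, together with a posterior that could be used for ranking or thresholding.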


Pages: 193-198

Number of pages: 6
