Japanese Personal Name and Location Search for Spoken Utterances by Using Hierarchical Language Model of Speech Recognition

被引:0
|
作者
Hu, Xinhui [1 ]
Wu, Youzheng [1 ]
Kashioka, Hideki [1 ]
机构
[1] Natl Inst Informat & Commun Technol, Seika, Kyoto 6190228, Japan
关键词
spoken document retrieval; OOV; hierarchical language model; confusion network;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a new scheme for searching Japanese personal and location names, the two main sources of out-of-vocabulary (OOV) words, in spoken documents. We use a hierarchical language model for recognition and indexing, which is composed of two independently trained layers. Retrieval experiments performed using a Japanese spontaneous speech corpus reveal that the retrieval performance for OOV words is significantly improved, while that for in-vocabulary (IV) words is not greatly influenced. Further, the retrieval performance of using confusion network is better than the 1-best of recognition results, particularly for OOV words.
引用
收藏
页码:193 / 198
页数:6
相关论文
共 50 条
  • [1] Using speech recognition for an automated test of spoken Japanese
    Suzuki, Masanori
    Harada, Yasunari
    [J]. PACLIC 19: The 19th Pacific Asia Conference on Language, Information and Computation, 2005, : 317 - 323
  • [2] INTEGRATION OF SPEECH RECOGNITION AND LANGUAGE PROCESSING IN A JAPANESE TO ENGLISH SPOKEN LANGUAGE TRANSLATION SYSTEM
    MORIMOTO, T
    SHIKANO, K
    KOGURE, K
    IIDA, H
    KUREMATSU, A
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07): : 1889 - 1896
  • [3] Recognition of target domain Japanese speech using language model replacement
    Mori, Daiki
    Ohta, Kengo
    Nishimura, Ryota
    Ogawa, Atsunori
    Kitaoka, Norihide
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2024, 2024 (01):
  • [4] Evaluating Spoken Language Model Based on Filler Prediction Model in Speech Recognition
    Ohta, Kengo
    Tsuchiya, Masatoshi
    Nakagawa, Seiichi
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1558 - +
  • [5] A Discriminative Hierarchical PLDA-Based Model for Spoken Language Recognition
    Ferrer, Luciana
    Castan, Diego
    McLaren, Mitchell
    Lawson, Aaron
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2396 - 2410
  • [6] Efficient Language Model Adaptation for Automatic Speech Recognition of Spoken Translations
    Pelemans, Joris
    Vanallemeersch, Tom
    Demuynck, Kris
    Van Hamme, Hugo
    Wambacq, Patrick
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2262 - 2266
  • [7] Spoken language identification using large vocabulary speech recognition.
    Hieronymus, JL
    Kadambe, S
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1780 - 1783
  • [8] SPEECH RECOGNITION OF FOREIGN OUT-OF-VOCABULARY WORDS USING A HIERARCHICAL LANGUAGE MODEL
    Yamamoto, Hirofumi
    Kikui, Genichiro
    Nakamura, Satoshi
    Sagisaka, Yoshinori
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1870 - +
  • [9] Speech Recognition and Spoken Language Understanding for Mobile Personal Assistants: a Case Study of "Shabette Concier"
    Tsujino, Kosuke
    Nakashima, Yusuke
    Iizuka, Shinya
    Isoda, Yoshinori
    [J]. 2013 IEEE 14TH INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2013), VOL 2, 2013, : 225 - 228
  • [10] Training a language model using webdata for large vocabulary Japanese spontaneous speech recognition
    Masumura, Ryo
    Hahm, Seongjun
    Ito, Akinori
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1476 - 1479