Latent semantic language modeling for speech recognition

被引:0
|
作者
Bellegarda, JR [1 ]
机构
[1] Apple Comp Inc, Spoken Language Grp, Cupertino, CA 95014 USA
关键词
statistical language modeling; multi-span integration; n-grams; latent semantic analysis; speech recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Statistical language models used in large vocabulary speech recognition must properly capture the various constraints, both local and global, present in the language. While n-gram modeling readily accounts for the former, it has been more difficult to handle the latter, and in particular long-term semantic dependencies, within a suitable data-driven formalism. This paper focuses on the use of latent semantic analysis (LSA) for this purpose. The LSA paradigm automatically uncovers meaningful associations in the language based on word-document co-occurrences in a given corpus. The resulting semantic knowledge is encapsulated in a (continuous) vector space of comparatively low dimension, where are mapped all (discrete) words and documents considered. Comparison in this space is done through a simple similarity measure, so familiar clustering techniques can be applied. This leads to a powerful framework for both automatic semantic classification and semantic language modeling. In the latter case, the large-span nature of LSA models makes them particularly well suited to complement conventional n-grams. This synergy can be harnessed through an integrative formulation, in which latent semantic knowledge is exploited to judiciously adjust the usual n-gram probability. The paper concludes with a discussion of intrinsic trade-offs, such as the influence of training data selection on the resulting performance enhancement.
引用
收藏
页码:73 / 103
页数:31
相关论文
共 50 条
  • [1] Latent semantic information in maximum entropy language models for conversational speech recognition
    Deng, YG
    Khudanpur, S
    [J]. HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2003, : 56 - 63
  • [2] LATENT DIRICHLIET LANGUAGE MODEL FOR SPEECH RECOGNITION
    Chien, Jen-Tzung
    Chueh, Chuang-Hua
    [J]. 2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 201 - 204
  • [3] Statistical language modeling with semantic classes for large vocabulary speech recognition in embedded systems
    Oria, Daniela
    Olsen, Jesper
    [J]. PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2006, : 496 - +
  • [4] RELEVANCE LANGUAGE MODELING FOR SPEECH RECOGNITION
    Chen, Kuan-Yu
    Chen, Berlin
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5568 - 5571
  • [5] Augmenting speech recognition and understanding using latent semantic analysis
    McCauley, L
    [J]. ICWN'04 & PCC'04, VOLS, 1 AND 2, PROCEEDINGS, 2004, : 871 - 877
  • [6] SEMANTIC LANGUAGE MODELS FOR AUTO MATIC SPEECH RECOGNITION
    Bayer, Ali Orkan
    Riccardi, Giuseppe
    [J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 7 - 12
  • [7] Exploiting latent semantic information in statistical language modeling
    Bellegarda, JR
    [J]. PROCEEDINGS OF THE IEEE, 2000, 88 (08) : 1279 - 1296
  • [8] Just-in-time latent semantic adaptation on language model for Chinese speech recognition using web data
    Gao, Qin
    Lin, Xiaojun
    Wu, Xihong
    [J]. 2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 50 - +
  • [9] Joint acoustic and language modeling for speech recognition
    Chien, Jen-Tzung
    Chueh, Chuang-Hua
    [J]. SPEECH COMMUNICATION, 2010, 52 (03) : 223 - 235
  • [10] CONTINUOUS TOPIC LANGUAGE MODELING FOR SPEECH RECOGNITION
    Chueh, Chuang-Hua
    Chien, Jen-Tzung
    [J]. 2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 193 - 196