Combination of latent semantic analysis based language models for meeting recognition

被引:0
|
作者
Puscher, Michael [1 ,2 ]
Huang, Yan [3 ]
Cetin, Ozgur [3 ]
机构
[1] Telecommun Res Ctr Vienna, Vienna, Austria
[2] Graz Univ Technol, Speech & Signal Proc Lab, Graz, Austria
[3] Int Comp Sci Inst, Berkeley, CA 94704 USA
关键词
speech recognition; latent semantic indexing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Latent Semantic Analysis (LSA) defines a semantic similarity space using a training corpus. This semantic similarity can be used for dealing with long distance dependencies, which are an inherent problem for traditional wordbased n-gram models. Since LSA models adapt dynamically to topics, and meetings have clear topics, we conjecture that these models can improve speech recognition accuracy on meetings. This paper presents perplexity and word error rate results for LSA models for meetings. We present results for models trained on a variety of corpora including meeting data and background domain data, and for combinations of multiple LSA models together with a word-based n-gram model. We show that the meeting and background LSA models can improve over the baseline n-grain models in terms of perplexity and that some background LSA models can significantly improve over the n-gram models in terms of word error rate. For the combination of multiple LSA models we did however not see such an improvement.
引用
收藏
页码:465 / +
页数:2
相关论文
共 50 条
  • [1] Latent semantic information in maximum entropy language models for conversational speech recognition
    Deng, YG
    Khudanpur, S
    HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2003, : 56 - 63
  • [2] Latent semantic language modeling for speech recognition
    Bellegarda, JR
    MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 73 - 103
  • [3] Latent Semantic Analysis (LSA) Based Object Recognition and Clustering
    Hebballi, Vinaykumar
    Rojit, Vidhu
    2015 INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT), 2015, : 416 - 421
  • [4] New latent semantic analysis language model
    Ren, Jisheng
    Wang, Zuoying
    Gaojishu Tongxin/Chinese High Technology Letters, 2005, 15 (08): : 1 - 5
  • [5] Latent Semantic Analysis Models on Wikipedia and TASA
    Stefanescu, Dan
    Banjade, Rajendra
    Rus, Vasile
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1417 - 1422
  • [6] Satellite Recognition via Sparse Coding Based Probabilistic Latent Semantic Analysis
    Zhao, Danpei
    Lu, Ming
    Zhang, Xuguang
    Shi, Jun
    Jiang, Zhiguo
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2014, 11 (02)
  • [7] A rough concept recognition approach for information retrieval based on latent semantic analysis
    Wang, Yi-chuan
    Guo, Yan-hui
    Li, Lei
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 90 - +
  • [8] SEMANTIC LANGUAGE MODELS FOR AUTO MATIC SPEECH RECOGNITION
    Bayer, Ali Orkan
    Riccardi, Giuseppe
    2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 7 - 12
  • [9] Latent semantic analysis: A theory of the psychology of language and mind
    Landauer, TK
    DISCOURSE PROCESSES, 1999, 27 (03) : 303 - 310
  • [10] Randomized Probabilistic Latent Semantic Analysis for Scene Recognition
    Rodner, Erik
    Denzler, Joachim
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, PROCEEDINGS, 2009, 5856 : 945 - 953