Combination of latent semantic analysis based language models for meeting recognition

被引:0
|
作者
Puscher, Michael [1 ,2 ]
Huang, Yan [3 ]
Cetin, Ozgur [3 ]
机构
[1] Telecommun Res Ctr Vienna, Vienna, Austria
[2] Graz Univ Technol, Speech & Signal Proc Lab, Graz, Austria
[3] Int Comp Sci Inst, Berkeley, CA 94704 USA
关键词
speech recognition; latent semantic indexing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Latent Semantic Analysis (LSA) defines a semantic similarity space using a training corpus. This semantic similarity can be used for dealing with long distance dependencies, which are an inherent problem for traditional wordbased n-gram models. Since LSA models adapt dynamically to topics, and meetings have clear topics, we conjecture that these models can improve speech recognition accuracy on meetings. This paper presents perplexity and word error rate results for LSA models for meetings. We present results for models trained on a variety of corpora including meeting data and background domain data, and for combinations of multiple LSA models together with a word-based n-gram model. We show that the meeting and background LSA models can improve over the baseline n-grain models in terms of perplexity and that some background LSA models can significantly improve over the n-gram models in terms of word error rate. For the combination of multiple LSA models we did however not see such an improvement.
引用
收藏
页码:465 / +
页数:2
相关论文
共 50 条
  • [41] Medical Record Text Analysis Based on Latent Semantic Analysis
    Jin, Xinyu
    Ma, Wentao
    Li, Yunze
    2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2015, : 108 - 110
  • [42] Latent semantic models for collaborative filtering
    Hofmann, T
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2004, 22 (01) : 89 - 115
  • [43] Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition
    Masumura, Ryo
    Asami, Taichi
    Oba, Takanobu
    Sakauchi, Sumitaka
    Ito, Akinori
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (12) : 2557 - 2567
  • [44] Sentiment Classification of Documents Based on Latent Semantic Analysis
    Wang, Lan
    Wan, Yuan
    ADVANCED RESEARCH ON COMPUTER EDUCATION, SIMULATION AND MODELING, PT II, 2011, 176 (02): : 356 - +
  • [45] A protein classification method based on Latent Semantic Analysis
    Yuan, Yongsheng
    Lin, Lei
    Dong, Qiwen
    Wang, Xiaolong
    Li, Minghui
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 7738 - 7741
  • [46] Automatic document classification based on latent semantic analysis
    I. Kuralenok
    I. Nekrest'yanov
    Programming and Computer Software, 2000, 26 : 199 - 206
  • [47] Text structure analysis based on latent semantic indexing
    Lin, Hongfei
    Zhan, Xuegang
    Yao, Tianshun
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2000, 13 (01): : 47 - 51
  • [48] Discipline literature retrieval based on latent semantic analysis
    College of Economics & Management, Anhui Agricultural University, Hefei
    230026, China
    不详
    230027, China
    Wuhan Daxue Xuebao Xinxi Kexue Ban, (51-56):
  • [49] Latent semantic analysis for text-based research
    Foltz, PW
    BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1996, 28 (02): : 197 - 202
  • [50] Web text categorization based on latent semantic analysis
    Wang Jianfeng
    Yuan Jinsha
    ICCSE'2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2006, : 826 - 828