Learning Spoken Document Similarity and Recommendation using Supervised Probabilistic Latent Semantic Analysis

Authors
Thambiratnam, K. [1]
Seide, F. [1]
Affiliations
[1] Microsoft Research Asia, Beijing 100080, People's Republic of China
Keywords
Document Similarity; Document Recommendation; Probabilistic Latent Semantic Analysis; Spoken Document Retrieval; Information Retrieval
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a model-based approach to spoken document similarity called Supervised Probabilistic Latent Semantic Analysis (PLSA). The method differs from traditional spoken document similarity techniques in that it allows similarity to be learned rather than approximated. The ability to learn similarity is desirable in applications such as Internet video recommendation, in which complex relationships such as user preference or speaking style need to be predicted. The proposed method exploits prior knowledge of document relationships to learn similarity. Experiments on broadcast news and Internet video corpora yielded absolute mAP gains of 16.2% and 9.7%, respectively, over traditional PLSA. Additionally, a cascaded Supervised+Discriminative PLSA system achieved a 3.0% absolute mAP gain over a Discriminative PLSA system, demonstrating the complementary nature of Supervised and Discriminative PLSA training.
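The abstract reports its results as absolute gains in mean average precision (mAP). As a rough illustration of how such a figure is usually computed, the following minimal sketch scores ranked lists of retrieved documents against sets of relevant documents. It is not the authors' evaluation code: the function names, document IDs, and rankings are hypothetical, and the paper's exact evaluation protocol is not given in this record.

```python
from typing import Sequence, Set


def average_precision(ranked: Sequence[str], relevant: Set[str]) -> float:
    """Average precision for one query.

    Averages precision@k over every rank k at which a relevant
    document appears, normalised by the number of relevant documents.
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for k, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / k
    return precision_sum / len(relevant)


def mean_average_precision(
    rankings: Sequence[Sequence[str]],
    relevance: Sequence[Set[str]],
) -> float:
    """Mean of the per-query average precisions."""
    aps = [average_precision(r, rel) for r, rel in zip(rankings, relevance)]
    return sum(aps) / len(aps) if aps else 0.0


# Hypothetical rankings produced by two systems for the same two queries.
relevance = [{"a", "c"}, {"x"}]
baseline_rankings = [["b", "a", "c"], ["y", "x"]]
improved_rankings = [["a", "b", "c"], ["x", "y"]]

baseline_map = mean_average_precision(baseline_rankings, relevance)
improved_map = mean_average_precision(improved_rankings, relevance)

# An "absolute" gain is the plain difference between the two mAP values,
# expressed in percentage points, not a ratio relative to the baseline.
absolute_gain_points = 100.0 * (improved_map - baseline_map)
```

Under this reading, a "16.2% absolute mAP gain" means the mAP rose by 16.2 percentage points, not by 16.2% of the baseline value.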
Pages: 2840-2843
Page count: 4
Related papers
  • [1] Discriminatively trained spoken document similarity models and their application to probabilistic latent semantic analysis
    Thambiratnam, K.
    Seide, F.
    Yu, P.
    [J]. 2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 42 - +
  • [2] Improved spoken document summarization using Probabilistic Latent Semantic Analysis (PLSA)
    Kong, Sheng-Yi
    Lee, Lin-shan
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 941 - 944
  • [3] Learning Similarity with Probabilistic Latent Semantic Analysis for Image Retrieval
    Li, Xiong
    Lv, Qi
    Huang, Wenting
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2015, 9 (04): : 1424 - 1440
  • [4] Supervised learning probabilistic Latent Semantic Analysis for human motion analysis
    Wang, Jin
    Liu, Ping
    She, Mary F. H.
    Kouzani, Abbas
    Nahavandi, Saeid
    [J]. NEUROCOMPUTING, 2013, 100 : 134 - 143
  • [5] Improved spoken document retrieval with dynamic key term lexicon and probabilistic latent semantic analysis (PLSA)
    Hsieh, Ya-chao
    Huang, Yu-tsun
    Wang, Chien-chih
    Lee, Lin-shan
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 961 - 964
  • [6] Visualizing Document Similarity Using N-Grams and Latent Semantic Analysis
    Hussein, Ashraf S.
    [J]. PROCEEDINGS OF THE 2016 SAI COMPUTING CONFERENCE (SAI), 2016, : 269 - 279
  • [7] Using semantic and phonetic term similarity for spoken document retrieval and spoken query processing
    Crestani, F
    [J]. TECHNOLOGIES FOR CONSTRUCTING INTELLIGENT SYSTEMS 1: TASKS, 2002, 89 : 363 - 375
  • [8] Incremental Probabilistic Latent Semantic Analysis for Automatic Question Recommendation
    Wu, Hu
    Wang, Yongji
    Cheng, Xiang
    [J]. RECSYS'08: PROCEEDINGS OF THE 2008 ACM CONFERENCE ON RECOMMENDER SYSTEMS, 2008, : 99 - 106
  • [9] A web recommendation technique based on probabilistic latent semantic analysis
    Xu, GD
    Zhang, YC
    Zhou, XF
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2005, 2005, 3806 : 15 - 28
  • [10] Chinese spoken document summarization using probabilistic latent topical information
    Chen, Berlin
    Yeh, Yao-Ming
    Huang, Yao-Min
    Chen, Yi-Ting
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 969 - 972