Chinese spoken document summarization using probabilistic latent topical information

Authors
Chen, Berlin [1]
Yeh, Yao-Ming [1]
Huang, Yao-Min [1]
Chen, Yi-Ting [1]
Affiliations
[1] Natl Taiwan Normal Univ, Grad Inst Comp Sci & Informat Engn, Taipei, Taiwan
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Extractive summarization automatically selects a number of indicative sentences, passages, or paragraphs from the original document according to a target summarization ratio and then sequences them to form a concise summary. In this paper, we propose the use of probabilistic latent topical information for extractive summarization of spoken documents. Various modeling structures and learning approaches were extensively investigated. In addition, the summarization capabilities were verified by comparison with the conventional vector space model and the latent semantic indexing model, as well as the hidden Markov model (HMM). The experiments were performed on Chinese broadcast news collected in Taiwan. Noticeable performance gains were obtained.
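The selection-and-sequencing step described in the abstract can be illustrated with a minimal Python sketch. It uses a plain vector space baseline (cosine similarity between each sentence and the whole document), not the probabilistic latent topic model the paper proposes. The function names `cosine` and `summarize`, the pre-tokenized input, and the choice to apply the summarization ratio to the sentence count are all assumptions made for illustration.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def summarize(sentences, ratio):
    """Return the indices of the selected sentences in document order.

    `sentences` is a list of token lists. `ratio` is the fraction of
    sentences to keep (an assumption: the ratio could equally be
    defined over words or characters).
    """
    # Term-frequency vector of the whole document.
    doc_vec = Counter(tok for sent in sentences for tok in sent)
    # Score each sentence by its similarity to the document vector.
    scores = [cosine(Counter(sent), doc_vec) for sent in sentences]
    # Number of sentences to keep, at least one.
    keep = max(1, math.ceil(ratio * len(sentences)))
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    # Re-sequence the selected sentences in their original order.
    return sorted(ranked[:keep])
```

Calling `summarize(tokenized_sentences, 0.5)` would keep roughly half of the sentences; a latent topic model would replace only the scoring line, leaving the ranking and re-sequencing unchanged.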
Pages: 969 - 972
Number of pages: 4