Extractive Chinese spoken document summarization using probabilistic ranking models

被引:0
|
作者
Chen, Yi-Ting [1 ]
Yu, Suhan [1 ]
Wang, Hsin-Min [2 ]
Chen, Berlin [1 ]
机构
[1] Natl Taiwan Normal Univ, Taipei, Taiwan
[2] Acad Sinica, Taipei, Taiwan
关键词
hidden Markov model; probabilistic ranking; relevance model; speech recognition; spoken document summarization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of extractive summarization is to automatically select indicative sentences, passages, or paragraphs from an original document according to a certain target summarization ratio, and then sequence them to form a concise summary. In this paper, in contrast to conventional approaches, our objective is to deal with the extractive summarization problem under a probabilistic modeling framework. We investigate the use of the hidden Markov model (HMM) for spoken document summarization, in which each sentence of a spoken document is treated as an HMM for generating the document, and the sentences are ranked and selected according to their likelihoods. In addition, the relevance model (RM) of each sentence, estimated from a contemporary text collection, is integrated with the HMM model to improve the representation of the sentence model. The experiments were performed on Chinese broadcast news compiled in Taiwan. The proposed approach achieves noticeable performance gains over conventional summarization approaches.
引用
收藏
页码:660 / +
页数:3
相关论文
共 50 条
  • [1] A comparative study of probabilistic ranking models for Chinese spoken document summarization
    Lin, Shih-Hsiang
    Chen, Berlin
    Wang, Hsin-Min
    [J]. ACM Transactions on Asian Language Information Processing, 2009, 8 (01):
  • [2] A comparative study of probabilistic ranking models for spoken document summarization
    Lin, Shih-Hsiang
    Chen, Yi-Ting
    Wang, Hsin-Min
    Chen, Berlin
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5025 - +
  • [3] A Unified Probabilistic Generative Framework for Extractive Spoken Document Summarization
    Chen, Yi-Ting
    Chiu, Hsuan-Sheng
    Wang, Hsin-Min
    Chen, Berlin
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2528 - 2531
  • [4] Chinese spoken document summarization using probabilistic latent topical information
    Chen, Berlin
    Yeh, Yao-Ming
    Huang, Yao-Min
    Chen, Yi-Ting
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 969 - 972
  • [5] Word topical mixture models for extractive spoken document summarization
    Chen, Berlin
    Chen, Yi-Ting
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 52 - 55
  • [6] Extractive spoken document summarization for information retrieval
    Chen, Berlin
    Chen, Yi-Ting
    [J]. PATTERN RECOGNITION LETTERS, 2008, 29 (04) : 426 - 437
  • [7] Improved spoken document summarization using Probabilistic Latent Semantic Analysis (PLSA)
    Kong, Sheng-Yi
    Lee, Lin-shan
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 941 - 944
  • [8] Extractive Summarization of a Document Using Lexical Chains
    Mallick, Chirantana
    Dutta, Madhurima
    Das, Ajit Kumar
    Sarkar, Apurba
    Das, Asit Kumar
    [J]. SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 825 - 836
  • [9] Spoken document summarization using relevant information
    Chen, Yi-Ting
    Lin, Shih-Hsiang
    Wang, Hsin-Min
    Chen, Berlin
    [J]. 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 189 - +
  • [10] A Probabilistic Approach for Extractive Summarization Based on Clustering Cum Graph Ranking Method
    Ahmad, Amreen
    Ahmad, Tanvir
    Masood, Sarfaraz
    Siddiqui, Mohd Khizir
    Abd El-Rahiem, Basma
    Plawiak, Pawel
    Alblehai, Fahad
    [J]. IEEE ACCESS, 2024, 12 : 70464 - 70479