Chinese spoken document summarization using probabilistic latent topical information

被引:0
|
作者
Chen, Berlin [1 ]
Yeh, Yao-Ming [1 ]
Huang, Yao-Min [1 ]
Chen, Yi-Ting [1 ]
机构
[1] Natl Taiwan Normal Univ, Grad Inst Comp Sci & Informat Engn, Taipei, Taiwan
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The purpose of extractive summarization is to automatically select a number of indicative sentences, passages, or paragraphs from the original document according to a target summarization ratio and then sequence them to form a concise summary. In the paper, we proposed the use of probabilistic latent topical information for extractive summarization of spoken documents. Various kinds of modeling structures and learning approaches were extensively investigated. In addition, the summarization capabilities were verified by comparison with the conventional vector space model and latent semantic indexing model, as well as the HMM model. The experiments were performed on the Chinese broadcast news collected in Taiwan. Noticeable performance gains were obtained.
引用
收藏
页码:969 / 972
页数:4
相关论文
共 50 条
  • [21] Probabilistic aspects in spoken document retrieval
    Macherey, W. (w.macherey@informatik.rwth-aachen.de), 1600, Hindawi Publishing Corporation (2003):
  • [22] Probabilistic aspects in spoken document retrieval
    Macherey, W
    Viechtbauer, HJ
    Ney, H
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (02) : 115 - 127
  • [23] Probabilistic Aspects in Spoken Document Retrieval
    Wolfgang Macherey
    Hans Jörg Viechtbauer
    Hermann Ney
    EURASIP Journal on Advances in Signal Processing, 2003
  • [24] The Repository of Web Document Summarization using Social Information
    Minh-Tien Nguyen
    Van-Hau Nguyen
    Duc-Vu Tran
    PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 445 - 449
  • [25] IMPROVED SPOKEN DOCUMENT SUMMARIZATION WITH COVERAGE MODELING TECHNIQUES
    Chen, Kuan-Yu
    Liu, Shih-Hung
    Chen, Berlin
    Wang, Hsin-Min
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6010 - 6014
  • [26] A Novel Paragraph Embedding Method for Spoken Document Summarization
    Chen, Kuan-Yu
    Liu, Shih-Hung
    Chen, Berlin
    Wang, Hsin-Min
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [27] Graph Summarization with Latent Variable Probabilistic Models
    Fukushima, Shintaro
    Kanai, Ryoga
    Yamanishi, Kenji
    COMPLEX NETWORKS & THEIR APPLICATIONS X, VOL 2, 2022, 1016 : 428 - 440
  • [28] Spoken document summarization using topic-related corpus and semantic dependency grammar
    Hsieh, CH
    Huang, CL
    Wu, CH
    2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 333 - 336
  • [29] LATENT TOPIC MODELING OF WORD CO-OCCURRENCE INFORMATION FOR SPOKEN DOCUMENT RETRIEVAL
    Chen, Berlin
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3961 - 3964
  • [30] FINE GRAINED SPOKEN DOCUMENT SUMMARIZATION THROUGH TEXT SEGMENTATION
    Kotey, Samantha
    Dahyot, Rozenn
    Harte, Naomi
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 647 - 654