Spoken document summarization using acoustic, prosodic and semantic information

被引:0
|
作者
Huang, CL [1 ]
Hsieh, CH [1 ]
Wu, CH [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a spoken document summarization scheme using acoustic, prosodic and semantic information. First, speech recognition confidence is estimated to choose reliable words from the speech transcription. Prosodic information, including pitch and energy, is used for stressed word selection. Latent semantic indexing (LSI) is adopted to identify significant words. Finally, word trigram and semantic dependency is measured to include the syntactic and semantic information for speech summarization. The dynamic programming (DP) algorithm is used to find the best summarization result according to the summarization score estimated from the above five measures. Finally, the summarized result is presented by the concatenation of the summarized speech words. Experimental results indicate that the proposed approach effectively extracts important words and gives a promising speech summary.
引用
收藏
页码:434 / 437
页数:4
相关论文
共 50 条
  • [1] Spoken document summarization using relevant information
    Chen, Yi-Ting
    Lin, Shih-Hsiang
    Wang, Hsin-Min
    Chen, Berlin
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 189 - +
  • [2] Extractive spoken document summarization for information retrieval
    Chen, Berlin
    Chen, Yi-Ting
    PATTERN RECOGNITION LETTERS, 2008, 29 (04) : 426 - 437
  • [3] Multi-layered Summarization of Spoken Document Archives by Information Extraction and Semantic Structuring
    Lee, Lin-shan
    Kong, Sheng-yi
    Pan, Yi-cheng
    Fu, Yi-sheng
    Huang, Yu-tsun
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1539 - 1542
  • [4] Summarization of spoken lectures based on linguistic surface and prosodic information
    Togashi, Shingo
    Yamaguchi, Masaru
    Nakagawa, Seiichi
    2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 34 - +
  • [5] Improved spoken document summarization using Probabilistic Latent Semantic Analysis (PLSA)
    Kong, Sheng-Yi
    Lee, Lin-shan
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 941 - 944
  • [6] Chinese spoken document summarization using probabilistic latent topical information
    Chen, Berlin
    Yeh, Yao-Ming
    Huang, Yao-Min
    Chen, Yi-Ting
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 969 - 972
  • [7] Spoken document summarization using topic-related corpus and semantic dependency grammar
    Hsieh, CH
    Huang, CL
    Wu, CH
    2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 333 - 336
  • [8] Document Summarization Using Semantic Clouds
    Rinaldi, Antonio M.
    2013 IEEE SEVENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2013), 2013, : 100 - 103
  • [9] Tamil Document Summarization using semantic graph method
    Banu, M.
    Karthika, C.
    Sudarmani, P.
    Geetha, T., V
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 128 - 134
  • [10] Extractive Chinese spoken document summarization using probabilistic ranking models
    Chen, Yi-Ting
    Yu, Suhan
    Wang, Hsin-Min
    Chen, Berlin
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 660 - +