Spoken document summarization using acoustic, prosodic and semantic information

Cited by: 0
Authors
Huang, CL [1 ]
Hsieh, CH [1 ]
Wu, CH [1 ]
Affiliation
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a spoken document summarization scheme that uses acoustic, prosodic, and semantic information. First, speech recognition confidence is estimated to choose reliable words from the speech transcription. Prosodic information, including pitch and energy, is used to select stressed words. Latent semantic indexing (LSI) is adopted to identify significant words. Word trigram and semantic dependency scores are then measured to incorporate syntactic and semantic information into the summary. A dynamic programming (DP) algorithm finds the best summarization result according to a summarization score estimated from these five measures. Finally, the summary is presented by concatenating the selected speech words. Experimental results indicate that the proposed approach effectively extracts important words and gives a promising speech summary.
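The abstract does not give the scoring formula or the DP recurrence, so the following is only a minimal sketch of how such a word-selection DP could look, not the authors' method. It assumes each word already has a single combined score (for example, a weighted sum of confidence, stress, and LSI significance) and that a pairwise `link_score(i, j)` stands in for the trigram and semantic dependency measures; the names `word_score`, `link_score`, and `summarize`, the weights, and the fixed summary length `m` are all hypothetical.

```python
from typing import Callable, List, Sequence, Tuple


def word_score(confidence: float, stress: float, significance: float,
               weights: Tuple[float, float, float] = (1.0, 1.0, 1.0)) -> float:
    """Hypothetical per-word score: weighted sum of three word-level measures."""
    return (weights[0] * confidence
            + weights[1] * stress
            + weights[2] * significance)


def summarize(word_scores: Sequence[float],
              link_score: Callable[[int, int], float],
              m: int) -> Tuple[float, List[int]]:
    """Select m word indices, in original order, maximizing the total score.

    The total is the sum of the selected words' scores plus link_score(i, j)
    for each pair of consecutively selected words i < j. The pairwise link is
    an assumed simplification of trigram / semantic dependency scoring.
    """
    n = len(word_scores)
    if not 0 < m <= n:
        raise ValueError("m must be between 1 and the number of words")

    neg_inf = float("-inf")
    # best[k][j]: best score of a (k + 1)-word summary whose last word is j
    best = [[neg_inf] * n for _ in range(m)]
    back = [[-1] * n for _ in range(m)]

    for j in range(n):
        best[0][j] = word_scores[j]

    for k in range(1, m):
        for j in range(k, n):
            for i in range(k - 1, j):
                cand = best[k - 1][i] + link_score(i, j) + word_scores[j]
                if cand > best[k][j]:
                    best[k][j] = cand
                    back[k][j] = i

    # Pick the best final word, then follow back-pointers to recover the path.
    end = max(range(m - 1, n), key=lambda j: best[m - 1][j])
    path = [end]
    for k in range(m - 1, 0, -1):
        path.append(back[k][path[-1]])
    path.reverse()
    return best[m - 1][end], path


if __name__ == "__main__":
    scores = [0.9, 0.1, 0.8, 0.2, 0.7]  # made-up illustrative values
    total, picked = summarize(scores, lambda i, j: 0.0, m=3)
    print(picked, round(total, 3))
```

The recurrence costs O(m n^2) time for n transcribed words. A summary built this way would then be rendered by concatenating the audio of the selected word indices, as the abstract describes.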
Pages: 434 - 437
Page count: 4
Related papers
50 in total
  • [11] Spoken document summarization and retrieval for wireless application
    Wu, CH
    Huang, CL
    Hsieh, CH
    2005 INTERNATIONAL CONFERENCE ON WIRELESS NETWORKS, COMMUNICATIONS AND MOBILE COMPUTING, VOLS 1 AND 2, 2005, : 1388 - 1393
  • [12] Using prosodic information to constrain language models for spoken dialogue
    Taylor, P
    Shimodaira, H
    Isard, S
    King, S
    Kowtko, J
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 216 - 219
  • [13] Leveraging Word Embeddings for Spoken Document Summarization
    Chen, Kuan-Yu
    Liu, Shih-Hung
    Wang, Hsin-Min
    Chen, Berlin
    Chen, Hsin-Hsi
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1383 - 1387
  • [14] Multi-document extractive summarization using semantic graph
    del Camino Valle, Oleyda
    Simon-Cuevas, Alfredo
    Valladares-Valdes, Eduardo
    Olivas, Jose A.
    Romero, Francisco P.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2019, (63): : 103 - 110
  • [15] Using a multimedia semantic graph for web document visualization and summarization
    Rinaldi, Antonio M.
    Russo, Cristiano
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (03) : 3885 - 3925
  • [17] Document Summarization Based on Semantic Representations
    Zhang, Hui
    Zhang, Xueliang
    Gao, Guanglai
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015, : 152 - 155
  • [18] Assessing the semantic space bias caused by ASR error propagation and its effect on spoken document summarization
    Tundik, Mate Akos
    Kaszas, Valer
    Szaszak, Gyorgy
    INTERSPEECH 2019, 2019, : 1333 - 1337
  • [19] Using semantic and phonetic term similarity for spoken document retrieval and spoken query processing
    Crestani, F
    TECHNOLOGIES FOR CONSTRUCTING INTELLIGENT SYSTEMS 1: TASKS, 2002, 89 : 363 - 375
  • [20] Spoken document retrieval using multilevel knowledge and semantic verification
    Huang, Chien-Lin
    Wu, Chung-Hsien
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2551 - 2560