Spoken document summarization using acoustic, prosodic and semantic information

被引:0
|
作者
Huang, CL [1 ]
Hsieh, CH [1 ]
Wu, CH [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a spoken document summarization scheme using acoustic, prosodic and semantic information. First, speech recognition confidence is estimated to choose reliable words from the speech transcription. Prosodic information, including pitch and energy, is used for stressed word selection. Latent semantic indexing (LSI) is adopted to identify significant words. Finally, word trigram and semantic dependency is measured to include the syntactic and semantic information for speech summarization. The dynamic programming (DP) algorithm is used to find the best summarization result according to the summarization score estimated from the above five measures. Finally, the summarized result is presented by the concatenation of the summarized speech words. Experimental results indicate that the proposed approach effectively extracts important words and gives a promising speech summary.
引用
收藏
页码:434 / 437
页数:4
相关论文
共 50 条
  • [41] Video summarization by learning semantic information
    Hua R.
    Wu X.
    Zhao W.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2021, 47 (03): : 650 - 657
  • [42] SEMANTIC CONTEXT INFERENCE FOR SPOKEN DOCUMENT RETRIEVAL USING TERM ASSOCIATION MATRICES
    Huang, Chien-Lin
    Hori, Chiori
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [43] SKGSUM: Abstractive Document Summarization with Semantic Knowledge Graphs
    Ji, Xin
    Zhao, Wen
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [44] Using Latent Semantic Indexing for Morph-based Spoken Document Retrieval
    Turunen, Ville T.
    Kurimo, Mikko
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 341 - 344
  • [45] Information fusion for spoken document retrieval
    Ng, K
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 2405 - 2408
  • [46] Spoken English Assessment System for Non-Native Speakers Using Acoustic and Prosodic Features
    Shi, Qin
    Li, Kun
    Zhang, ShiLei
    Chu, Stephen M.
    Xiao, Ji
    Ou, ZhiJian
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1874 - +
  • [47] A Semantic Based Approach for Automatic Patent Document Summarization
    Trappey, Amy J. C.
    Trappey, Charles V.
    Wu, Chun-Yi
    COLLABORATIVE PRODUCTIVE AND SERVICE LIFE CYCLE MANAGEMENT FOR A SUSTAINABLE WORLD, 2008, : 485 - +
  • [48] Comparing Semantic Models for Evaluating Automatic Document Summarization
    Campr, Michal
    Jezek, Karel
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 252 - 260
  • [49] Single document summarization using the information from documents with the same topic
    Mao, Xiangke
    Huang, Shaobin
    Shen, Linshan
    Li, Rongsheng
    Yang, Hui
    KNOWLEDGE-BASED SYSTEMS, 2021, 228
  • [50] A comparative study of probabilistic ranking models for Chinese spoken document summarization
    Lin, Shih-Hsiang
    Chen, Berlin
    Wang, Hsin-Min
    ACM Transactions on Asian Language Information Processing, 2009, 8 (01):