Automatic Summarization of Highly Spontaneous Speech

被引:4
|
作者
Beke, Andras [1 ]
Szaszak, Gyorgy [2 ]
机构
[1] Hungarian Acad Sci, Res Inst Linguist, Budapest, Hungary
[2] Budapest Univ Technol & Econ, Dept Telecommun & Media Informat, Budapest, Hungary
来源
SPEECH AND COMPUTER | 2016年 / 9811卷
关键词
Speech summarization; Latent semantic indexing; Spontaneous speech;
D O I
10.1007/978-3-319-43958-7_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses speech summarization of highly spontaneous speech. Speech is converted into text using an ASR, then segmented into tokens. Human made and automatic, prosody based tokenization are compared. The obtained sentence-like units are analysed by a syntactic parser to help automatic sentence selection for the summary. The preprocessed sentences are ranked based on thematic terms and sentence position. The thematic term is expressed in two ways: TF-IDF and Latent Semantic Indexing. The sentence score is calculated as linear combination of the thematic term score and a sentence position score. To generate the summary, the top 10 candidates for the most informative/best summarizing sentences are selected. The system performance showed comparable results (recall: 0.62, precision: 0.79 and F-measure 0.68) with the prosody based tokenization approach. A subjective test is also carried out on a Likert scale.
引用
收藏
页码:140 / 147
页数:8
相关论文
共 50 条
  • [1] Summarization of Spontaneous Speech using Automatic Speech Recognition and a Speech Prosody based Tokenizer
    Szaszak, Gyorgy
    Tundik, Mate Akos
    Beke, Andras
    KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 221 - 227
  • [2] Automatic sentence segmentation of speech for automatic summarization
    Mrozinski, Joanna
    Whittaker, Edward W. D.
    Chatain, Pierre
    Furui, Sadaoki
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 981 - 984
  • [3] A new approach to automatic speech summarization
    Hori, C
    Furui, S
    IEEE TRANSACTIONS ON MULTIMEDIA, 2003, 5 (03) : 368 - 378
  • [4] A statistical approach to automatic speech summarization
    Hori, C
    Furui, S
    Malkin, R
    Yu, H
    Waibel, A
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (02) : 128 - 139
  • [5] A Statistical Approach to Automatic Speech Summarization
    Chiori Hori
    Sadaoki Furui
    Rob Malkin
    Hua Yu
    Alex Waibel
    EURASIP Journal on Advances in Signal Processing, 2003
  • [6] A statistical approach to automatic speech summarization
    Hori, Chiori
    Furul, Sadaoki
    Malkin, Rob
    Yu, Hua
    Waibel, Alex
    Eurasip Journal on Applied Signal Processing, 1600, 2003 (02): : 128 - 139
  • [7] Speech-to-text and speech-to-speech summarization of spontaneous speech
    Furui, S
    Kikuchi, T
    Shinnaka, Y
    Hori, C
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (04): : 401 - 408
  • [8] Recent advances in automatic speech summarization
    Furui, Sadaoki
    2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 16 - 21
  • [9] Automatic speech summarization applied to English broadcast news speech
    Hori, C
    Furui, S
    Malkin, R
    Yu, H
    Waibel, A
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 9 - 12
  • [10] Automatic Twitter Topic Summarization With Speech Acts
    Zhang, Renxian
    Li, Wenjie
    Gao, Dehong
    Ouyang, You
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (03): : 649 - 658