Automatic Summarization of Highly Spontaneous Speech

被引：4

作者：

Beke, Andras ^{[1
]}

Szaszak, Gyorgy ^{[2
]}

机构：

[1] Hungarian Acad Sci, Res Inst Linguist, Budapest, Hungary

[2] Budapest Univ Technol & Econ, Dept Telecommun & Media Informat, Budapest, Hungary

来源：

SPEECH AND COMPUTER | 2016年 / 9811卷

关键词：

Speech summarization; Latent semantic indexing; Spontaneous speech;

D O I：

10.1007/978-3-319-43958-7_16

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses speech summarization of highly spontaneous speech. Speech is converted into text using an ASR, then segmented into tokens. Human made and automatic, prosody based tokenization are compared. The obtained sentence-like units are analysed by a syntactic parser to help automatic sentence selection for the summary. The preprocessed sentences are ranked based on thematic terms and sentence position. The thematic term is expressed in two ways: TF-IDF and Latent Semantic Indexing. The sentence score is calculated as linear combination of the thematic term score and a sentence position score. To generate the summary, the top 10 candidates for the most informative/best summarizing sentences are selected. The system performance showed comparable results (recall: 0.62, precision: 0.79 and F-measure 0.68) with the prosody based tokenization approach. A subjective test is also carried out on a Likert scale.

引用

页码：140 / 147

页数：8

共 50 条

[1] Summarization of Spontaneous Speech using Automatic Speech Recognition and a Speech Prosody based Tokenizer
Szaszak, Gyorgy
Tundik, Mate Akos
Beke, Andras
KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 221 - 227
[2] Automatic sentence segmentation of speech for automatic summarization
Mrozinski, Joanna
Whittaker, Edward W. D.
Chatain, Pierre
Furui, Sadaoki
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 981 - 984
[3] A new approach to automatic speech summarization
Hori, C
Furui, S
IEEE TRANSACTIONS ON MULTIMEDIA, 2003, 5 (03) : 368 - 378
[4] A statistical approach to automatic speech summarization
Hori, C
Furui, S
Malkin, R
Yu, H
Waibel, A
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (02) : 128 - 139
[5] A Statistical Approach to Automatic Speech Summarization
Chiori Hori
Sadaoki Furui
Rob Malkin
Hua Yu
Alex Waibel
EURASIP Journal on Advances in Signal Processing, 2003
[6] A statistical approach to automatic speech summarization
Hori, Chiori
Furul, Sadaoki
Malkin, Rob
Yu, Hua
Waibel, Alex
Eurasip Journal on Applied Signal Processing, 1600, 2003 (02): : 128 - 139
[7] Speech-to-text and speech-to-speech summarization of spontaneous speech
Furui, S
Kikuchi, T
Shinnaka, Y
Hori, C
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (04): : 401 - 408
[8] Recent advances in automatic speech summarization
Furui, Sadaoki
2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 16 - 21
[9] Automatic speech summarization applied to English broadcast news speech
Hori, C
Furui, S
Malkin, R
Yu, H
Waibel, A
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 9 - 12
[10] Automatic Twitter Topic Summarization With Speech Acts
Zhang, Renxian
Li, Wenjie
Gao, Dehong
Ouyang, You
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (03): : 649 - 658

← 1 2 3 4 5 →