Automatic Summarization of Highly Spontaneous Speech

被引:4
|
作者
Beke, Andras [1 ]
Szaszak, Gyorgy [2 ]
机构
[1] Hungarian Acad Sci, Res Inst Linguist, Budapest, Hungary
[2] Budapest Univ Technol & Econ, Dept Telecommun & Media Informat, Budapest, Hungary
来源
SPEECH AND COMPUTER | 2016年 / 9811卷
关键词
Speech summarization; Latent semantic indexing; Spontaneous speech;
D O I
10.1007/978-3-319-43958-7_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses speech summarization of highly spontaneous speech. Speech is converted into text using an ASR, then segmented into tokens. Human made and automatic, prosody based tokenization are compared. The obtained sentence-like units are analysed by a syntactic parser to help automatic sentence selection for the summary. The preprocessed sentences are ranked based on thematic terms and sentence position. The thematic term is expressed in two ways: TF-IDF and Latent Semantic Indexing. The sentence score is calculated as linear combination of the thematic term score and a sentence position score. To generate the summary, the top 10 candidates for the most informative/best summarizing sentences are selected. The system performance showed comparable results (recall: 0.62, precision: 0.79 and F-measure 0.68) with the prosody based tokenization approach. A subjective test is also carried out on a Likert scale.
引用
收藏
页码:140 / 147
页数:8
相关论文
共 50 条
  • [31] Automatic detection of contrastive elements in spontaneous speech
    Nenkova, Ani
    Jurafsky, Dan
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 201 - +
  • [32] Automatic detection of prosodic boundaries in spontaneous speech
    Biron, Tirza
    Baum, Daniel
    Freche, Dominik
    Matalon, Nadav
    Ehrmann, Netanel
    Weinreb, Eyal
    Biron, David
    Moses, Elisha
    PLOS ONE, 2021, 16 (05):
  • [33] Automatic phrase segmentation and clustering in spontaneous speech
    Beke, Andras
    Szaszak, Gyorgy
    Varadi, Viola
    2013 IEEE 4TH INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2013, : 459 - 462
  • [34] Automatic Summarization
    Nenkova, Ani
    McKeown, Kathleen
    FOUNDATIONS AND TRENDS IN INFORMATION RETRIEVAL, 2011, 5 (2-3): : 103 - 233
  • [35] Automatic summarization
    Paice, CD
    COMPUTATIONAL LINGUISTICS, 2002, 28 (02) : 221 - 223
  • [36] AUTOMATIC RECOGNITION OF SCHWA VARIANTS IN SPONTANEOUS HUNGARIAN SPEECH
    Andras Beke
    Gyoergy Szaszak
    ACTA LINGUISTICA HUNGARICA, 2010, 57 (2-3) : 329 - 353
  • [37] Automatic assessment of syntactic complexity for spontaneous speech scoring
    Bhat, Suma
    Yoon, Su-Youn
    SPEECH COMMUNICATION, 2015, 67 : 42 - 57
  • [38] Automatic generation of pronunciation lexicons for Mandarin spontaneous speech
    Byrne, W
    Venkataramani, V
    Kamm, T
    Zheng, TF
    Song, Z
    Fung, P
    Liu, Y
    Ruhi, U
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 569 - 572
  • [39] Summarization of Spontaneous Conversations
    Zhu, Xiaodan
    Penn, Gerald
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1531 - 1534
  • [40] Automatic summarization.
    Lincicum, SJ
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2002, 53 (05): : 410 - 411