Automatic Summarization of Highly Spontaneous Speech

被引:4
|
作者
Beke, Andras [1 ]
Szaszak, Gyorgy [2 ]
机构
[1] Hungarian Acad Sci, Res Inst Linguist, Budapest, Hungary
[2] Budapest Univ Technol & Econ, Dept Telecommun & Media Informat, Budapest, Hungary
来源
SPEECH AND COMPUTER | 2016年 / 9811卷
关键词
Speech summarization; Latent semantic indexing; Spontaneous speech;
D O I
10.1007/978-3-319-43958-7_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses speech summarization of highly spontaneous speech. Speech is converted into text using an ASR, then segmented into tokens. Human made and automatic, prosody based tokenization are compared. The obtained sentence-like units are analysed by a syntactic parser to help automatic sentence selection for the summary. The preprocessed sentences are ranked based on thematic terms and sentence position. The thematic term is expressed in two ways: TF-IDF and Latent Semantic Indexing. The sentence score is calculated as linear combination of the thematic term score and a sentence position score. To generate the summary, the top 10 candidates for the most informative/best summarizing sentences are selected. The system performance showed comparable results (recall: 0.62, precision: 0.79 and F-measure 0.68) with the prosody based tokenization approach. A subjective test is also carried out on a Likert scale.
引用
收藏
页码:140 / 147
页数:8
相关论文
共 50 条
  • [41] Automatic Text Summarization
    Soumya, S.
    Kumar, Geethu S.
    Naseem, Rasia
    Mohan, Saumya
    COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 787 - 789
  • [42] Speech-to-Text Summarization Using Automatic Phrase Extraction from Recognized Text
    Rott, Michal
    Cerva, Petr
    TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 101 - 108
  • [43] Automatic Summarization: An Overview
    Saggion, Horacio
    REVUE FRANCAISE DE LINGUISTIQUE APPLIQUEE, 2008, 13 (01): : 63 - 81
  • [44] Automatic Text Summarization
    Fattah, Mohamed Abdel
    Ren, Fuji
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 27, 2008, 27 : 192 - +
  • [45] The challenges of automatic summarization
    Hahn, U
    Mani, I
    COMPUTER, 2000, 33 (11) : 29 - +
  • [46] Automatic Software Summarization
    Moreno, Laura
    Marcus, Andrian
    PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING - COMPANION (ICSE-COMPANION, 2018, : 530 - 531
  • [47] Automatic Segmentation and Labeling for Spontaneous Standard Malay Speech Recognition
    Seman, Noraini
    Jusoff, Kamarazaman
    2008 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING, 2008, : 59 - +
  • [48] Comparison of read and spontaneous speech in case of Automatic Detection of Depression
    Kiss, Gabor
    Vicsi, Klara
    2017 8TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2017, : 213 - 217
  • [49] Acoustic Models for the Automatic Identification of Prosodic Boundaries in Spontaneous Speech
    Falcao Teixeira, Barbara Heloha
    Mittmann, Maryuale Malvessi
    REVISTA DE ESTUDOS DA LINGUAGEM, 2018, 26 (04) : 1455 - 1488
  • [50] Comparative Analysis of Classifiers for Automatic Language Recognition in Spontaneous Speech
    Simonchik, Konstantin
    Novoselov, Sergey
    Lavrentyeva, Galina
    SPEECH AND COMPUTER, 2016, 9811 : 174 - 181