Automatic sentence segmentation of speech for automatic summarization

被引:0
|
作者
Mrozinski, Joanna [1 ]
Whittaker, Edward W. D. [1 ]
Chatain, Pierre [1 ]
Furui, Sadaoki [1 ]
机构
[1] Tokyo Inst Technol, Dept Comp Sci, Grad Sch Informat Sci & Engn, Meguro Ku, Tokyo 1528552, Japan
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents an automatic sentence segmentation method for an automatic speech summarization system. The segmentation method is based on combining word- and class-based statistical language models to predict sentence and non-sentence boundaries. We study both the performance of the sentence segmentation system itself and the effect of the segmentation on the summarization accuracy. The sentence segmentation is done by modelling the probability of a sentence boundary given a certain word history with language models trained on transcriptions and texts from several sources. The resulting segmented data is used as the input to an existing automatic summarization system to determine the effect it has on the summarization process. We conduct all our experiments with two types of evaluation data: broadcast news and lecture transcriptions. The automatic summarizations are created with different sentence segmentations and different summarization ratios (30% and 40%) and evaluated by comparing them to human-made summaries. We show that a proper sentence segmentation is essential to achieve good performance with an automatic summarization system.
引用
收藏
页码:981 / 984
页数:4
相关论文
共 50 条
  • [1] Impact of automatic sentence segmentation on meeting summarization
    Liu, Yang
    Xie, Shasha
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5009 - 5012
  • [2] Automatic speech summarization based on sentence extraction and compaction
    Kikuchi, T
    Furui, S
    Hori, C
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 384 - 387
  • [3] Sentence-extractive automatic speech summarization and evaluation techniques
    Hirohata, Makoto
    Shinnaka, Yosuke
    Iwano, Koji
    Furui, Sadaoki
    [J]. SPEECH COMMUNICATION, 2006, 48 (09) : 1151 - 1161
  • [4] Sentence-Level Automatic Speech Segmentation for Amharic
    Tamiru, Rahel Mekonen
    Abate, Solomon Teferra
    [J]. PROCEEDINGS OF SIXTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2021), VOL 2, 2022, 236 : 477 - 485
  • [5] Sentence reduction for automatic text summarization
    Jing, HY
    [J]. 6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : 310 - 315
  • [6] Automatic Speech Segmentation for Automatic Speech Translation
    Klosowski, Piotr
    Dustor, Adam
    [J]. COMPUTER NETWORKS, CN 2013, 2013, 370 : 466 - 475
  • [7] AUTOMATIC SEGMENTATION OF SPEECH
    VANHEMERT, JP
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) : 1008 - 1012
  • [8] Automatic Summarization of Highly Spontaneous Speech
    Beke, Andras
    Szaszak, Gyorgy
    [J]. SPEECH AND COMPUTER, 2016, 9811 : 140 - 147
  • [9] A new approach to automatic speech summarization
    Hori, C
    Furui, S
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2003, 5 (03) : 368 - 378
  • [10] A statistical approach to automatic speech summarization
    Hori, C
    Furui, S
    Malkin, R
    Yu, H
    Waibel, A
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (02) : 128 - 139