USING N-BEST RECOGNITION OUTPUT FOR EXTRACTIVE SUMMARIZATION AND KEYWORD EXTRACTION IN MEETING SPEECH

被引：6

作者：

Liu, Yang ^{[1
]}

Xie, Shasha ^{[1
]}

Liu, Fei ^{[1
]}

机构：

[1] Univ Texas Dallas, Richardson, TX 75083 USA

来源：

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年

关键词：

summarization; keyword extraction; n-best hypotheses;

D O I：

10.1109/ICASSP.2010.5494972

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

There has been increasing interest recently in meeting understanding, such as summarization, browsing, action item detection, and topic segmentation. However, there is very limited effort on using rich recognition output (e.g., recognition confidence measure or more recognition candidates) for these downstream tasks. This paper presents an initial study using n-best recognition hypotheses for two tasks, extractive summarization and keyword extraction. We extend the approach used on 1-best output to n-best hypotheses: MMR (maximum marginal relevance) for summarization and TFIDF (term frequency, inverse document frequency) weighting for keyword extraction. Our experiments on the ICSI meeting corpus demonstrate promising improvement using n-best hypotheses over 1-best output. These results suggest worthy future studies using n-best or lattices as the interface between speech recognition and downstream tasks.

引用

页码：5310 / 5313

页数：4

共 50 条

[1] Using N-Best Lists and Confusion Networks for Meeting Summarization
Xie, Shasha
Liu, Yang
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (05): : 1160 - 1169
[2] Improvement in N-best search for continuous speech recognition
Illina, I
Gong, YF
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2147 - 2150
[3] N-best vector quantization for isolated word speech recognition
Nose, Masaya
Maki, Shuichi
Yartiane, Noburnoto
Morikawa, Yoshitaka
PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-8, 2007, : 2053 - +
[4] Results of the N-Best 2008 Dutch Speech Recognition Evaluation
van Leeuwen, David A.
Kessens, Judith
Sanders, Eric
van den Heuvel, Henk
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2531 - +
[5] Determination of the number of candidates using recognition scores for N-best based speech interface
Cho, K
Yamashita, Y
Proceedings of the Sixth IASTED International Conference on Signal and Image Processing, 2004, : 268 - 272
[6] Discriminative keyword spotting using triphones information and N-best search
Tabibian, Shima
Akbari, Ahmad
Nasersharif, Babak
INFORMATION SCIENCES, 2018, 423 : 157 - 171
[7] The ESAT 2008 System for N-Best Dutch Speech Recognition Benchmark
Demuynck, Kris
Puurula, Antti
Van Compernolle, Dirk
Wambacq, Patrick
2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 339 - 344
[8] A word graph based N-Best search in continuous speech recognition
Tran, BH
Seide, F
Steinbiss, V
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2127 - 2130
[9] ESSumm: Extractive Speech Summarization from Untranscribed Meeting
Wang, Jun
INTERSPEECH 2022, 2022, : 3243 - 3247
[10] An N-Best Candidates-Based Discriminative Training for Speech Recognition Applications
Chen, Jung-Kuei
Soong, Frank K.
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 206 - 216

← 1 2 3 4 5 →