USING N-BEST RECOGNITION OUTPUT FOR EXTRACTIVE SUMMARIZATION AND KEYWORD EXTRACTION IN MEETING SPEECH

Cited by: 6
Authors
Liu, Yang [1]
Xie, Shasha [1]
Liu, Fei [1]
Affiliation
[1] Univ Texas Dallas, Richardson, TX 75083 USA
Keywords
summarization; keyword extraction; n-best hypotheses
DOI
10.1109/ICASSP.2010.5494972
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Interest in meeting understanding tasks, such as summarization, browsing, action item detection, and topic segmentation, has grown recently. However, very little work has used rich recognition output (e.g., recognition confidence measures or additional recognition candidates) for these downstream tasks. This paper presents an initial study that uses n-best recognition hypotheses for two tasks: extractive summarization and keyword extraction. We extend the approaches used on 1-best output to n-best hypotheses: MMR (maximum marginal relevance) for summarization and TFIDF (term frequency, inverse document frequency) weighting for keyword extraction. Our experiments on the ICSI meeting corpus show promising improvements from n-best hypotheses over 1-best output. These results suggest that n-best lists or lattices merit further study as the interface between speech recognition and downstream tasks.
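The abstract does not spell out how MMR and TFIDF are extended to n-best hypotheses, so the following is only a minimal sketch of one plausible reading, not the authors' method. It assumes each utterance comes with a list of hypotheses, each paired with a non-negative weight (for example a posterior-like score), and that term counts are averaged over hypotheses using those weights. The function names `nbest_tf`, `tfidf_keywords`, `cosine`, and `mmr_select`, the weighting scheme, and the parameter `lam` are all hypothetical choices made for illustration.

```python
import math
from collections import Counter


def nbest_tf(hypotheses):
    """Expected term counts for one utterance.

    `hypotheses` is a list of (tokens, weight) pairs; weights are
    normalized to sum to 1 (an assumption, not taken from the paper).
    """
    total = sum(w for _, w in hypotheses)
    tf = Counter()
    for tokens, w in hypotheses:
        for t in tokens:
            tf[t] += w / total
    return tf


def tfidf_keywords(doc_tf, doc_freq, n_docs, top_k=5):
    """Rank terms by TF * log(N / DF); unseen terms get DF = 1."""
    scores = {
        t: c * math.log(n_docs / doc_freq.get(t, 1))
        for t, c in doc_tf.items()
    }
    return sorted(scores, key=scores.get, reverse=True)[:top_k]


def cosine(a, b):
    """Cosine similarity between two sparse term-weight dicts."""
    dot = sum(v * b.get(t, 0.0) for t, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def mmr_select(sent_vecs, doc_vec, k, lam=0.7):
    """Greedy MMR: relevance to the document minus redundancy
    with the sentences already selected."""
    selected = []
    candidates = list(range(len(sent_vecs)))
    while candidates and len(selected) < k:
        def score(i):
            redundancy = max(
                (cosine(sent_vecs[i], sent_vecs[j]) for j in selected),
                default=0.0,
            )
            return lam * cosine(sent_vecs[i], doc_vec) - (1 - lam) * redundancy

        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected
```

With 1-best output, `nbest_tf` reduces to ordinary term counting, since the single hypothesis receives weight 1. With n-best input, words that appear only in lower-ranked hypotheses still contribute a fractional count, which is one way alternative recognition candidates could influence both keyword scores and the sentence vectors passed to `mmr_select`.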
Pages: 5310-5313
Page count: 4