Topic extraction based on continuous speech recognition in broadcast news speech

Authors
Ohtsuki, K [1]
Matsuoka, T
Matsunaga, S
Furui, S
Affiliations
[1] NTT Corp, NTT Cyber Space Labs, Yokosuka, Kanagawa 2390847, Japan
[2] NTT E Corp, Broadband Business Dept, Tokyo 1000004, Japan
[3] Tokyo Inst Technol, Dept Comp Sci, Tokyo 1528552, Japan
Keywords
topic extraction; topic word; relevance score; continuous speech recognition; broadcast news
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we propose topic extraction models based on statistical relevance scores between topic words and words in articles, and report results of topic extraction experiments using continuous speech recognition on Japanese broadcast news utterances. We represent the topic of a news item as a combination of multiple topic words, which are either important words in the news article or words relevant to the news. We statistically model the mapping from words in an article to topic words. Using this mapping, the topic extraction model can extract topic words even if they do not appear in the article. We train a topic extraction model that computes the degree of relevance between a topic word and a word in an article, using newspaper text covering a five-year period. The degree of relevance between those words is calculated with measures such as mutual information or the χ² method. In experiments extracting five topic words using a χ²-based model, we achieve 72% precision and 12% recall for speech recognition results. Speech recognition results generally include a number of recognition errors, which degrade topic extraction performance. To mitigate this, we employ N-best candidates and the likelihoods given by the acoustic and language models. In experiments, we find that extracting five topic words using N-best candidates and likelihood values achieves significantly improved precision.
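The relevance-score idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact formulation, so the document-level 2x2 contingency table, the summation of scores over article words, and the softmax-style weighting of N-best hypotheses by log-likelihood are all assumptions made here for illustration. The function names `train_chi2` and `extract_topics` are hypothetical.

```python
import math
from collections import Counter
from itertools import product


def train_chi2(pairs):
    """Estimate chi-square relevance between topic words and article words.

    pairs: iterable of (topic_words, article_words), one entry per training
    article (assumed layout; e.g. headline words vs. body words).
    Returns {(topic_word, article_word): score} for positively associated pairs.
    """
    pairs = [(set(t), set(w)) for t, w in pairs]
    n = len(pairs)
    topic_df, word_df, joint_df = Counter(), Counter(), Counter()
    for topics, words in pairs:
        topic_df.update(topics)
        word_df.update(words)
        joint_df.update(product(topics, words))

    scores = {}
    for (t, w), a in joint_df.items():
        b = topic_df[t] - a      # topic word present, article word absent
        c = word_df[w] - a       # article word present, topic word absent
        d = n - a - b - c        # neither present
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        # Keep only positive associations; skip degenerate tables.
        if denom == 0 or a * d <= b * c:
            continue
        scores[(t, w)] = n * (a * d - b * c) ** 2 / denom
    return scores


def extract_topics(scores, hypotheses, k=5):
    """Rank topic words for one utterance from its N-best recognition results.

    hypotheses: list of (words, log_likelihood) candidates. Each candidate is
    weighted by its normalised likelihood (an assumed weighting scheme), so
    words that occur only in unlikely candidates contribute less.
    """
    max_ll = max(ll for _, ll in hypotheses)
    weights = [math.exp(ll - max_ll) for _, ll in hypotheses]
    total = sum(weights)

    word_weight = Counter()
    for (words, _), wgt in zip(hypotheses, weights):
        for w in set(words):
            word_weight[w] += wgt / total

    topic_score = Counter()
    for (t, w), s in scores.items():
        if w in word_weight:
            topic_score[t] += word_weight[w] * s
    return [t for t, _ in topic_score.most_common(k)]
```

Because the topic words are scored through their association with the recognised words, a topic word can be returned even when it never appears in any hypothesis, which is the property the abstract highlights. Passing a single hypothesis reduces the sketch to scoring the 1-best recognition result.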
Pages: 1138-1144
Number of pages: 7