Information fusion for spoken document retrieval

Author
Ng, K [1]
Affiliation
[1] MIT, Spoken Language Syst Grp, Cambridge, MA 02139 USA
Keywords
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
In this paper we investigate the fusion of different information sources with the goal of improving performance on spoken document retrieval (SDR) tasks. In particular, we explore the use of multiple transcriptions from different automatic speech recognizers, the combination of different types of subword unit indexing terms, and the combination of word and subword-based units. To perform retrieval, we use a novel probabilistic information retrieval model which retrieves documents based on maximum likelihood ratio scores. Experiments on the 1998 TREC-7 SDR task show that the use of these different information fusion approaches can result in significantly improved retrieval performance.
Pages: 2405-2408
Page count: 4