Information retrieval from spoken documents

Authors
Fapso, M [1]
Smrz, P [1]
Schwarz, P [1]
Szöke, I [1]
Schwarz, M [1]
Cernocky, J [1]
Karafiát, M [1]
Burget, L [1]
Affiliation
[1] Brno Univ Technol, Fac Informat Technol, Brno 61266, Czech Republic
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper describes the design and implementation of a system for efficient storage, indexing and search in collections of spoken documents that takes advantage of automatic speech recognition. Because the accuracy of current speech recognizers is not sufficient for many applications, it is necessary to index the ambiguous output of the recognizer, i.e. acyclic graphs of word hypotheses known as recognition lattices. The standard methods known from text-based systems therefore cannot be applied directly. The paper discusses an optimized indexing system, developed by our group, for efficient search in this large and complex data structure. The search engine works as a server. The meeting browser JFerret, developed within the European AMI project, is used as a client to browse search results.
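As a rough illustration of what indexing lattice output can look like, the sketch below builds an inverted index from word hypotheses to the places where they may have been spoken. It is not the system described in the paper: the `Hypothesis` record, its fields, the confidence values and the function names are all assumptions made for this example, and a real lattice index would also need to keep the graph structure between hypotheses.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Hypothesis:
    """One word arc of a recognition lattice (hypothetical layout)."""
    doc_id: str
    word: str
    start: float       # start time in seconds
    end: float         # end time in seconds
    confidence: float  # assumed to be a posterior score in [0, 1]


def build_index(hypotheses):
    """Map each lower-cased word to its hypotheses, best confidence first."""
    index = defaultdict(list)
    for hyp in hypotheses:
        index[hyp.word.lower()].append(hyp)
    for postings in index.values():
        postings.sort(key=lambda h: h.confidence, reverse=True)
    return index


def search(index, word, min_confidence=0.0):
    """Return hypotheses for `word` at or above the confidence threshold."""
    return [h for h in index.get(word.lower(), [])
            if h.confidence >= min_confidence]


# Competing hypotheses for the same time span are all kept in the index.
lattice = [
    Hypothesis("meeting1", "speech", 1.0, 1.4, 0.7),
    Hypothesis("meeting1", "peach", 1.0, 1.4, 0.2),
    Hypothesis("meeting2", "speech", 5.2, 5.6, 0.9),
]
index = build_index(lattice)
hits = search(index, "speech", min_confidence=0.5)
```

Unlike a text index, every competing hypothesis is stored, so a query can still match a word that the recognizer did not rank first; the threshold trades recall against false alarms.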
Pages: 410-416
Page count: 7