English Spoken Term Detection in Multilingual Recordings

被引:0
|
作者
Motlicek, Petr [1 ]
Valente, Fabio [1 ]
Garner, Philip N. [1 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010年
关键词
Spoken Term Detection (STD); LVCSR; Confidence Measure (CM); Out-Of-Language (OOL) detection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the automatic detection of English spoken terms in a multi-language scenario over real lecture recordings. Spoken Term Detection (STD) is based on an LVCSR where the output is represented in the form of word lattices. The lattices are then used to search the required terms. Processed lectures are mainly composed of English, French and Italian recordings where the language can also change within one recording. Therefore, the English STD system uses an Out-Of-Language (OOL) detection module to filter out non-English input segments. OOL detection is evaluated w.r.t. various confidence measures estimated from word lattices. Experimental studies of OOL detection followed by English STD are performed on several hours of multilingual recordings. Significant improvement of OOL+STD over a stand-alone STD system is achieved (relatively more than 50% in EER). Finally, an additional modality (text slides in the form of PowerPoint presentations) is exploited to improve STD.
引用
收藏
页码:206 / 209
页数:4
相关论文
共 50 条
  • [21] Spoken term detection based on DTW
    Hou J.
    Xie L.
    Yang P.
    Xiao X.
    Leung C.-C.
    Xu H.
    Wang L.
    Lü H.
    Ma B.
    Chng E.
    Li H.
    Xie, Lei (lxie@nwpu.edu.cn), 1600, Tsinghua University (57): : 18 - 23
  • [22] Multilingual spoken language processing - Challenges for multilingual systems
    Fung, Pascale
    Schultz, Tanja
    IEEE SIGNAL PROCESSING MAGAZINE, 2008, 25 (03) : 89 - 97
  • [23] EXPLOITING DIVERSITY FOR SPOKEN TERM DETECTION
    Mangu, Lidia
    Soltau, Hagen
    Kuo, Hong-Kwang
    Kingsbury, Brian
    Saon, George
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8282 - 8286
  • [24] Optimization of Spoken Term Detection System
    Wang, Chuanxu
    Zhang, Pengyuan
    JOURNAL OF APPLIED MATHEMATICS, 2012,
  • [25] Lattice Indexing for Spoken Term Detection
    Can, Dogan
    Saraclar, Murat
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2338 - 2347
  • [26] Semantically Expanded Spoken Term Detection
    Kozhirbayev, Zhanibek
    Yessenbayev, Zhandos
    IEEE ACCESS, 2024, 12 : 177844 - 177855
  • [27] ASSESSING SEARCH TERM STRENGTH IN SPOKEN TERM DETECTION
    Torbati, Amir Hossein Harati Nejad
    Picone, Joe
    2013 IEEE INTERNATIONAL MULTI-DISCIPLINARY CONFERENCE ON COGNITIVE METHODS IN SITUATION AWARENESS AND DECISION SUPPORT (COGSIMA), 2013, : 114 - 117
  • [28] Multilingual hope speech detection in English and Dravidian languages
    Chakravarthi, Bharathi Raja
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2022, 14 (04) : 389 - 406
  • [29] Multilingual hope speech detection in English and Dravidian languages
    Bharathi Raja Chakravarthi
    International Journal of Data Science and Analytics, 2022, 14 : 389 - 406
  • [30] Model-Based Unsupervised Spoken Term Detection with Spoken Queries
    Chan, Chun-an
    Lee, Lin-shan
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (07): : 1330 - 1342