English Spoken Term Detection in Multilingual Recordings

Cited by: 0
Authors
Motlicek, Petr [1 ]
Valente, Fabio [1 ]
Garner, Philip N. [1 ]
Affiliations
[1] Idiap Res Inst, Martigny, Switzerland
Source
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010
Keywords
Spoken Term Detection (STD); LVCSR; Confidence Measure (CM); Out-Of-Language (OOL) detection;
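DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract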
This paper investigates the automatic detection of English spoken terms in a multi-language scenario over real lecture recordings. Spoken Term Detection (STD) is based on a large-vocabulary continuous speech recognition (LVCSR) system whose output is represented as word lattices. The lattices are then searched for the required terms. The processed lectures consist mainly of English, French and Italian recordings, and the language can also change within a single recording. The English STD system therefore uses an Out-Of-Language (OOL) detection module to filter out non-English input segments. OOL detection is evaluated with respect to various confidence measures estimated from the word lattices. Experimental studies of OOL detection followed by English STD are performed on several hours of multilingual recordings. The combined OOL+STD system achieves a significant improvement over a stand-alone STD system (a relative improvement of more than 50% in equal error rate, EER). Finally, an additional modality (text slides in the form of PowerPoint presentations) is exploited to improve STD.
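The OOL-then-STD pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Segment` type, the single `confidence` score, the 0.5 threshold, and the use of 1-best word sequences in place of full word lattices are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    # 1-best word sequence, standing in for a full word lattice (assumption).
    words: List[str]
    # Hypothetical lattice-derived confidence in [0, 1]; low values are
    # taken to suggest out-of-language (non-English) speech.
    confidence: float


def filter_out_of_language(segments: List[Segment], threshold: float = 0.5) -> List[Segment]:
    """Keep only segments whose confidence meets the (assumed) threshold."""
    return [s for s in segments if s.confidence >= threshold]


def detect_term(segments: List[Segment], term: str) -> List[int]:
    """Return positions in `segments` whose word sequence contains the term."""
    term_words = term.lower().split()
    n = len(term_words)
    hits = []
    for i, seg in enumerate(segments):
        words = [w.lower() for w in seg.words]
        # Slide a window of the term's length over the segment's words.
        if any(words[j:j + n] == term_words for j in range(len(words) - n + 1)):
            hits.append(i)
    return hits


# Toy data: the second segment mimics a false alarm that an English
# recognizer might produce on non-English speech (low confidence).
segments = [
    Segment(["speech", "recognition", "works"], 0.9),
    Segment(["speech", "recognition"], 0.2),
    Segment(["the", "word", "lattice"], 0.8),
]

stand_alone_hits = detect_term(segments, "speech recognition")
english_only = filter_out_of_language(segments)
filtered_hits = detect_term(english_only, "speech recognition")
```

In this toy setup, the stand-alone search also returns the low-confidence segment, while searching only the segments that pass the OOL filter drops it. A real system would tune the threshold on held-out data and search the lattices rather than 1-best output.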
Pages: 206-209
Number of pages: 4