English Spoken Term Detection in Multilingual Recordings

Cited by: 0
Authors
Motlicek, Petr [1]
Valente, Fabio [1]
Garner, Philip N. [1]
Affiliation
[1] Idiap Res Inst, Martigny, Switzerland
Keywords
Spoken Term Detection (STD); LVCSR; Confidence Measure (CM); Out-Of-Language (OOL) detection
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper investigates the automatic detection of English spoken terms in a multi-language scenario over real lecture recordings. Spoken Term Detection (STD) is based on an LVCSR system whose output is represented in the form of word lattices. The lattices are then searched for the required terms. The processed lectures are mainly composed of English, French and Italian recordings, and the language can also change within one recording. Therefore, the English STD system uses an Out-Of-Language (OOL) detection module to filter out non-English input segments. OOL detection is evaluated w.r.t. various confidence measures estimated from word lattices. Experimental studies of OOL detection followed by English STD are performed on several hours of multilingual recordings. A significant improvement of OOL+STD over a stand-alone STD system is achieved (a relative improvement of more than 50% in EER). Finally, an additional modality (text slides in the form of PowerPoint presentations) is exploited to improve STD.
Pages: 206-209
Number of pages: 4
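
The abstract describes filtering non-English segments with a lattice-derived confidence measure and reporting results as equal error rate (EER). The following is a minimal illustrative sketch of those two ideas only, not the authors' implementation: the `Segment` type, the `confidence` field, the threshold, and all numeric values are hypothetical, and the EER is approximated by a simple threshold sweep.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Segment:
    """Hypothetical container for one input segment."""
    confidence: float  # assumed segment-level confidence derived from a word lattice
    is_english: bool   # ground-truth label, used only for evaluation


def filter_out_of_language(segments: Sequence[Segment], threshold: float) -> List[Segment]:
    """Keep only segments whose confidence reaches the threshold.

    Assumption: low confidence indicates out-of-language (OOL) input.
    """
    return [s for s in segments if s.confidence >= threshold]


def equal_error_rate(scores: Sequence[float], labels: Sequence[bool]) -> float:
    """Approximate the EER by sweeping a threshold over the observed scores.

    A higher score is taken to mean "more likely a target". The EER is the
    operating point where the miss rate and false-alarm rate are closest.
    """
    targets = [s for s, is_target in zip(scores, labels) if is_target]
    nontargets = [s for s, is_target in zip(scores, labels) if not is_target]
    best_gap, eer = float("inf"), 1.0
    for t in sorted(set(scores)):
        miss = sum(s < t for s in targets) / len(targets)
        false_alarm = sum(s >= t for s in nontargets) / len(nontargets)
        gap = abs(miss - false_alarm)
        if gap < best_gap:
            best_gap, eer = gap, (miss + false_alarm) / 2
    return eer


def relative_reduction(baseline: float, improved: float) -> float:
    """Relative improvement, e.g. 0.5 means the error rate was halved."""
    return (baseline - improved) / baseline


# Made-up data for illustration only.
segments = [
    Segment(0.9, True), Segment(0.8, True), Segment(0.7, True), Segment(0.4, True),
    Segment(0.6, False), Segment(0.3, False), Segment(0.2, False), Segment(0.1, False),
]
kept = filter_out_of_language(segments, threshold=0.5)
eer = equal_error_rate([s.confidence for s in segments],
                       [s.is_english for s in segments])
```

Under this reading, a relative EER improvement of more than 50% means `relative_reduction(baseline_eer, new_eer) > 0.5`, where both arguments are EERs measured on the same test data.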