N-Gram FST Indexing for Spoken Term Detection

被引:0
|
作者
Liu, Chao [1 ]
Wang, Dong [1 ]
Tejedor, Javier
机构
[1] Tsinghua Univ, Ctr Speech & Language Technol, Beijing, Peoples R China
关键词
spoken term indexing; finite state transducer; spoken term detection; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An efficient indexing scheme is essentially important for spoken term detection (STD) on large databases, particularly for phone-based systems that have been widely adopted to achieve vocabulary-independent detection. While the finite state transducer (FST) composition provides a standard indexing approach, the n-gram reverse indexing is more flexible in connectivity representation and confidence measuring and therefore may result in better performance than searching within the original lattices or the equivalent FSTs. In this paper we present an n-gram FST indexing approach which combines the flexibility of n-gram indexing and the efficiency of FST indexing. Specifically, we employ the n-gram indexing to relax connectivity in original lattices and then formalize the indices into an FST for online search. We demonstrate this approach with a phone-based STD task where the lattice is sparse due to strong language models. The results show that n-gram FST indexing provides not only better detection performance than lattice search, but also a faster detection than both conventional n-gram and FST indexing.
引用
收藏
页码:2091 / 2094
页数:4
相关论文
共 50 条
  • [21] Automatic detection of task-incompleted dialog for spoken dialog system based on dialog act N-gram
    Hara, Sunao
    Kitaoka, Norihide
    Takeda, Kazuya
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 3034 - 3037
  • [22] Multi-Scale Chroma n-Gram Indexing for Cover Song Identification
    Seo, Jin S.
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (01) : 59 - 62
  • [23] Metric Subspace Indexing for Fast Spoken Term Detection
    Kaneko, Taisuke
    Akiba, Tomoyosi
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 689 - 692
  • [24] N-gram Insight
    Prans, George
    AMERICAN SCIENTIST, 2011, 99 (05) : 356 - 357
  • [25] Byte Level n-Gram Analysis for Malware Detection
    Jain, Sacbin
    Meena, Yogesb Kumar
    COMPUTER NETWORKS AND INTELLIGENT COMPUTING, 2011, 157 : 51 - 59
  • [26] N-Gram Based Secure Similar Document Detection
    Jiang, Wei
    Samanthula, Bharath K.
    DATA AND APPLICATIONS SECURITY AND PRIVACY XXV, 2011, 6818 : 239 - 246
  • [27] Bugram: Bug Detection with N-gram Language Models
    Wang, Song
    Chollak, Devin
    Movshovitz-Attias, Dana
    Tan, Lin
    2016 31ST IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2016, : 708 - 719
  • [28] An evaluation of n-gram correspondence models for transliteration detection
    Department of Information Systems, SCIT, CoCIS, Makerere University, Kampala, Uganda
    Lect. Notes Electr. Eng., (615-622):
  • [29] HTTP attack detection using n-gram analysis
    Oza, Aditya
    Ross, Kevin
    Low, Richard M.
    Stamp, Mark
    COMPUTERS & SECURITY, 2014, 45 : 242 - 254
  • [30] A unified context-free grammar and n-gram model for spoken language processing
    Wang, YY
    Mahajan, M
    Huang, XD
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1639 - 1642