N-Gram FST Indexing for Spoken Term Detection

被引:0
|
作者
Liu, Chao [1 ]
Wang, Dong [1 ]
Tejedor, Javier
机构
[1] Tsinghua Univ, Ctr Speech & Language Technol, Beijing, Peoples R China
关键词
spoken term indexing; finite state transducer; spoken term detection; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An efficient indexing scheme is essentially important for spoken term detection (STD) on large databases, particularly for phone-based systems that have been widely adopted to achieve vocabulary-independent detection. While the finite state transducer (FST) composition provides a standard indexing approach, the n-gram reverse indexing is more flexible in connectivity representation and confidence measuring and therefore may result in better performance than searching within the original lattices or the equivalent FSTs. In this paper we present an n-gram FST indexing approach which combines the flexibility of n-gram indexing and the efficiency of FST indexing. Specifically, we employ the n-gram indexing to relax connectivity in original lattices and then formalize the indices into an FST for online search. We demonstrate this approach with a phone-based STD task where the lattice is sparse due to strong language models. The results show that n-gram FST indexing provides not only better detection performance than lattice search, but also a faster detection than both conventional n-gram and FST indexing.
引用
收藏
页码:2091 / 2094
页数:4
相关论文
共 50 条
  • [31] Detection of task-incomplete dialogs based on utterance-and-behavior tag N-gram for spoken dialog systems
    Hara, Sunao
    Kitaoka, Norihide
    Takeda, Kazuya
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1312 - 1315
  • [32] Evaluation of N-gram term conflation approach for arabic texts
    Abu-Salem, H
    PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS I AND II, 2003, : 2561 - 2567
  • [33] Pseudo-Conventional N-Gram Representation of the Discriminative N-Gram Model for LVCSR
    Zhou, Zhengyu
    Meng, Helen
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (06) : 943 - 952
  • [34] Pipilika N-gram Viewer: An Efficient Large Scale N-gram Model for Bengali
    Ahmad, Adnan
    Talha, Mahbubur Rub
    Amin, Md. Ruhul
    Chowdhury, Farida
    2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,
  • [35] Selection of Best Match Keyword using Spoken Term Detection for Spoken Document Indexing
    Domoto, Kentaro
    Utsuro, Takehito
    Sawada, Naoki
    Nishizaki, Hiromitsu
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [36] Association Analysis and N-Gram Based Detection of Incorrect Arguments
    Li C.
    Liu H.
    Ruan Jian Xue Bao/Journal of Software, 2018, 29 (08): : 2243 - 2257
  • [37] N-GRAM ANALYSIS FOR SLEEPING CELL DETECTION IN LTE NETWORKS
    Chernogorov, Fedor
    Ristaniemi, Tapani
    Brigatti, Kimmo
    Chernov, Sergey
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 4439 - 4443
  • [38] Malicious Domain Names Detection Algorithm Based on N-Gram
    Zhao, Hong
    Chang, Zhaobin
    Bao, Guangbin
    Zeng, Xiangyan
    JOURNAL OF COMPUTER NETWORKS AND COMMUNICATIONS, 2019, 2019
  • [39] N-gram模型综述
    尹陈
    吴敏
    计算机系统应用, 2018, 27 (10) : 33 - 38
  • [40] N-gram over Context
    Kawamae, Noriaki
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 1045 - 1055