Fast fuzzy keyword spotting using syllable confusion network indexing

被引:0
|
作者
Jian, Shao [1 ]
Qingwei, Zhao [1 ]
Pengyuan, Zhang [1 ]
Zhaojie, Liu [1 ]
Yonghong, Yan [1 ]
机构
[1] Chinese Acad Sci, Inst Acoust, ThinkIT Speech Lab, Beijing 100080, Peoples R China
关键词
keyword spotting; syllable confusion network; syllable confusion matrix; Mandarin spontaneous speech;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a fast fuzzy search algorithm to extract keyword candidates from Syllable confusion networks (SCNs) in Mandarin spontaneous speech. Since the recognition accuracy of spontaneous speech is quite poor, Syllable confusion matrix (SCM) is applied to compensate for the recognition errors and to improve recall. In order to scale up to large collections and support quick query response, an efficient vocabulary-independent index structure is designed, which selects individual arcs of syllable confusion network as indexing unit. An inverted search algorithm that use syllable confusion matrix to calculate relevance score and search in this index structure is proposed. In experiments performed on a telephone conversational task, the Equal error rate (EER) was reduced by about 33% relative over the baseline where keywords are directly extracted from phoneme lattices. Additionally, it only took computer one or two seconds to search 100 keywords in one hour speech data.
引用
收藏
页码:265 / 269
页数:5
相关论文
共 50 条
  • [1] A Fast Fuzzy Keyword Spotting Algorithm Based on Syllable Confusion Network
    Shao, Jian
    Zhao, Qingwei
    Zhang, Pengyuan
    Liu, Zhaojie
    Yan, Yonghong
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1665 - 1668
  • [2] Keyword spotting based on syllable confusion network
    Zhang, Pengyuan
    Shao, Jian
    Zhao, Qingwei
    Yan, Yonghong
    [J]. ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2007, : 656 - +
  • [3] Assisted keyword indexing for lecture videos using unsupervised keyword spotting
    Kanadje, Manish
    Miller, Zachary
    Agarwal, Anurag
    Gaborski, Roger
    Zanibbi, Richard
    Ludi, Stephanie
    [J]. PATTERN RECOGNITION LETTERS, 2016, 71 : 8 - 15
  • [4] Keyword spotting for multimedia document indexing
    Gelin, P
    Wellekens, CJ
    [J]. MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS II, 1997, 3229 : 366 - 377
  • [5] Keyword spotting for video soundtrack indexing.
    Gelin, P
    Wellekens, CJ
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 299 - 302
  • [6] Keyword spotting enhancement for video soundtrack indexing
    Gelin, P
    Wellekens, CJ
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 586 - 589
  • [7] Discriminatively trained phoneme confusion model for keyword spotting
    Karanasou, Panagiota
    Burget, Lukas
    Vergyri, Dimitra
    Akbacak, Murat
    Mandal, Arindam
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2433 - 2436
  • [8] Line Segmentation Free Probabilistic Keyword Spotting and Indexing
    Barrere, Killian
    Toselli, Alejandro H.
    Vidal, Enrique
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, IBPRIA 2019, PT II, 2019, 11868 : 201 - 213
  • [9] A fast keyword-spotting technique
    Li, Linlin
    Lu, Shijian
    Tan, Chew Lim
    [J]. ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 68 - 72
  • [10] Fast Keyword Spotting in Telephone Speech
    Nouza, Jan
    Silovsky, Jan
    [J]. RADIOENGINEERING, 2009, 18 (04) : 665 - 670