A multi-phase approach for fast spotting of large vocabulary Chinese keywords from Mandarin speech using prosodic information

被引:0
|
作者
Bai, BR
Tseng, CY
Lee, LS
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a multi-phase approach for fast spotting of large vocabulary Chinese keywords from a spontaneous Mandarin speech utterance using prosodic knowledge. Without searching through the whole utterance using large number of keyword models, the multi-phase framework proposed here including some special scoring schemes provides very good efficiency by considering the monosyllable-based structure of Mandarin Chinese. This approach is therefore very fast due to very goad boundary estimations and the deletion of most impossible syllable and keyword candidates using context independent models, and also very accurate with the carefully designed scoring processes. A task with 2611 keywords was tested here. An inclusion rate of 85.79% for the top 10 candidates is attained, at a speed requiring only 1.2 times of the utterance length on a Spare 20 workstation.
引用
收藏
页码:903 / 906
页数:4
相关论文
共 10 条
  • [1] Utterance verification using prosodic information for Mandarin telephone speech keyword spotting
    Chen, YJ
    Wu, CH
    Yan, GL
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 697 - 700
  • [2] Improved Large Vocabulary Mandarin Speech Recognition Using Prosodic and Lexical Information in Maximum Entropy Framework
    Ni, Chongjia
    Liu, Wenju
    Xu, Bo
    PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 626 - 629
  • [3] Improved Large Vocabulary Mandarin Speech Recognition by Selectively Using Tone Information with a Two-stage Prosodic Model
    Cheng, Li-Wei
    Lee, Lin-Shan
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1137 - 1140
  • [4] Use of prosodic information to integrate acoustic and linguistic knowledge in continuous Mandarin speech recognition with very large vocabulary
    Hsieh, HY
    Lyu, RY
    Lee, LS
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 809 - 812
  • [5] A multi-stream audio-video large-vocabulary Mandarin Chinese speech database
    Liang, LH
    Luo, Y
    Huang, FY
    Nefian, AV
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1787 - 1790
  • [6] Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data
    Wang, HM
    Ho, TH
    Yang, RC
    Shen, JL
    Bai, BR
    Hong, JC
    Chen, WP
    Yu, TL
    Lee, LS
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (02): : 195 - 200
  • [7] Fast and accurate recognition of very-large-vocabulary continuous mandarin speech for Chinese language with improved segmental probability modeling
    Shen, JL
    Lee, LS
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 125 - 128
  • [8] Fast speaker adaptation of large vocabulary continuous density HMM speech recognizer using a basis transform approach
    Boulis, C
    Digalakis, V
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 989 - 992
  • [9] A multi-phase approach for developing a conceptual model and preliminary content for patient-reported outcome measurement in TKA patients: from a Chinese perspective
    Xu, Chao
    Wei, Jie
    Li, Liang
    Yao, Shuxin
    Chang, Xiaofeng
    Ma, Jianbing
    Shang, Lei
    QUALITY OF LIFE RESEARCH, 2025, 34 (03) : 763 - 775
  • [10] Liver Tumor Localization and Characterization from Multi-phase MR Volumes Using Key-Slice Prediction: A Physician-Inspired Approach
    Lai, Bolin
    Wu, Yuhsuan
    Bai, Xiaoyu
    Zhou, Xiao-Yun
    Wang, Peng
    Cai, Jinzheng
    Huo, Yuankai
    Huang, Lingyun
    Xia, Yong
    Xiao, Jing
    Lu, Le
    Hu, Heping
    Harrison, Adam
    PREDICTIVE INTELLIGENCE IN MEDICINE, PRIME 2021, 2021, 12928 : 47 - 58