Removal of Heterogeneous Candidates Using Positional Accuracy Based on Levenshtein Distance on Isolated n-best Recognition

被引:0
|
作者
Yun, Young-Sun [1 ]
机构
[1] Hannam Univ, Dept Informat & Commun Engn, 70 Hananm To, Daejeon 306791, South Korea
来源
关键词
Levenshtein Distance; Isolated Word Recognition; n-best Candidates Selection; Positional Accuracy;
D O I
10.7776/ASK.2011.30.8.428
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many isolated word recognition systems may generate irrelevant words for recognition results because they use only acoustic information or small amount of language information. In this paper, I propose word similarity that is used for selecting (or removing) less common words from candidates by applying Levenshtein distance. Word similarity is obtained by using positional accuracy that reflects the frequency information along to character's alignment information. This paper also discusses various improving techniques of selection of disparate words. The methods include different loss values, phone accuracy based on confusion information, weights of candidates by ranking order and partial comparisons. Through experiments, I found that the proposed methods are effective for removing heterogeneous words without loss of performance.
引用
收藏
页码:428 / 435
页数:8
相关论文
共 29 条
  • [1] Determination of the number of candidates using recognition scores for N-best based speech interface
    Cho, K
    Yamashita, Y
    Proceedings of the Sixth IASTED International Conference on Signal and Image Processing, 2004, : 268 - 272
  • [2] An N-Best Candidates-Based Discriminative Training for Speech Recognition Applications
    Chen, Jung-Kuei
    Soong, Frank K.
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 206 - 216
  • [3] ISOLATED WORD RECOGNITION USING THE WEIGHTED LEVENSHTEIN DISTANCE
    ACKROYD, MH
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (02): : 243 - 244
  • [4] N-best vector quantization for isolated word speech recognition
    Nose, Masaya
    Maki, Shuichi
    Yartiane, Noburnoto
    Morikawa, Yoshitaka
    PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-8, 2007, : 2053 - +
  • [5] A word graph based N-Best search in continuous speech recognition
    Tran, BH
    Seide, F
    Steinbiss, V
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2127 - 2130
  • [6] Semantic Features Based N-Best Rescoring Methods for Automatic Speech Recognition
    Liu, Chang
    Zhang, Pengyuan
    Li, Ta
    Yan, Yonghong
    APPLIED SCIENCES-BASEL, 2019, 9 (23):
  • [7] USING DEEP-Q NETWORK TO SELECT CANDIDATES FROM N-BEST SPEECH RECOGNITION HYPOTHESES FOR ENHANCING DIALOGUE STATE TRACKING
    Tsai, Richard Tzong-Han
    Chen, Chia-Hao
    Wu, Chun-Kai
    Hsiao, Yu-Cheng
    Lee, Hung-yi
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7375 - 7379
  • [8] BERT-based Semantic Model for Rescoring N-best Speech Recognition List
    Fohr, Dominique
    Illina, Irina
    INTERSPEECH 2021, 2021, : 1867 - 1871
  • [9] N-best Based Stochastic Mapping on Stereo HMM for Noise Robust Speech Recognition
    Cui, Xiaodong
    Afify, Mohamed
    Gao, Yuqing
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1261 - +
  • [10] USING N-BEST RECOGNITION OUTPUT FOR EXTRACTIVE SUMMARIZATION AND KEYWORD EXTRACTION IN MEETING SPEECH
    Liu, Yang
    Xie, Shasha
    Liu, Fei
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5310 - 5313