Removal of Heterogeneous Candidates Using Positional Accuracy Based on Levenshtein Distance on Isolated n-best Recognition

被引:0
|
作者
Yun, Young-Sun [1 ]
机构
[1] Hannam Univ, Dept Informat & Commun Engn, 70 Hananm To, Daejeon 306791, South Korea
来源
关键词
Levenshtein Distance; Isolated Word Recognition; n-best Candidates Selection; Positional Accuracy;
D O I
10.7776/ASK.2011.30.8.428
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many isolated word recognition systems may generate irrelevant words for recognition results because they use only acoustic information or small amount of language information. In this paper, I propose word similarity that is used for selecting (or removing) less common words from candidates by applying Levenshtein distance. Word similarity is obtained by using positional accuracy that reflects the frequency information along to character's alignment information. This paper also discusses various improving techniques of selection of disparate words. The methods include different loss values, phone accuracy based on confusion information, weights of candidates by ranking order and partial comparisons. Through experiments, I found that the proposed methods are effective for removing heterogeneous words without loss of performance.
引用
收藏
页码:428 / 435
页数:8
相关论文
共 29 条
  • [21] Improving N-Best Rescoring in Under-Resourced Code-Switched Speech Recognition Using Pretraining and Data Augmentation
    van Vuren, Joshua Jansen
    Niesler, Thomas
    LANGUAGES, 2022, 7 (03)
  • [22] Simultaneous Recognition of Distant-Talking Speech of Multiple Talkers Based on the 3-D N-Best Search Method
    Panikos Heracleous
    Satoshi Nakamura
    Kiyohiro Shikano
    Journal of VLSI signal processing systems for signal, image and video technology, 2004, 36 : 105 - 116
  • [23] SIDiLDNG: A similarity-based intrusion detection system using improved Levenshtein Distance and N-gram for CAN
    Song, Jiaru
    Qin, Guihe
    Liang, Yanhua
    Yan, Jie
    Sun, Minghui
    COMPUTERS & SECURITY, 2024, 142
  • [24] Simultaneous recognition of distant-talking speech of multiple sound sources based on 3-D N-best search algorithm
    Heracleous, P
    Nakamura, S
    Shikano, K
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 111 - 114
  • [25] DEGRADED GRAY-SCALE TEXT RECOGNITION USING PSEUDO-2D HIDDEN MARKOV-MODELS AND N-BEST HYPOTHESES
    YEN, CC
    KUO, SS
    GRAPHICAL MODELS AND IMAGE PROCESSING, 1995, 57 (02): : 131 - 145
  • [26] A microphone array-based 3-D N-best search algorithm for the simultaneous recognition of multiple sound sources in real environments
    Heracleous, P
    Nakamura, S
    Shikano, K
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 193 - 196
  • [27] Prefix Tree based N-best list Re-scoring for Recurrent Neural Network Language Model used in Speech Recognition System
    Si, Yujing
    Zhang, Qingqing
    Li, Ta
    Pan, Jielin
    Yan, Yonghong
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3386 - 3390
  • [28] IMPROVING NOISE ROBUSTNESS FOR SPOKEN CONTENT RETRIEVAL USING SEMI-SUPERVISED ASR AND N-BEST TRANSCRIPTS FOR BERT-BASED RANKING MODELS
    Moriya, Yasufumi
    Jones, Gareth. J. F.
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 398 - 405
  • [29] Improving the Accuracy of Estimates of Indoor Distance Moved Using Deep Learning-Based Movement Status Recognition
    Ma, Zhenjie
    Zhang, Wenjun
    Shi, Ke
    SENSORS, 2022, 22 (01)