SCORE NORMALIZATION AND SYSTEM COMBINATION FOR IMPROVED KEYWORD SPOTTING

被引:0
|
作者
Karakos, Damianos [1 ]
Schwartz, Richard [1 ]
Tsakalidis, Stavros [1 ]
Zhang, Le [1 ]
Ranjan, Shivesh [1 ]
Ng, Tim [1 ]
Hsiao, Roger [1 ]
Saikumar, Guruprasad [1 ]
Bulyko, Ivan [1 ]
Long Nguyen [1 ]
Makhoul, John [1 ]
Grezl, Frantisek [2 ]
Hannemann, Mirko [2 ]
Karafiat, Martin [2 ]
Szoke, Igor [2 ]
Vesely, Karel [2 ]
Lamel, Lori [3 ]
Le, Viet-Bac [4 ]
机构
[1] Raytheon BBN Technol, Cambridge, MA 02138 USA
[2] Brno Univ Technol, SpeechFIT, Brno, Czech Republic
[3] CNRS LIMSI, Paris, France
[4] Vocapia Res, Paris, France
关键词
keyword search; score normalization; system combination; indexing and search;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present two techniques that are shown to yield improved Keyword Spotting (KWS) performance when using the ATWV/MTWV performance measures: (i) score normalization, where the scores of different keywords become commensurate with each other and they more closely correspond to the probability of being correct than raw posteriors; and (ii) system combination, where the detections of multiple systems are merged together, and their scores are interpolated with weights which are optimized using MTWV as the maximization criterion. Both score normalization and system combination approaches show that significant gains in ATWV/MTWV can be obtained, sometimes on the order of 8-10 points (absolute), in five different languages. A variant of these methods resulted in the highest performance for the official surprise language evaluation for the IARPA-funded Babel project in April 2013.
引用
下载
收藏
页码:210 / 215
页数:6
相关论文
共 50 条
  • [1] White Listing and Score Normalization for Keyword Spotting of Noisy Speech
    Zhang, Bing
    Schwartz, Richard
    Tsakalidis, Stavros
    Long Nguyen
    Matsoukas, Spyros
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1830 - 1833
  • [2] Comparison of Multiple System Combination Techniques for Keyword Spotting
    Hartmann, William
    Zhang, Le
    Barnes, Kerri
    Hsiao, Roger
    Tsakalidis, Stavros
    Schwartz, Richard
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1913 - 1917
  • [3] KEYWORD-SPECIFIC NORMALIZATION BASED KEYWORD SPOTTING FOR SPONTANEOUS SPEECH
    Li, Weifeng
    Liao, Qingmin
    2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 233 - 237
  • [4] Score Normalization for Keyword Search
    Sari, Leda
    Saraclar, Murat
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 761 - 764
  • [5] Improved Keyword Spotting based on Keyword/Garbage Models
    Chen, Qiyu
    Zhang, Weibin
    Xu, Xiangmin
    Xing, Xiaofen
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [6] A novel keyword rescoring method for improved spoken keyword spotting
    Rebai, Ilyes
    BenAyed, Yassine
    Mahdi, Walid
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 312 - 320
  • [7] Effective Combination of DenseNet and BiLSTM for Keyword Spotting
    Zeng, Mengjun
    Xiao, Nanfeng
    IEEE ACCESS, 2019, 7 : 10767 - 10775
  • [8] RICH SYSTEM COMBINATION FOR KEYWORD SPOTTING IN NOISY AND ACOUSTICALLY HETEROGENEOUS AUDIO STREAMS
    Akbacak, Murat
    Burget, Lukas
    Wang, Wen
    van Hout, Julien
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8267 - 8271
  • [9] DISCRIMINATIVE SCORE NORMALIZATION FOR KEYWORD SEARCH DECISION
    Van Tung Pham
    Xu, Haihua
    Chen, Nancy F.
    Sivadas, Sunil
    Lim, Boon Pang
    Chng, Eng Siong
    Li, Haizhou
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [10] Novel Score Normalization Methods for Keyword Search
    Gundogdu, Batuhan
    Saraclar, Murat
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,