Data selection for speech recognition

被引:36
|
作者
Wu, Yi [1 ]
Zhang, Rong [1 ]
Rudnicky, Alexander [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
关键词
data selection; maximum entropy; speech recognition; acoustic modeling;
D O I
10.1109/ASRU.2007.4430173
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a strategy for efficiently selecting informative data from large corpora of transcribed speech. We propose to choose data uniformly according to the distribution of some target speech unit (phoneme, word, character, etc). In our experiment, in contrast to the common belief that "there is no data like more data", we found it possible to select a highly informative subset of data that produces recognition performance comparable to a system that makes use of a much larger amount of data. At the same time, our selection process is efficient and fast.
引用
收藏
页码:562 / 565
页数:4
相关论文
共 50 条
  • [1] Multilingual Data Selection For Low Resource Speech Recognition
    Thomas, Samuel
    Audhkhasi, Kartik
    Cui, Jia
    Kingsbury, Brian
    Ramabhadran, Bhuvana
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3853 - 3857
  • [2] TOWARDS DATA SELECTION ON TTS DATA FOR CHILDREN'S SPEECH RECOGNITION
    Wang, Wei
    Zhou, Zhikai
    Lu, Yizhou
    Wang, Hongji
    Du, Chenpeng
    Qian, Yanmin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6888 - 6892
  • [3] Unsupervised Language Model Adaptation by Data Selection for Speech Recognition
    Khassanov, Yerbolat
    Chong, Tze Yuang
    Bigot, Benjamin
    Chng, Eng Siong
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 508 - 517
  • [4] UNSUPERVISED DATA SELECTION FOR SPEECH RECOGNITION WITH CONTRASTIVE LOSS RATIOS
    Park, Chanho
    Ahmad, Rehan
    Hain, Thomas
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8587 - 8591
  • [5] Feature selection of mime speech recognition using surface electromyography data
    Zhang, Ming
    Zhang, Wei
    Zhang, Bixuan
    Wang, You
    Li, Guang
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3173 - 3178
  • [6] In search of optimal data selection for training of automatic speech recognition systems
    Nagórski, A
    Boves, L
    Steeneken, H
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 67 - 72
  • [7] SNR-Selection-Based-Data Augmentation for Dysarthric Speech Recognition
    Nawroly, Sarkhell Sirwan
    Popescu, Decebal Gheorghe
    Antony, Mariya Celin Thekekara
    Philominal, Actlin Jeeva Muthu
    STUDIES IN INFORMATICS AND CONTROL, 2023, 32 (04): : 129 - 140
  • [8] Data Selection Based on Phoneme Affinity Matrix for Electrolarynx Speech Recognition
    Hsieh, I-Ting
    Wu, Chung-Hsien
    Tsai, Shu-Wei
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 2196 - 2202
  • [9] Textual Data Selection for Language Modelling in the Scope of Automatic Speech Recognition
    Mezzoudj, Freha
    Langlois, David
    Jouvet, Denis
    Benyettou, Abdelkader
    1ST INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE AND SPEECH PROCESSING, 2018, 128 : 55 - 64
  • [10] SUBMODULAR DATA SELECTION WITH ACOUSTIC AND PHONETIC FEATURES FOR AUTOMATIC SPEECH RECOGNITION
    Ni, Chongjia
    Wang, Lei
    Liu, Haibo
    Leung, Cheung-Chi
    Lu, Li
    Ma, Bin
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4629 - 4633