Using prompts to produce quality corpus for training automatic speech recognition systems

被引:0
|
作者
Lecouteux, Benjamin [1 ]
Linares, Georges [1 ]
机构
[1] Univ Avignon, LIA, Avignon, France
关键词
speech recognition; closed captioning; corpus building; automatic segmentation;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this paper we present an integrated unsupervised method to produce a quality corpus for training automatic speech recognition system (ASR) using prompts or closed captions. Closed captions and prompts do not always have timestamps and do not necessarily correspond to the exact speech. We propose a method allowing to extract quality corpus from imperfect transcript. The proposed approach works in two steps. During the search, the ASR system finds matching segments in a large prompt database. Matching segments are then used inside a Driven Decoding Algorithm (DDA) to produce a high quality corpus. Results show a F-measure of 96% in term of spotting while the DDA corrects the output according to the prompts: a high quality corpus is easily extracted. (1)
引用
收藏
页码:820 / 825
页数:6
相关论文
共 50 条
  • [21] Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
    Adiga, Devaraja
    Kumar, Rishabh
    Krishna, Amrith
    Jyothi, Preethi
    Ramakrishnan, Ganesh
    Goyal, Pawan
    Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 2021, : 5039 - 5050
  • [22] Automatic Speech Recognition Performance for Training on Noised Speech
    Prodeus, Arkadiy
    Kukharicheva, Kateryna
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION AND COMMUNICATION TECHNOLOGIES-2017 (AICT 2017), 2017, : 71 - 74
  • [23] Towards a Continuous Speech Corpus for Banking Domain Automatic Speech Recognition
    Suciu, George
    Toma, Stefan-Adrian
    Cheyeresan, Romulus
    2017 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2017,
  • [24] Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
    Adiga, Devaraja
    Kumar, Rishabh
    Krishna, Amrith
    Jyothi, Preethi
    Ramakrishnan, Ganesh
    Goyal, Pawan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 5039 - 5050
  • [25] Using Privacy-Transformed Speech in the Automatic Speech Recognition Acoustic Model Training
    Salimbajevs, Askars
    HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE (HLT 2020), 2020, 328 : 47 - 54
  • [26] Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations
    Dhawan, Kunal
    Koluguri, Nithin Rao
    Jukic, Ante
    Langman, Ryan
    Balam, Jagadeesh
    Ginsburg, Boris
    INTERSPEECH 2024, 2024, : 2574 - 2578
  • [27] Parliament Archives Used for Automatic Training of Multi-lingual Automatic Speech Recognition Systems
    Nouza, Jan
    Safarik, Radek
    TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 : 174 - 182
  • [28] Study of parallelization of the training for automatic speech recognition
    Daoudi, EM
    Meziane, A
    El Hadj, YOM
    HIGH PERFORMANCE COMPUTING AND NETWORKING, PROCEEDINGS, 2000, 1823 : 576 - 579
  • [29] An audio-visual corpus for speech perception and automatic speech recognition (L)
    Cooke, Martin
    Barker, Jon
    Cunningham, Stuart
    Shao, Xu
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (05): : 2421 - 2424
  • [30] Indonesian Audio-Visual Speech Corpus for Multimodal Automatic Speech Recognition
    Maulana, Muhammad Rizki Aulia Rahman
    Fanany, Mohamad Ivan
    2017 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2017, : 381 - 385