Methodology for Obtaining High-Quality Speech Corpora

被引:0
|
作者
Wieczorkowska, Alicja [1 ]
机构
[1] Polish Japanese Acad Informat Technol, Koszykowa 86, PL-02008 Warsaw, Poland
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 04期
关键词
natural language processing; speech corpora; automatic speech recognition; CORPUS;
D O I
10.3390/app15041848
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Featured Application Creating speech corpora.Abstract Speech-based communication between users and machines is a very lively branch of research that covers speech recognition, synthesis, and, generally, natural language processing. Speech corpora are needed for training algorithms for human-machine communication, especially for automatic speech recognition and for speech synthesis. Generative artificial intelligence models also need corpora for training for every language implemented. Therefore, speech corpora are constantly being created. In this paper, we discuss how to create high-quality corpora. The technical parameters of the recordings and audio files are addressed, and a methodology is proposed for planning speech corpus creation with an emphasis on usability. The proposed methodology draws the attention of potential creators of speech corpora to often neglected aspects of the corpus creation process. The criteria for a quality assessment of particular components are also discussed. The author recommends not combining all quality metrics into one (or at least allowing users to adjust particular weights), as different users might be interested in different quality components. The presented guidelines lead to obtaining high-quality corpora that meet the needs of their end users and are easy to use.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Integrating imperfect transcripts into speech recognition systems for building high-quality corpora
    Lecouteux, Benjamin
    Linares, Georges
    Oger, Stanislas
    COMPUTER SPEECH AND LANGUAGE, 2012, 26 (02): : 67 - 89
  • [2] Process Model for Composing High-quality Text Corpora
    Lounela, Mikko
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 87 - 90
  • [3] Techniques for Obtaining High-quality Recordings in Electrocochleography
    Simpson, Michael J.
    Jennings, Skyler G.
    Margolis, Robert H.
    FRONTIERS IN SYSTEMS NEUROSCIENCE, 2020, 14
  • [5] SVitchboard II and FiSVer I: High-Quality Limited-Complexity Corpora of Conversational English Speech
    Liu, Yuzong
    Iyer, Rishabh
    Kirchhoff, Katrin
    Bilmes, Jeff
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 673 - 677
  • [6] HIGH-QUALITY PARCOR SPEECH SYNTHESIZER
    SAMPEI, T
    ASADA, A
    NAKATA, K
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1980, 26 (03) : 353 - 359
  • [7] High-quality speech processor for comms
    不详
    ELECTRONICS WORLD, 2001, 107 (1784): : 604 - 606
  • [8] SPEECH DIGITIZATION AND COMPRESSION - THE HIGH-QUALITY SPEECH PROCESS
    BROWN, D
    MICROELECTRONICS AND RELIABILITY, 1981, 21 (06): : 815 - 816
  • [9] Conditions for Obtaining High-Quality Crystals by the Czochralski Method
    V. N. Matrosov
    Crystallography Reports, 2019, 64 : 174 - 176
  • [10] Obtaining High-Quality Relevance Judgments Using Crowdsourcing
    Vuurens, Jeroen B. P.
    de Vries, Arjen P.
    IEEE INTERNET COMPUTING, 2012, 16 (05) : 20 - 27