Methodology for Obtaining High-Quality Speech Corpora

被引:0
|
作者
Wieczorkowska, Alicja [1 ]
机构
[1] Polish Japanese Acad Informat Technol, Koszykowa 86, PL-02008 Warsaw, Poland
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 04期
关键词
natural language processing; speech corpora; automatic speech recognition; CORPUS;
D O I
10.3390/app15041848
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Featured Application Creating speech corpora.Abstract Speech-based communication between users and machines is a very lively branch of research that covers speech recognition, synthesis, and, generally, natural language processing. Speech corpora are needed for training algorithms for human-machine communication, especially for automatic speech recognition and for speech synthesis. Generative artificial intelligence models also need corpora for training for every language implemented. Therefore, speech corpora are constantly being created. In this paper, we discuss how to create high-quality corpora. The technical parameters of the recordings and audio files are addressed, and a methodology is proposed for planning speech corpus creation with an emphasis on usability. The proposed methodology draws the attention of potential creators of speech corpora to often neglected aspects of the corpus creation process. The criteria for a quality assessment of particular components are also discussed. The author recommends not combining all quality metrics into one (or at least allowing users to adjust particular weights), as different users might be interested in different quality components. The presented guidelines lead to obtaining high-quality corpora that meet the needs of their end users and are easy to use.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture
    Miao, Chenfeng
    Liang, Shuang
    Liu, Zhencheng
    Chen, Minchuan
    Ma, Jun
    Wang, Shaojun
    Xiao, Jing
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [42] PortaSpeech: Portable and High-Quality Generative Text-to-Speech
    Ren, Yi
    Liu, Jinglin
    Zhao, Zhou
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [43] MDSWriter: Annotation tool for creating high-quality multi-document summarization corpora
    Meyer, Christian M.
    Benikova, Darina
    Mieskes, Margot
    Gurevych, Iryna
    PROCEEDINGS OF 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL-2016): SYSTEM DEMONSTRATIONS, 2016, : 97 - 102
  • [44] Cotranslate: A Web-Based Tool for Crowdsourcing High-Quality Sentence Pair Corpora
    National Center for Artificial Intelligence, Chile
    不详
    2023,
  • [45] CoTranslate: A web-based tool for crowdsourcing high-quality sentence pair corpora
    Carvallo, Andres
    Jorquera, Ignacio
    Aspillaga, Carlos
    SOFTWAREX, 2023, 23
  • [46] Translatotron 2: High-quality direct speech-to-speech translation with voice preservation
    Jia, Ye
    Ramanovich, Michelle Tadmor
    Remez, Tal
    Pomerantz, Roi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 10120 - 10134
  • [47] SEGMENTAL INTELLIGIBILITY AND SPEECH INTERFERENCE THRESHOLDS OF HIGH-QUALITY SYNTHETIC SPEECH IN PRESENCE OF NOISE
    KOUL, RK
    ALLEN, GD
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1993, 36 (04): : 790 - 798
  • [48] Evaluation of extraction methods for obtaining high-quality RNA from sweet potato
    Goncalves, R. C.
    Daude, M. M.
    de Souza, M. R.
    Moraes, M. B. F.
    Moreira, R. O.
    da Silveira, M. A.
    Sagio, S. A.
    Barreto, H. G.
    GENETICS AND MOLECULAR RESEARCH, 2021, 20 (04):
  • [49] Obtaining high-quality cast billets from Cu-Mg alloys
    Ten, E. B.
    Badmazhapova, I. B.
    RUSSIAN JOURNAL OF NON-FERROUS METALS, 2013, 54 (02) : 166 - 170
  • [50] A Clean Process for Obtaining High-Quality Cellulose Acetate from Cigarette Butts
    De Fenzo, Anna
    Giordano, Michele
    Sansone, Lucia
    MATERIALS, 2020, 13 (21) : 1 - 13