Methodology for Obtaining High-Quality Speech Corpora

被引:0
|
作者
Wieczorkowska, Alicja [1 ]
机构
[1] Polish Japanese Acad Informat Technol, Koszykowa 86, PL-02008 Warsaw, Poland
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 04期
关键词
natural language processing; speech corpora; automatic speech recognition; CORPUS;
D O I
10.3390/app15041848
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Featured Application Creating speech corpora.Abstract Speech-based communication between users and machines is a very lively branch of research that covers speech recognition, synthesis, and, generally, natural language processing. Speech corpora are needed for training algorithms for human-machine communication, especially for automatic speech recognition and for speech synthesis. Generative artificial intelligence models also need corpora for training for every language implemented. Therefore, speech corpora are constantly being created. In this paper, we discuss how to create high-quality corpora. The technical parameters of the recordings and audio files are addressed, and a methodology is proposed for planning speech corpus creation with an emphasis on usability. The proposed methodology draws the attention of potential creators of speech corpora to often neglected aspects of the corpus creation process. The criteria for a quality assessment of particular components are also discussed. The author recommends not combining all quality metrics into one (or at least allowing users to adjust particular weights), as different users might be interested in different quality components. The presented guidelines lead to obtaining high-quality corpora that meet the needs of their end users and are easy to use.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] OBTAINING HIGH-QUALITY SEISMIC DATA IN COMPLEX AND FRONTIER AREAS
    PERRY, C
    GALVAN, P
    OIL & GAS JOURNAL, 1983, 81 (48) : 138 - 140
  • [22] Involvement of pharmaeconomy criteria for obtaining high-quality medical care
    Alvarez, JS
    REVISTA CLINICA ESPANOLA, 2002, 202 (08): : 466 - 467
  • [23] High-quality text-to-speech synthesis: An overview
    Dutoit, T.
    Journal of Electrical and Electronics Engineering, Australia, 1997, 17 (01): : 25 - 36
  • [24] MULTIPOINT TELECONFERENCE SYSTEM PROVIDING HIGH-QUALITY SPEECH
    SHIMADA, S
    TAKA, M
    SUZUKI, J
    REVIEW OF THE ELECTRICAL COMMUNICATIONS LABORATORIES, 1988, 36 (01): : 57 - 62
  • [25] FlexVoice:: A parametric approach to high-quality speech synthesis
    Balogh, G
    Dobler, E
    Grobler, T
    Smodics, B
    Szepesvári, C
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 189 - 194
  • [26] HIGH-QUALITY SPEECH COMPRESSION-EXPANSION METHOD
    JOHNSON, O
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1962, 34 (05): : 725 - &
  • [27] Mining Infrequent High-Quality Phrases from Domain-Specific Corpora
    Wang, Li
    Zhu, Wei
    Jiang, Sihang
    Zhang, Sheng
    Wang, Keqiang
    Ni, Yuan
    Xie, Guotong
    Xiao, Yanghua
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1535 - 1544
  • [28] Harvester and transporting device development for high-quality soybean seeds obtaining
    Prisyazhnaya, I. M.
    Sinegovskaya, V. T.
    Prisyazhnaya, S. P.
    Sinegovskii, M. O.
    III INTERNATIONAL SCIENTIFIC CONFERENCE: AGRITECH-III-2020: AGRIBUSINESS, ENVIRONMENTAL ENGINEERING AND BIOTECHNOLOGIES, PTS 1-8, 2020, 548
  • [29] OBTAINING OF DIELECTRIC PICTURE OF HIGH-QUALITY IN A RASTER ELECTRON-MICROSCOPE
    SPIVAK, GV
    RAU, EI
    LUKYANOV, AE
    PETROV, VI
    AIRAPETO.AS
    RADIOTEKHNIKA I ELEKTRONIKA, 1972, 17 (10): : 2237 - 2239
  • [30] Obtaining high-quality powders by atomization from melts of secondary aluminum
    Ershov, GS
    Nichiporenko, OS
    Gavrilyuk, GV
    Medvedovskii, AB
    POWDER METALLURGY AND METAL CERAMICS, 1996, 35 (11-12) : 577 - 579