Methodology for Obtaining High-Quality Speech Corpora

Author
Wieczorkowska, Alicja [1]
Affiliation
[1] Polish Japanese Acad Informat Technol, Koszykowa 86, PL-02008 Warsaw, Poland
Source
APPLIED SCIENCES-BASEL | 2025 / Vol. 15 / Issue 04
Keywords
natural language processing; speech corpora; automatic speech recognition; CORPUS;
DOI
10.3390/app15041848
Chinese Library Classification (CLC) number
O6 [Chemistry];
Subject classification code
0703;
Abstract
Featured Application: Creating speech corpora.

Abstract: Speech-based communication between users and machines is a very lively branch of research that covers speech recognition, synthesis, and, generally, natural language processing. Speech corpora are needed for training algorithms for human-machine communication, especially for automatic speech recognition and for speech synthesis. Generative artificial intelligence models also need corpora for training for every language implemented. Therefore, speech corpora are constantly being created. In this paper, we discuss how to create high-quality corpora. The technical parameters of the recordings and audio files are addressed, and a methodology is proposed for planning speech corpus creation with an emphasis on usability. The proposed methodology draws the attention of potential creators of speech corpora to often neglected aspects of the corpus creation process. The criteria for a quality assessment of particular components are also discussed. The author recommends not combining all quality metrics into one (or at least allowing users to adjust particular weights), as different users might be interested in different quality components. The presented guidelines lead to obtaining high-quality corpora that meet the needs of their end users and are easy to use.
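
The recommendation to keep quality components separate, or at least let users adjust the weights, could be sketched roughly as follows. This is an illustration only, not the paper's method: the class name `QualityReport`, the component names (`audio`, `transcription`, `metadata`), the [0, 1] score range, and the weighted-mean formula are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class QualityReport:
    """Per-component quality scores (hypothetical; each assumed to lie in [0, 1])."""

    scores: Dict[str, float]

    def weighted(self, weights: Dict[str, float]) -> float:
        """Combine selected components with caller-supplied, non-negative weights."""
        unknown = set(weights) - set(self.scores)
        if unknown:
            raise KeyError(f"no score for components: {sorted(unknown)}")
        if any(w < 0 for w in weights.values()):
            raise ValueError("weights must be non-negative")
        total = sum(weights.values())
        if total <= 0:
            raise ValueError("weights must sum to a positive value")
        # Weighted mean over only the components the caller cares about.
        return sum(self.scores[name] * w for name, w in weights.items()) / total


# Hypothetical scores for one corpus; components are reported separately.
report = QualityReport({"audio": 0.9, "transcription": 0.6, "metadata": 0.8})

# Two users weigh the same components differently (weights are illustrative).
asr_view = report.weighted({"audio": 1.0, "transcription": 3.0})
tts_view = report.weighted({"audio": 3.0, "transcription": 1.0})
```

Keeping `scores` as the primary output and treating any single number as a derived, user-specific view means the same corpus can rank differently for different users, which is the situation the abstract describes.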
Pages: 22