A Generative Data Augmentation Model for Enhancing Chinese Dialect Pronunciation Prediction

Cited by: 3
Authors
Lin, Chu-Cheng [1]
Tsai, Richard Tzong-Han [2]
Affiliations
[1] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 10617, Taiwan
[2] Yuan Ze Univ, Dept Comp Sci & Engn, Zhongli 320, Taiwan
Keywords
Chinese dialects; data augmentation; generative model; pronunciation database
DOI
10.1109/TASL.2011.2172424
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Most spoken Chinese dialects lack comprehensive digital pronunciation databases, which are crucial for speech processing tasks. Given complete pronunciation databases for related dialects, one can use supervised learning to predict a Chinese character's pronunciation in a target dialect from the character's features and its pronunciation in the related dialects. Unfortunately, Chinese dialect pronunciation databases are far from complete. We propose a novel generative model that uses both existing dialect pronunciation data and medieval rime books to discover patterns shared across multiple dialects. The proposed model can fill in missing dialectal pronunciations based on existing dialect pronunciation tables (even if incomplete) and the pronunciation data in rime books. The augmented pronunciation database can then be used in supervised learning settings. We evaluate prediction accuracy in terms of phonological features such as tone, initial phoneme, and final phoneme. For each character, the features are also evaluated as a whole, using overall pronunciation feature accuracy (OPFA). In the first experiment, we show that adding features from dialectal pronunciation data to our baseline rime-book model dramatically improves the OPFA of a support vector machine (SVM) model. In the second experiment, we compare the performance of the SVM model using phonological features from closely related dialects with that of the model using phonological features from less closely related dialects; using features from closely related dialects results in higher accuracy. In the third experiment, we show that using our proposed data augmentation model to fill in missing data can increase the SVM model's OPFA by up to 7.6%.
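The abstract does not spell out how OPFA is computed. The following Python sketch assumes one plausible reading: a character counts as correct only when every phonological feature (here initial, final, and tone) is predicted correctly, and OPFA is the share of such characters. The function names, the feature set, and the toy data are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Sequence

Pronunciation = Dict[str, str]

# Illustrative feature names; the abstract mentions tone, initial, and final.
FEATURES: Sequence[str] = ("initial", "final", "tone")


def _check(gold: List[Pronunciation], pred: List[Pronunciation]) -> None:
    if not gold or len(gold) != len(pred):
        raise ValueError("gold and pred must be non-empty and of equal length")


def per_feature_accuracy(gold: List[Pronunciation],
                         pred: List[Pronunciation]) -> Dict[str, float]:
    """Accuracy of each phonological feature taken on its own."""
    _check(gold, pred)
    return {
        f: sum(g[f] == p[f] for g, p in zip(gold, pred)) / len(gold)
        for f in FEATURES
    }


def opfa(gold: List[Pronunciation], pred: List[Pronunciation]) -> float:
    """Assumed OPFA: share of characters with every feature correct."""
    _check(gold, pred)
    hits = sum(all(g[f] == p[f] for f in FEATURES) for g, p in zip(gold, pred))
    return hits / len(gold)


# Invented toy data for four characters (not real dialect pronunciations).
GOLD = [
    {"initial": "k", "final": "a", "tone": "1"},
    {"initial": "t", "final": "i", "tone": "2"},
    {"initial": "p", "final": "u", "tone": "3"},
    {"initial": "s", "final": "an", "tone": "1"},
]
PRED = [
    {"initial": "k", "final": "a", "tone": "1"},
    {"initial": "t", "final": "i", "tone": "3"},    # tone wrong
    {"initial": "p", "final": "u", "tone": "3"},
    {"initial": "ts", "final": "an", "tone": "1"},  # initial wrong
]

if __name__ == "__main__":
    print(per_feature_accuracy(GOLD, PRED))
    print(opfa(GOLD, PRED))
```

Under this reading, OPFA is never higher than the lowest per-feature accuracy, since a single wrong feature makes the whole character count as incorrect.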
Pages: 1109-1117
Number of pages: 9