A Generative Data Augmentation Model for Enhancing Chinese Dialect Pronunciation Prediction

被引:3
|
作者
Lin, Chu-Cheng [1 ]
Tsai, Richard Tzong-Han [2 ]
机构
[1] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 10617, Taiwan
[2] Yuan Ze Univ, Dept Comp Sci & Engn, Zhongli 320, Taiwan
关键词
Chinese dialects; data augmentation; generative model; pronunciation database;
D O I
10.1109/TASL.2011.2172424
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most spoken Chinese dialects lack comprehensive digital pronunciation databases, which are crucial for speech processing tasks. Given complete pronunciation databases for related dialects, one can use supervised learning techniques to predict a Chinese character's pronunciation in a target dialect based on the character's features and its pronunciation in other related dialects. Unfortunately, Chinese dialect pronunciation databases are far from complete. We propose a novel generative model that makes use of both existing dialect pronunciation data plus medieval rime books to discover patterns that exist in multiple dialects. The proposed model can augment missing dialectal pronunciations based on existing dialect pronunciation tables (even if incomplete) and the pronunciation data in rime books. The augmented pronunciation database can then be used in supervised learning settings. We evaluate the prediction accuracy in terms of phonological features, such as tone, initial phoneme, final phoneme, etc. For each character, features are evaluated on the whole, overall pronunciation feature accuracy (OPFA). Our first experimental results show that adding features from dialectal pronunciation data to our baseline rime-book model dramatically improves OPFA using the support vector machine (SVM) model. In the second experiment, we compare the performance of the SVM model using phonological features from closely related dialects with that of the model using phonological features from non-closely related dialects. The experimental results show that using features from closely related dialects results in higher accuracy. In the third experiment, we show that using our proposed data augmentation model to fill in missing data can increase the SVM model's OPFA by up to 7.6%.
引用
收藏
页码:1109 / 1117
页数:9
相关论文
共 50 条
  • [1] Enhancing Collaborative Filtering with Generative Augmentation
    Wang, Qinyong
    Yin, Hongzhi
    Wang, Hao
    Quoc Viet Hung Nguyen
    Huang, Zi
    Cui, Lizhen
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 548 - 556
  • [2] Enhancing Text Classification Models with Generative AI-aided Data Augmentation
    Zhao, Huanhuan
    Chen, Haihua
    Yoon, Hong-Jun
    2023 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING, AITEST, 2023, : 138 - 145
  • [3] Enhancing rumor detection with data augmentation and generative pre-trained transformer
    Askarizade, Mojgan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 262
  • [4] Data Augmentation Computing Model Based on Generative Adversarial Network
    Weng, Yu
    Zhou, Haiwen
    IEEE ACCESS, 2019, 7 : 64223 - 64233
  • [5] Generative Model based Data Augmentation for Special Person Classification
    Guo, Zijie
    Zhi, Rong
    Zhang, Wuqaing
    Wang, Baofeng
    Fang, Zhijie
    Kaiser, Vitali
    Wiederer, Julian
    Flohr, Fabian
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1669 - 1675
  • [6] Enhancing heart disease prediction with reinforcement learning and data augmentation
    Gayathri, R.
    Sangeetha, S. K. B.
    Mathivanan, Sandeep Kumar
    Rajadurai, Hariharan
    Malar, Banjul Anbu M. B.
    Mallik, Saurav
    Qin, Hong
    SYSTEMS AND SOFT COMPUTING, 2024, 6
  • [7] Fusing generative and discriminative models for Chinese dialect identification
    Gu, Mingliang
    Xia, Yuguo
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1096 - 1099
  • [8] Data augmentation for enhancing EEG-based emotion recognition with deep generative models
    Luo, Yun
    Zhu, Li-Zhen
    Wan, Zi-Yu
    Lu, Bao-Liang
    JOURNAL OF NEURAL ENGINEERING, 2020, 17 (05)
  • [9] Enhancing Activity Recognition After Stroke: Generative Adversarial Networks for Kinematic Data Augmentation
    Hadley, Aaron J.
    Pulliam, Christopher L.
    SENSORS, 2024, 24 (21)
  • [10] Improving the prediction of extreme wind speed events with generative data augmentation techniques
    Vega-Bayo, M.
    Perez-Aracil, J.
    Prieto-Godino, L.
    Salcedo-Sanz, S.
    RENEWABLE ENERGY, 2024, 221