Multi-Accent Chinese Speech Recognition

被引:0
|
作者
Liu Yi [1 ]
Fung, Pascale
机构
[1] Univ Sci & Technol, Human Language Technol Ctr, Dept Elect & Elect Engn, Hong Kong, Hong Kong, Peoples R China
关键词
multi-accent recognition; accent tree; Chinese;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiple accents are often present in spontaneous Chinese Mandarin speech as most Chinese have learned Mandarin as a second language. We propose a method to handle multiple accents as well as standard speech in a speaker-independent system by merging auxiliary accent decision trees with standard trees and reconstruct the acoustic model. In our proposed method, tree structures and shape are modified according to accent-specific data while the parameter set of the baseline model remains the same. The effectiveness of this approach is evaluated on Cantonese and Wu accented, as well as standard Mandarin speech. Our method yields a significant 4.4% and 3.3% absolute word error rate reduction without sacrificing the performance on standard Mandarin speech.
引用
收藏
页码:133 / +
页数:2
相关论文
共 50 条
  • [1] END-TO-END MULTI-ACCENT SPEECH RECOGNITION WITH UNSUPERVISED ACCENT MODELLING
    Li, Song
    Ouyang, Beibei
    Liao, Dexin
    Xia, Shipeng
    Li, Lin
    Hong, Qingyang
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6418 - 6422
  • [2] JOINT MODELING OF ACCENTS AND ACOUSTICS FOR MULTI-ACCENT SPEECH RECOGNITION
    Yang, Xuesong
    Audhkhasi, Kartik
    Rosenberg, Andrew
    Thomas, Samuel
    Ramabhadran, Bhuvana
    Hasegawa-Johnson, Mark
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5989 - 5993
  • [3] MULTI-ACCENT SPEECH RECOGNITION WITH HIERARCHICAL GRAPHEME BASED MODELS
    Rao, Kanishka
    Sak, Hasim
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4815 - 4819
  • [4] Multi-Accent and Accent-Independent Non-Native Speech Recognition
    Bouselmi, Ghazi
    Fohr, Dominique
    Illina, Irina
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2703 - +
  • [5] Investigations of Low Resource Multi-Accent Mandarin Speech Recognition
    Wang, Wei
    Xu, Wenying
    Sui, Xiang
    Wang, Lan
    Liu, Xunying
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 62 - 66
  • [6] A Multi-Accent Acoustic Model using Mixture of Experts for Speech Recognition
    Jain, Abhinav
    Singh, Vishwanath P.
    Rath, Shakti P.
    [J]. INTERSPEECH 2019, 2019, : 779 - 783
  • [7] Reliable Accent-Specific Unit Generation With Discriminative Dynamic Gaussian Mixture Selection for Multi-Accent Chinese Speech Recognition
    Zhang, Chao
    Liu, Yi
    Xia, Yunqing
    Wang, Xuan
    Lee, Chin-Hui
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (10): : 2073 - 2084
  • [8] Adaptive Attention Network with Domain Adversarial Training for Multi-Accent Speech Recognition
    Yang, Yanbing
    Shi, Hao
    Lin, Yuqin
    Ge, Meng
    Wang, Longbiao
    Hou, Qingzhi
    Dang, Jianwu
    [J]. 2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 6 - 10
  • [9] Layer-Wise Fast Adaptation for End to End Multi-Accent Speech Recognition
    Qian, Yanmin
    Gong, Xun
    Huang, Houjun
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2842 - 2853
  • [10] RELIABLE ACCENT SPECIFIC UNIT GENERATION WITH DYNAMIC GAUSSIAN MIXTURE SELECTION FOR MULTI-ACCENT SPEECH RECOGNITION
    Zhang, Chao
    Liu, Yi
    Xia, Yunqing
    Zheng, Thomas Fang
    Olsen, Jesper
    Tian, JiLei
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,