A Study on Mispronunciation Detection Based on Fine-grained Speech Attribute

被引:0
|
作者
Guo, Minghao [1 ]
Rui, Cai [1 ]
Wang, Wei [1 ]
Lin, Binghuai [2 ]
Zhang, Jinsong [1 ]
Xie, Yanlu [1 ]
机构
[1] Beijing Language & Culture Univ, Beijing Adv Innovat Ctr Language Resources, Beijing, Peoples R China
[2] Tencent Sci & Technol Ltd, MIG, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Over the last decade, several studies have investigated speech attribute detection (SAD) for improving computer assisted pronunciation training (CAPT) systems. The predefined speech attribute categories either is IPA or language dependent categories, which is difficult to handle multiple languages mispronunciation detection. In this paper, we propose a fine-grained speech attribute (FSA) modeling method, which defines types of Chinese speech attribute by combining Chinese phonetics with the international phonetic alphabet (IPA). To verify FSA, a large scale Chinese corpus was used to train Time-delay neural networks (TDNN) based on speech attribute models, and tested on Russian learner data set. Experimental results showed that all FSA's accuracy on Chinese test set is about 95% on average, and the diagnosis accuracy of the FSA-based mispronunciation detection achieved a 2.2% improvement compared to that of segment-based baseline system. Besides, as the FSA is theoretically capable of modeling language-universal speech attributes, we also tested the trained FSA-based method on native English corpus, which achieved about 50% accuracy rate.
引用
收藏
页码:1197 / 1201
页数:5
相关论文
共 50 条
  • [31] Deformable Part Descriptors for Fine-grained Recognition and Attribute Prediction
    Zhang, Ning
    Farrell, Ryan
    Iandola, Forrest
    Darrell, Trevor
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 729 - 736
  • [32] Knowledge graph fine-grained network with attribute transfer for recommendation
    Yuan, Xu
    Chen, Zixuan
    Bu, Xiya
    Gao, Zhengnan
    Zhao, Liang
    Ma, Ruixin
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 257
  • [33] Fine-Grained Visual Attribute Extraction from Fashion Wear
    Parekh, Viral
    Shaik, Karimulla
    Biswas, Soma
    Chelliah, Muthusamy
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3968 - 3972
  • [34] Audio Visual Attribute Discovery for Fine-Grained Object Recognition
    Zhang, Hua
    Cao, Xiaochun
    Wang, Rui
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7542 - 7549
  • [35] CAST: Conditional Attribute Subsampling Toolkit for Fine-grained Evaluation
    Robbins, Wes
    Zhou, Steven
    Bhatta, Aman
    Mello, Chad
    Albiero, Vitor
    Bowyer, Kevin W.
    Boult, Terrance E.
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 919 - 929
  • [36] Fine-grained Emotion Role Detection Based on Retweet Information
    Yu, Zhiwen
    Yi, Fei
    Ma, Chao
    Wang, Zhu
    Guo, Bin
    Chen, Liming
    ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2019, 19 (01)
  • [37] Fine-grained Android Malware Detection based on Deep Learning
    Li, Dongfang
    Wang, Zhaoguo
    Xue, Yibo
    2018 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2018,
  • [38] A Fine-Grained Network Congestion Detection Based on Flow Watermarking
    Mo, Lusha
    Lv, Gaofeng
    Wang, Baosheng
    APPLIED SCIENCES-BASEL, 2022, 12 (16):
  • [39] Fine-grained access control system based on fully outsourced attribute-based encryption
    Zhang, Rui
    Ma, Hui
    Lu, Yao
    JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 125 : 344 - 353
  • [40] Fine-grained Noise Control for Multispeaker Speech Synthesis
    Nikitaras, Karolos
    Vamvoukakis, Georgios
    Ellinas, Nikolaos
    Klapsas, Konstantinos
    Markopoulos, Konstantinos
    Raptis, Spyros
    Sung, June Sig
    Jho, Gunu
    Chalamandaris, Aimilios
    Tsiakoulis, Pirros
    INTERSPEECH 2022, 2022, : 828 - 832