A CORPUS FOR THE STUDY ON THE ASSESSMENT OF MANDARIN PRONUNCIATION OF TIBETAN SPEAKERS

被引:0
|
作者
Gan, Z. [1 ,3 ]
Jiang, J. [1 ]
Yan, Y. [1 ]
Yang, H. [1 ,2 ,4 ]
机构
[1] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou, Gansu, Peoples R China
[2] Northwest Normal Univ, Sch Educ Technol, Lanzhou, Gansu, Peoples R China
[3] Engn Res Ctr Gansu Prov Intelligent Informat Tech, Lanzhou, Gansu, Peoples R China
[4] Natl & Prov Joint Engn Lab Learning Anal Technol, Lanzhou, Gansu, Peoples R China
基金
中国国家自然科学基金;
关键词
Tibetan speaker Mandarin; pronunciation assessment; audio recording dataset; SAMPA-TSC;
D O I
暂无
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Tibetan speakers always have some types of fixed pronunciation errors when they speak Mandarin, which are affected by their native language pronunciation habits. Therefore, a system assessment that can detect the mispronunciation and overall similarity measurement of syllables or phonemes in Tibetan Mandarin to help learners improve their Mandarin level needs to be studied. A unique corpus is required in order to study on the assessment of Mandarin pronunciation of Tibetan speakers. Unfortunately, there is no such a corpus in this field for the research task. We create a particular corpus by integrating the linguistic theory of Tibetan and Chinese with speech signal processing and machine learning. In this work, we record the non-standard Mandarin audio of Tibetan students and the standard Mandarin audio. These audio recordings share the same text designed by analyzing and comparing the pronunciation characteristics of Tibetan and Chinese. Audio recordings total 5.5 hours that contain 1000 paragraphs, covering 377 syllables without tones and all phonemes in standard Chinese. Then we introduce the recording environment and recording equipments. Furthermore, we set the rules for the annotation of the audio recordings in hierarchical format through PRAAT software: the first layer is the phrase layer, marked with Chinese characters; the second layer is the syllable layer, marked with pinyin; the third layer is the phoneme layer, labeled with Speech Assessment Methods Phonetic Alphabet-Tibetan Standard Chinese (SAMPA-TSC), which is designed by ourselves. Finally, we evaluate the corpus creation in four aspects--coverage, completeness, quality, reusability--and describe the potential of the dataset application.
引用
收藏
页码:7840 / 7848
页数:9
相关论文
共 50 条
  • [1] Mandarin pronunciation modeling based on CASS corpus
    Zheng, F
    Song, ZJ
    Fung, P
    Byrne, W
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2002, 17 (03): : 249 - 263
  • [2] Mandarin pronunciation modeling based on CASS corpus
    Fang Zheng
    Zhanjiang Song
    Pascale Fung
    Byrne William
    [J]. Journal of Computer Science and Technology, 2002, 17 : 249 - 263
  • [3] Perception of Mandarin Tones by Native Tibetan Speakers
    Bao, Wenfu
    Feng, Hui
    Dang, Jianwu
    Liu, Zhilei
    Yu, Yang
    Wang, Siyu
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 811 - 814
  • [4] Investigation of Learning Trajectory of Mandarin for Tibetan Speakers
    Wang, Huixia
    Dang, Jianwu
    Feng, Hui
    Wang, Hongcui
    Yu, Yang
    Honda, Kiyoshi
    [J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 208 - 212
  • [5] Acoustic Features of Mandarin Monophthongs by Tibetan Speakers
    Zhao, Lu
    Feng, Hui
    Wang, Huixia
    Dang, Jianwu
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2014), 2014, : 147 - 150
  • [6] Pronunciation recognition and assessment for mandarin Chinese
    Zhong, Cencen
    Miao, Zhenjiang
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 352 - 356
  • [7] Automatic pronunciation assessment for mandarin Chinese
    Chen, JC
    Jang, JSR
    Li, JY
    Wu, MC
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1979 - 1982
  • [8] Corpus-based learning of Cantonese for Mandarin speakers
    Wong, Tak-Sum
    Lee, John S. Y.
    [J]. RECALL, 2016, 28 (02) : 187 - 206
  • [9] MANDARIN SPEECH RECOGNITION FOR NONNATIVE SPEAKERS BASED ON PRONUNCIATION DICTIONARY ADAPTATION
    Yang, Jian
    Wu, Peishan
    Xu, Dan
    [J]. 2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 217 - 220
  • [10] Tibetan Vowel Analysis with a Multi-Modal Mandarin-Tibetan Speech Corpus
    Lobsang, Gyaltsen
    Lu, Wenhuan
    Honda, Kiyoshi
    Wei, Jianguo
    Guan, Wendan
    Fang, Qiang
    Dang, Jianwu
    [J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,