A CORPUS FOR THE STUDY ON THE ASSESSMENT OF MANDARIN PRONUNCIATION OF TIBETAN SPEAKERS

被引：0

作者：

Gan, Z. ^{[1
,3
]}

Jiang, J. ^{[1
]}

Yan, Y. ^{[1
]}

Yang, H. ^{[1
,2
,4
]}

机构：

[1] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou, Gansu, Peoples R China

[2] Northwest Normal Univ, Sch Educ Technol, Lanzhou, Gansu, Peoples R China

[3] Engn Res Ctr Gansu Prov Intelligent Informat Tech, Lanzhou, Gansu, Peoples R China

[4] Natl & Prov Joint Engn Lab Learning Anal Technol, Lanzhou, Gansu, Peoples R China

来源：

14TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED2020) | 2020年

基金：

中国国家自然科学基金;

关键词：

Tibetan speaker Mandarin; pronunciation assessment; audio recording dataset; SAMPA-TSC;

D O I：

暂无

中图分类号：

G40 [教育学];

学科分类号：

040101 ; 120403 ;

摘要：

Tibetan speakers always have some types of fixed pronunciation errors when they speak Mandarin, which are affected by their native language pronunciation habits. Therefore, a system assessment that can detect the mispronunciation and overall similarity measurement of syllables or phonemes in Tibetan Mandarin to help learners improve their Mandarin level needs to be studied. A unique corpus is required in order to study on the assessment of Mandarin pronunciation of Tibetan speakers. Unfortunately, there is no such a corpus in this field for the research task. We create a particular corpus by integrating the linguistic theory of Tibetan and Chinese with speech signal processing and machine learning. In this work, we record the non-standard Mandarin audio of Tibetan students and the standard Mandarin audio. These audio recordings share the same text designed by analyzing and comparing the pronunciation characteristics of Tibetan and Chinese. Audio recordings total 5.5 hours that contain 1000 paragraphs, covering 377 syllables without tones and all phonemes in standard Chinese. Then we introduce the recording environment and recording equipments. Furthermore, we set the rules for the annotation of the audio recordings in hierarchical format through PRAAT software: the first layer is the phrase layer, marked with Chinese characters; the second layer is the syllable layer, marked with pinyin; the third layer is the phoneme layer, labeled with Speech Assessment Methods Phonetic Alphabet-Tibetan Standard Chinese (SAMPA-TSC), which is designed by ourselves. Finally, we evaluate the corpus creation in four aspects--coverage, completeness, quality, reusability--and describe the potential of the dataset application.

引用

页码：7840 / 7848

页数：9

共 50 条

[1] Mandarin pronunciation modeling based on CASS corpus
Zheng, F
Song, ZJ
Fung, P
Byrne, W
[J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2002, 17 (03): : 249 - 263
[2] Mandarin pronunciation modeling based on CASS corpus
Fang Zheng
Zhanjiang Song
Pascale Fung
Byrne William
[J]. Journal of Computer Science and Technology, 2002, 17 : 249 - 263
[3] Perception of Mandarin Tones by Native Tibetan Speakers
Bao, Wenfu
Feng, Hui
Dang, Jianwu
Liu, Zhilei
Yu, Yang
Wang, Siyu
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 811 - 814
[4] Investigation of Learning Trajectory of Mandarin for Tibetan Speakers
Wang, Huixia
Dang, Jianwu
Feng, Hui
Wang, Hongcui
Yu, Yang
Honda, Kiyoshi
[J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 208 - 212
[5] Acoustic Features of Mandarin Monophthongs by Tibetan Speakers
Zhao, Lu
Feng, Hui
Wang, Huixia
Dang, Jianwu
[J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2014), 2014, : 147 - 150
[6] Pronunciation recognition and assessment for mandarin Chinese
Zhong, Cencen
Miao, Zhenjiang
[J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 352 - 356
[7] Automatic pronunciation assessment for mandarin Chinese
Chen, JC
Jang, JSR
Li, JY
Wu, MC
[J]. 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1979 - 1982
[8] Corpus-based learning of Cantonese for Mandarin speakers
Wong, Tak-Sum
Lee, John S. Y.
[J]. RECALL, 2016, 28 (02) : 187 - 206
[9] MANDARIN SPEECH RECOGNITION FOR NONNATIVE SPEAKERS BASED ON PRONUNCIATION DICTIONARY ADAPTATION
Yang, Jian
Wu, Peishan
Xu, Dan
[J]. 2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 217 - 220
[10] Tibetan Vowel Analysis with a Multi-Modal Mandarin-Tibetan Speech Corpus
Lobsang, Gyaltsen
Lu, Wenhuan
Honda, Kiyoshi
Wei, Jianguo
Guan, Wendan
Fang, Qiang
Dang, Jianwu
[J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,

← 1 2 3 4 5 →