A MULTIMEDIA CORPUS OF CHILD MANDARIN: THE TONG CORPUS

被引:30
|
作者
Deng Xiangjun [1 ,2 ]
Yip, Virginia [3 ]
机构
[1] Chinese Univ Hong Kong, Shatin, Hong Kong, Peoples R China
[2] Shenzhen Univ, Res Ctr Language & Cognit, Sch Foreign Languages, 3688 Nanhai Ave, Shenzhen 518060, Guangdong, Peoples R China
[3] Chinese Univ Hong Kong, Dept Linguist & Modern Languages, G-F,Leung Kau Kui Bldg, Shatin, Hong Kong, Peoples R China
关键词
Child language corpus; Mandarin Chinese; Language input; Media linking; Morphological tier; LANGUAGE-ACQUISITION; ENGLISH; SPEECH; NOUNS; VERBS;
D O I
10.1353/jcl.2017.0025
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
This article features a new multimedia corpus(i) with 22 hours of recordings of a Mandarin-speaking child from the age of 1;7 to 3;4. We review the state of the art in the use of corpora for first language acquisition of Mandarin, and highlight the importance of corpus studies in evaluating children's language developmental patterns vis-a-vis adult input. The transcripts in our new corpus are annotated with a morphological tier indicating parts of speech, and linked to audio or video files. This corpus goes beyond existing published corpora of child Mandarin in having more data for a single child, as well as media linking. It contributes to a number of fields including language acquisition, Chinese linguistics, corpus linguistics, developmental psycholinguistics, education, and speech and language therapy.
引用
收藏
页码:69 / 92
页数:24
相关论文
共 50 条
  • [2] MANDARIN MULTIMEDIA CHILD SPEECH CORPUS: CASS_ CHILD
    Gao, Jun
    Li, Aijun
    Xiong, Ziyu
    [J]. 2012 INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2012, : 7 - 12
  • [3] The ManDi Corpus: A Spoken Corpus of Mandarin Regional Dialects
    Zhao, Liang
    Chodroff, Eleanor
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1985 - 1990
  • [4] The linguistic encoding of space in child Mandarin: A corpus-based study
    Deng Xiangjun
    Yip, Virginia
    [J]. LINGUISTICS, 2015, 53 (05) : 1079 - 1112
  • [5] The DIG Mandarin Conversations (DMC) Corpus
    Yu, Guodong
    Wu, Yaxin
    Drew, Paul
    Raymond, Chase Wesley
    [J]. CHINESE LANGUAGE AND DISCOURSE, 2024, 15 (01) : 105 - 141
  • [6] A Corpus of Adpositional Supersenses for Mandarin Chinese
    Peng, Siyao
    Liu, Yang
    Zhu, Yilun
    Blodgett, Austin
    Zhao, Yushi
    Schneider, Nathan
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5986 - 5994
  • [7] A Multimedia Corpus of Driving Behaviors
    Malta, Lucas
    Ozaki, Akira
    Miyajima, Chiyomi
    Kitaoka, Norihide
    Takeda, Kazuya
    [J]. 2009 IEEE INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2009), 2009, : 37 - 42
  • [8] A multimedia corpus of the Yiddish language
    T. A. Arkhangel’skii
    O. A. Sozinova
    [J]. Automatic Documentation and Mathematical Linguistics, 2015, 49 (2) : 47 - 53
  • [9] A Multimedia Corpus of the Yiddish Language
    Arkhangel'skii, T. A.
    Sozinova, O. A.
    [J]. AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2015, 49 (02) : 47 - 53
  • [10] The Classroom Application of Multimedia Corpus
    LIU Rui(Zhongzhou University
    [J]. 海外英语, 2011, (07) : 388 - 391