A Corpus of Native, Non-native and Translated Texts

Cited by: 0
Authors
Nisioi, Sergiu [1]
Rabinovich, Ella [2]
Dinu, Liviu P. [1]
Wintner, Shuly [2]
Affiliations
[1] Univ Bucharest, Ctr Computat Linguist, Bucharest, Romania
[2] Univ Haifa, Dept Comp Sci, Haifa, Israel
Keywords
Corpus linguistics; Translation; Bilingualism; Second language acquisition
DOI
Not available
Chinese Library Classification (CLC) number
H [Language, Writing]
Discipline classification code
05
Abstract
We describe a monolingual English corpus of original and (human) translated texts, with an accurate annotation of speaker properties, including the original language of the utterances and the speaker's country of origin. We thus obtain three sub-corpora of texts reflecting native English, non-native English, and English translated from a variety of European languages. This dataset will facilitate the investigation of similarities and differences between these kinds of sub-languages. Moreover, it will facilitate a unified comparative study of translations and language produced by (highly fluent) non-native speakers, two closely-related phenomena that have only been studied in isolation so far.
Pages: 4197-4201
Page count: 5