Issues in building English-Chinese parallel corpora with WordNets

被引:0
|
作者
Bond, Francis [1 ]
Wang, Shan [1 ]
机构
[1] Nanyang Technol Univ, Nanyang, Singapore
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We discuss some of the issues in producing sense-tagged parallel corpora: including pre-processing, adding new entries and linking. We have preliminary results for three genres: stories, essays and tourism web pages, in both Chinese and English.
引用
收藏
页码:391 / 399
页数:9
相关论文
共 50 条
  • [1] Research of English-Chinese alignment at word granularity on parallel corpora
    Xu Yang
    Wang Hou-feng
    Lue Xue-qiang
    [J]. 7TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE IN CONJUNCTION WITH 2ND IEEE/ACIS INTERNATIONAL WORKSHOP ON E-ACTIVITY, PROCEEDINGS, 2008, : 223 - +
  • [2] Building wordnets with multi-word expressions from parallel corpora
    Simoes, Alberto
    Gomez Guinovart, Xavier
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2020, (64): : 45 - 52
  • [3] Building a Case-based Semantic English-Chinese Parallel Treebank
    Shi, Huaxing
    Zhao, Tiejun
    Su, Keh-Yih
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 2918 - 2924
  • [4] Phraseology in Contrast: Evidence from English-Chinese Corpora
    Li, Tao
    [J]. LANGUAGES IN CONTRAST, 2015, 15 (02) : 302 - 306
  • [5] Automatic creation of WordNets from parallel corpora
    Oliver, Antoni
    Climent, Salvador
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1112 - 1116
  • [6] Building an English-Chinese Parallel Corpus Annotated with Sub-sentential Translation Techniques
    Zhai, Yuming
    Liu, Lufei
    Zhong, Xinyi
    Illouz, Gabriel
    Vilnat, Anne
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4024 - 4033
  • [7] Mining an English-Chinese parallel Dataset of Financial News
    Turenne, Nicolas
    Chen, Ziwei
    Fan, Guitao
    Li, Jianlong
    Li, Yiwen
    Wang, Siyuan
    Zhou, Jiaqi
    [J]. JOURNAL OF OPEN HUMANITIES DATA, 2022, 8
  • [8] Automatic construction of English/Chinese parallel corpora
    Yang, CC
    Li, KW
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2003, 54 (08): : 730 - 742
  • [9] Developing parallel sense-tagged corpora with wordnets
    Bond, Francis
    Wang, Shan
    Gao, Eshley Huini
    Mok, Hazel Shuwen
    Tan, Jeanette Yiwen
    [J]. LAW 2013 and ID 2013 - 7th Linguistic Annotation Workshop and Interoperability with Discourse, Proceedings of the Workshop, (149-158):
  • [10] Building a Parallel Corpora: Translation Issues and Remedial Case
    Archana, G. P.
    Jithesh, V. S.
    Remya, L. B.
    Sherly, Elizabeth
    [J]. 2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 2414 - 2417