Sakura at SemEval-2023 Task 2: Data Augmentation via Translation

被引:0
|
作者
Poncelas, Alberto [1 ]
Tkachenko, Maksim [1 ]
Htun, Ohnmar [1 ]
机构
[1] Rakuten Grp Inc, Rakuten Inst Technol, Tokyo, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We demonstrate a simple yet effective approach to augmenting training data for multilingual named entity recognition using machine translation. The named entity spans from the original sentences are transferred to the translations via word alignment and then filtered with the baseline recognizer to retain high quality annotations. The proposed data augmentation approach improves the baseline performance of XLM-Roberta on the multilingual dataset.
引用
收藏
页码:1718 / 1722
页数:5
相关论文
共 50 条
  • [31] KDDIE at SemEval-2023 Task 2: External Knowledge Injection for Named Entity Recognition
    Martin, Caleb
    Yang, Huichen
    Hsu, William
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1498 - 1501
  • [32] PoSh at SemEval-2023 Task 10: Explainable Detection of Online Sexism
    Sriram, Shruti
    Chandran, Padma Pooja
    Shrijith, M. R.
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1276 - 1281
  • [33] Brooke-English at SemEval-2023 Task 5: Clickbait Spoiling
    Tang, Shirui
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 64 - 76
  • [34] CKingCoder at SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis
    Kumar, Harish B.
    Naveen, D.
    Prem, B.
    Aarthi, S.
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2009 - 2013
  • [35] GPL at SemEval-2023 Task 1: WordNet and CLIP to Disambiguate Images
    Zhang, Shibingfeng
    Nath, Shantanu
    Mazzaccara, Davide
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1592 - 1597
  • [36] ROZAM at SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis
    Rostamkhani, Mohammadmostafa
    Zamaninejad, Ghazal
    Eetemadi, Sauleh
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2029 - 2032
  • [37] IXA at SemEval-2023 Task 2: Baseline Xlm-Roberta-base Approach
    Andres Santamaria, Edgar
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 379 - 381
  • [38] NLPeople at SemEval-2023 Task 2: A Staged Approach for Multilingual Named Entity Recognition
    Elkaref, Mohab
    Herr, Nathan
    Tanaka, Shinnosuke
    De Mel, Geeth
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1148 - 1153
  • [39] Coco at SemEval-2023 Task 10: Explainable Detection of Online Sexism
    Guo, Kangshuai
    Ma, Ruipeng
    Luo, Shichao
    Wang, Yan
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 469 - 476
  • [40] TohokuNLP at SemEval-2023 Task 5: Clickbait Spoiling via Simple Seq2seq Generation and Ensembling
    Kurita, Hiroto
    Ito, Ikumi
    Funayama, Hiroaki
    Sasaki, Shota
    Moriya, Shoji
    Ye Mengyu
    Kokuta, Kazuma
    Hatakeyama, Ryujin
    Sone, Shusaku
    Inui, Kentaro
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1756 - 1762