Sakura at SemEval-2023 Task 2: Data Augmentation via Translation

被引:0
|
作者
Poncelas, Alberto [1 ]
Tkachenko, Maksim [1 ]
Htun, Ohnmar [1 ]
机构
[1] Rakuten Grp Inc, Rakuten Inst Technol, Tokyo, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We demonstrate a simple yet effective approach to augmenting training data for multilingual named entity recognition using machine translation. The named entity spans from the original sentences are transferred to the translations via word alignment and then filtered with the baseline recognizer to retain high quality annotations. The proposed data augmentation approach improves the baseline performance of XLM-Roberta on the multilingual dataset.
引用
收藏
页码:1718 / 1722
页数:5
相关论文
共 50 条
  • [41] KINLP at SemEval-2023 Task 12: Kinyarwanda Tweet Sentiment Analysis
    Nzeyimana, Antoine
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 718 - 723
  • [42] FMI-SU at SemEval-2023 Task 7: Two-level Entailment Classification of Clinical Trials Enhanced by Contextual Data Augmentation
    Vassileva, Sylvia
    Grazhdanski, Georgi
    Boytcheva, Svetla
    Koytchev, Ivan
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1454 - 1462
  • [43] Gallagher at SemEval-2023 Task 5: Tackling Clickbait with Seq2Seq Models
    Bilgis, Tugay
    Bozdag, Nimet Beyza
    Bethard, Steven
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1650 - 1655
  • [44] SAB at SemEval-2023 Task 2: Does Linguistic Information Aid in Named Entity Recognition?
    Biales, Siena
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1131 - 1137
  • [45] SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data
    Jullien, Mael
    Valentino, Marco
    Frost, Hannah
    O'Regan, Paul
    Landers, Donal
    Freitas, Andre
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2216 - 2226
  • [46] iREL at SemEval-2023 Task 9: Improving understanding of multilingual Tweets using Translation-Based Augmentation and Domain Adapted Pre-Trained Models
    Singh, Bhavyajeet
    Maity, Ankita
    Kandru, Pavan
    Hari, Aditya
    Varma, Vasudeva
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2052 - 2057
  • [47] DUTH at SemEval-2023 Task 9: An Ensemble Approach for Twitter Intimacy Analysis
    Arampatzis, Georgios
    Perifanis, Vasileios
    Symeonidis, Symeon
    Arampatzis, Avi
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1225 - 1230
  • [48] Sefamerve at SemEval-2023 Task 12: Semantic Evaluation of Rarely Studied Languages
    Delil, Selman
    Kuyumcu, Birol
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 512 - 516
  • [49] SemEval-2023 Task 4: ValueEval: Identification of Human Values Behind Arguments
    Kiesel, Johannes
    Alshomary, Milad
    Mirzakhmedova, Nailia
    Heinrich, Maximilian
    Handke, Nicolas
    Wachsmuth, Henning
    Stein, Benno
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2287 - 2303
  • [50] RGAT at SemEval-2023 Task 2: Named Entity Recognition Using Graph Attention Network
    Chakraborty, Abir
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 163 - 170