Sakura at SemEval-2023 Task 2: Data Augmentation via Translation

被引:0
|
作者
Poncelas, Alberto [1 ]
Tkachenko, Maksim [1 ]
Htun, Ohnmar [1 ]
机构
[1] Rakuten Grp Inc, Rakuten Inst Technol, Tokyo, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We demonstrate a simple yet effective approach to augmenting training data for multilingual named entity recognition using machine translation. The named entity spans from the original sentences are transferred to the translations via word alignment and then filtered with the baseline recognizer to retain high quality annotations. The proposed data augmentation approach improves the baseline performance of XLM-Roberta on the multilingual dataset.
引用
收藏
页码:1718 / 1722
页数:5
相关论文
共 50 条
  • [21] Prodicus at SemEval-2023 Task 4: Enhancing Human Value Detection with Data Augmentation and Fine-Tuned Language Models
    Monazzah, Erfan Moosavi
    Eetemadi, Sauleh
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2033 - 2038
  • [22] ZBL2W at SemEval-2023 Task 9: A Multilingual Fine-tuning Model with Data Augmentation for Tweet Intimacy Analysis
    Zhang, Hao
    Wu, Youlin
    Lu, Junyu
    Bai, Zewen
    Wu, Jiangming
    Lin, Hongfei
    Zhang, Shaowu
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 770 - 775
  • [23] SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)
    Muhammad, Shamsuddeen Hassan
    Abdulmumin, Idris
    Yimam, Seid Muhie
    Adelani, David Ifeoluwa
    Ahmad, Ibrahim Said
    Ousidhoum, Nedjma
    Ayele, Abinew Ali
    Mohammad, Saif M.
    Beloucif, Meriem
    Ruder, Sebastian
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2319 - 2337
  • [24] HULAT at SemEval-2023 Task 10: Data Augmentation for Pre-trained Transformers Applied to the Detection of Sexism in Social Media
    Segura-Bedmar, Isabel
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 184 - 192
  • [25] RIGA at SemEval-2023 Task 2: NER enhanced with GPT-3
    Mukans, Eduards
    Barzdins, Guntis
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 331 - 339
  • [26] LTRC at SemEval-2023 Task 6: Experiments with Ensemble Embeddings
    Baswani, Pavan
    Adibhatla, Hiranmai Sri
    Shrivastava, Manish
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 841 - 846
  • [27] WKU_NLP at SemEval-2023 Task 9: Translation Augmented Multilingual Tweet Intimacy Analysis
    Zheng, Qinyuan
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1525 - 1530
  • [28] Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation
    Nigam, Shubham Kumar
    Deroy, Aniket
    Shallum, Noel
    Mishra, Ayush Kumar
    Roy, Anup
    Mishra, Shubham Kumar
    Bhattacharya, Arnab
    Ghosh, Saptarshi
    Ghosh, Kripabandhu
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1293 - 1303
  • [29] SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis
    Pei, Jiaxin
    Silva, Vitor
    Bos, Maarten
    Liu, Yozen
    Neves, Leonardo
    Jurgens, David
    Barbieri, Francesco
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2235 - 2246
  • [30] UMUTeam and SINAI at SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis using Multilingual Large Language Models and Data Augmentation
    Garcia-Diaz, Jose Antonio
    Pan, Ronghao
    Jimenez Zafra, Salud Maria
    Martin-Valdivia, Maria-Teresa
    Urena-Lopez, L. Alfonso
    Valencia-Garcia, Rafael
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 293 - 299