OPI at SemEval-2023 Task 9: A Simple But Effective Approach to Multilingual Tweet Intimacy Analysis

被引:0
|
作者
Dadas, Slawomir [1 ]
机构
[1] Natl Informat Proc Inst, Warsaw, Poland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes our submission to the SemEval-2023 multilingual tweet intimacy analysis shared task. The goal of the task was to assess the level of intimacy of Twitter posts in ten languages. The proposed approach consists of several steps. First, we perform in-domain pre-training to create a language model adapted to Twitter data. In the next step, we train an ensemble of regression models to expand the training set with pseudo-labeled examples. The extended dataset is used to train the final solution. Our method was ranked first in five out of ten language subtasks, obtaining the highest average score across all languages.
引用
收藏
页码:150 / 154
页数:5
相关论文
共 50 条
  • [1] SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis
    Pei, Jiaxin
    Silva, Vitor
    Bos, Maarten
    Liu, Yozen
    Neves, Leonardo
    Jurgens, David
    Barbieri, Francesco
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2235 - 2246
  • [2] CKingCoder at SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis
    Kumar, Harish B.
    Naveen, D.
    Prem, B.
    Aarthi, S.
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2009 - 2013
  • [3] ROZAM at SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis
    Rostamkhani, Mohammadmostafa
    Zamaninejad, Ghazal
    Eetemadi, Sauleh
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2029 - 2032
  • [4] Arizonans at SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis with XLM-T
    Bozdag, Nimet Beyza
    Bilgis, Tugay
    Bethard, Steven
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1656 - 1659
  • [5] WKU_NLP at SemEval-2023 Task 9: Translation Augmented Multilingual Tweet Intimacy Analysis
    Zheng, Qinyuan
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1525 - 1530
  • [6] Zhegu at SemEval-2023 Task 9: Exponential Penalty Mean Squared Loss for Multilingual Tweet Intimacy Analysis
    He, Pan
    Zhang, Yanru
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 318 - 323
  • [7] YNU-HPCC at SemEval-2023 Task 9: Pretrained Language Model for Multilingual Tweet Intimacy Analysis
    Cai, Qisheng
    Wang, Jin
    Zhang, Xuejie
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 733 - 738
  • [8] UMUTeam and SINAI at SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis using Multilingual Large Language Models and Data Augmentation
    Garcia-Diaz, Jose Antonio
    Pan, Ronghao
    Jimenez Zafra, Salud Maria
    Martin-Valdivia, Maria-Teresa
    Urena-Lopez, L. Alfonso
    Valencia-Garcia, Rafael
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 293 - 299
  • [9] HULAT at SemEval-2023 Task 9: Data Augmentation for Pre-trained Transformers Applied to Multilingual Tweet Intimacy Analysis
    Segura-Bedmar, Isabel
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 177 - 183
  • [10] NLP-LISAC at SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis via a Transformer-based approach and Data Augmentation
    Benlahbib, Abdessamad
    Alami, Hamza
    Boumhidi, Achraf
    Benslimane, Omar
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 121 - 124