HULAT at SemEval-2023 Task 9: Data Augmentation for Pre-trained Transformers Applied to Multilingual Tweet Intimacy Analysis

Cited by: 0
Author
Segura-Bedmar, Isabel [1]
Affiliation
[1] Univ Carlos III Madrid, Comp Sci Dept, Leganes, Spain
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper describes our participation in SemEval-2023 Task 9, Intimacy Analysis of Multilingual Tweets. We fine-tune several of the most popular transformer models on the training dataset together with synthetic data generated by different data augmentation techniques. During the development phase, XLM-T gave our best results, and data augmentation yielded only a very slight improvement. Our system ranked 27th out of 45 participating systems. Despite this modest overall ranking, it shows promising results for languages such as Portuguese, English, and Dutch. All our code is available at https://github.com/isegura/hulat_intimacy.
Pages: 177-183
Page count: 7