HULAT at SemEval-2023 Task 9: Data Augmentation for Pre-trained Transformers Applied to Multilingual Tweet Intimacy Analysis

被引：0

作者：

Segura-Bedmar, Isabel ^{[1
]}

机构：

[1] Univ Carlos III Madrid, Comp Sci Dept, Leganes, Spain

来源：

17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes our participation in SemEval-2023 Task 9, Intimacy Analysis of Multilingual Tweets. We fine-tune some of the most popular transformer models with the training dataset and synthetic data generated by different data augmentation techniques. During the development phase, our best results were obtained by using XLM-T. Data augmentation techniques provide a very slight improvement in the results. Our system ranked in the 27th position out of the 45 participating systems. Despite its modest results, our system shows promising results in languages such as Portuguese, English, and Dutch. All our code is available in the repository https://github.com/isegura/hulat_intimacy.

引用

页码：177 / 183

页数：7

共 50 条

[41] CAISA at SemEval-2023 Task 8: Counterfactual Data Augmentation for Mitigating Class Imbalance in Causal Claim Identification
Karimi, Akbar
Flek, Lucie
17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2118 - 2123
[42] mCPT at SemEval-2023 Task 3: Multilingual Label-Aware Contrastive Pre-Training of Transformers for Few- and Zero-shot Framing Detection
Reiter-Haas, Markus
Ertl, Alexander
Innerhofer, Kevin
Lex, Elisabeth
17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 941 - 949
[43] SarcasmDet at SemEval-2022 Task 6: Detecting Sarcasm using Pre-trained Transformers in English and Arabic Languages
Abdullah, Malak
Faraj, Dalya
Swedat, Safa
Khrais, Jumana
Al-Ayyoub, Mahmoud
PROCEEDINGS OF THE 16TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2022, 2022, : 1025 - 1030
[44] CodeNLP at SemEval-2023 Task 2: Data Augmentation for Named Entity Recognition by Combination of Sequence Generation Strategies
Marcinczuk, Michal
Walentynowicz, Wiktor
17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1798 - 1804
[45] Appeal for attention at SemEval-2023 Task 3: Data augmentation and extension strategies for detection of online news persuasion techniques
Sergiu, Amihaesei
Laura, Cornei
George, Stoica
17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 616 - 623
[46] Billie-Newman at SemEval-2023 Task 5: Clickbait Classification and Question Answering with Pre-Trained Language Models, Named Entity Recognition and Rule-Based Approaches
Kruff, Andreas
Tran, Anh Huy Matthias
17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1542 - 1550
[47] MasonNLP plus at SemEval-2023 Task 8: Extracting Medical Questions, Experiences and Claims from Social Media using Knowledge-Augmented Pre-trained Language Models
Ramachandran, Giridhar Kaushik
Gangavarapu, Haritha
Lybarger, Kevin
Uzuner, Ozlem
17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2143 - 2152
[48] Generative Pre-trained Transformers for Coding Text Data? An Analysis with Classroom Orchestration Data
Amarasinghe, Ishari
Marques, Francielle
Ortiz-Beltran, Ariel
Hernandez-Leo, Davinia
RESPONSIVE AND SUSTAINABLE EDUCATIONAL FUTURES, EC-TEL 2023, 2023, 14200 : 32 - 43
[49] NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis
Wang, Mingyang
Adel, Heike
Lange, Lukas
Stroetgen, Jannik
Schuetze, Hinrich
17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 488 - 497
[50] Prodicus at SemEval-2023 Task 4: Enhancing Human Value Detection with Data Augmentation and Fine-Tuned Language Models
Monazzah, Erfan Moosavi
Eetemadi, Sauleh
17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2033 - 2038

← 1 2 3 4 5 →