FI Group at SemEval-2024 Task 8: A Syntactically Motivated Architecture for Multilingual Machine-Generated Text Detection

被引:0
|
作者
Ben-Fares, Maha [1 ,2 ]
Zaratiana, Urchade [2 ,3 ]
Hernandez, Simon D. [2 ]
Holat, Pierre [2 ,3 ]
机构
[1] CY Cergy Paris Univ Pontoise, ETIS, Cergy, France
[2] FI Grp, Puteaux La Defense, France
[3] Univ Sorbonne Paris Nord, LIPN, Villetaneuse, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present the description of our proposed system for Subtask A - multilingual track at SemEval-2024 Task 8, which aims to classify if text has been generated by an AI or Human. Our approach treats binary text classification as token-level prediction, with the final classification being the average of token-level predictions. Through the use of rich representations of pre-trained transformers, our model is trained to selectively aggregate information from across different layers to score individual tokens, given that each layer may contain distinct information. Notably, our model demonstrates competitive performance on the test dataset, achieving an accuracy score of 95.8%. Furthermore, it secures the 2nd position in the multilingual track of Subtask A, with a mere 0.1% behind the leading system.
引用
收藏
页码:1166 / 1171
页数:6
相关论文
共 50 条
  • [21] DUTh at SemEval 2024 Task 8: Comparing classic Machine Learning Algorithms and LLM based methods for Multigenerator, Multidomain and Multilingual Machine-Generated Text Detection
    Kyriakou, Theodora
    Maslaris, Ioannis
    Arampatzis, Avi
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1080 - 1086
  • [22] CLULab-UofA at SemEval-2024 Task 8: Detecting Machine-Generated Text Using Triplet-Loss-Trained Text Similarity and Text Classification
    Rezaei, MohammadHossein
    Kwon, Yeaeun
    Sanayei, Reza
    Singh, Abhyuday
    Bethard, Steven
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1498 - 1504
  • [23] CUNLP at SemEval-2024 Task 8: Classify Human and AI Generated Text
    Pranjal, Aggarwal
    Deepanshu, Sachdeva
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1 - 6
  • [24] TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques
    Urlana, Ashok
    Saibewar, Aditya
    Garlapati, Bala Mallikarjunarao
    Kumar, Charaka Vinayak
    Singh, Ajeet Kumar
    Chalamala, Srinivasa Rao
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 927 - 934
  • [25] Team MGTD4ADL at SemEval-2024 Task 8: Leveraging (Sentence) Transformer Models with Contrastive Learning for Identifying Machine-Generated Text
    Chen, Huixin
    Buessing, Jan
    Ruegamer, David
    Nie, Ercong
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1711 - 1718
  • [26] Groningen team D at SemEval-2024 Task 8: Exploring data generation and a combined model for fine-tuning LLMs for Multidomain Machine-Generated Text Detection
    Brekhof, Thijs
    Liu, Xuanyi
    Ruitenbeek, Joris
    Top, Niels
    Zhou, Yuwen
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 391 - 398
  • [27] SCaLAR at SemEval-2024 Task 8: Unmasking the machine : Exploring the power of RoBERTa Ensemble for Detecting Machine Generated Text
    Kumar, Anand M.
    Abhin, B.
    Murali, Sidhaarth Sredharan
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1135 - 1139
  • [28] AISPACE at SemEval-2024 task 8: A Class-balanced Soft-voting System for Detecting Multi-generator Machine-generated Text
    Gu, Renhua
    Meng, Xiangfeng
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1476 - 1481
  • [29] Team QUST at SemEval-2024 Task 8: A Comprehensive Study of Monolingual and Multilingual Approaches for Detecting AI-generated Text
    Xu, Xiaoman
    Li, Xiangrun
    Wang, Taihang
    Tian, Jianxiang
    Jiang, Ye
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 463 - 470
  • [30] SemEval-2024 Task 4: Multilingual Detection of Persuasion Techniques in Memes
    Dimitrov, Dimitar
    Alam, Firoj
    Hasanain, Maram
    Hasnat, Abul
    Silvestri, Fabrizio
    Nakov, Preslav
    Da San Martino, Giovanni
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 2009 - 2026