FI Group at SemEval-2024 Task 8: A Syntactically Motivated Architecture for Multilingual Machine-Generated Text Detection

被引：0

作者：

Ben-Fares, Maha ^{[1
,2
]}

Zaratiana, Urchade ^{[2
,3
]}

Hernandez, Simon D. ^{[2
]}

Holat, Pierre ^{[2
,3
]}

机构：

[1] CY Cergy Paris Univ Pontoise, ETIS, Cergy, France

[2] FI Grp, Puteaux La Defense, France

[3] Univ Sorbonne Paris Nord, LIPN, Villetaneuse, France

来源：

PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024 | 2024年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present the description of our proposed system for Subtask A - multilingual track at SemEval-2024 Task 8, which aims to classify if text has been generated by an AI or Human. Our approach treats binary text classification as token-level prediction, with the final classification being the average of token-level predictions. Through the use of rich representations of pre-trained transformers, our model is trained to selectively aggregate information from across different layers to score individual tokens, given that each layer may contain distinct information. Notably, our model demonstrates competitive performance on the test dataset, achieving an accuracy score of 95.8%. Furthermore, it secures the 2nd position in the multilingual track of Subtask A, with a mere 0.1% behind the leading system.

引用

页码：1166 / 1171

页数：6

共 50 条

[1] SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
Wang, Yuxia
Mansurov, Jonibek
Ivanov, Petar
Su, Jinyan
Shelmanov, Artem
Tsvigun, Akim
Afzal, Osama Mohammed
Mahmoud, Tarek
Puccetti, Giovanni
Arnold, Thomas
Whitehouse, Chenxi
Aji, Alham Fikri
Habash, Nizar
Gurevych, Iryna
Nakov, Preslav
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 2057 - 2079
[2] Team AT at SemEval-2024 Task 8: Machine-Generated Text Detection with Semantic Embeddings
Wei, Yuchen
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 492 - 496
[3] KInIT at SemEval-2024 Task 8: Fine-tuned LLMs for Multilingual Machine-Generated Text Detection
Spiegel, Michal
Macko, Dominik
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 558 - 564
[4] NewbieML at SemEval-2024 Task 8: Ensemble Approach for Multidomain Machine-Generated Text Detection
Tran, Bao
Nhi Tran
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 354 - 360
[5] Team Innovative at SemEval-2024 Task 8: Multigenerator, Multidomain, and Multilingual Black-Box Machine-Generated Text Detection
Sharma, Surbhi
Mansuri, Irfan
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1172 - 1176
[6] UMUTeam at SemEval-2024 Task 8: Combining Transformers and Syntax Features for Machine-Generated Text Detection
Pan, Ronghao
Antonio Garcia-Diaz, Jose
Jose Vivancos-Vicente, Pedro
Valencia-Garcia, Rafael
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 697 - 702
[7] TueCICL at SemEval-2024 Task 8: Resource-efficient approaches for machine-generated text detection
Stuhlinger, Daniel
Winkler, Aron
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1597 - 1601
[8] BadRock at SemEval-2024 Task 8: DistilBERT to Detect Multigenerator, Multidomain and Multilingual Black-Box Machine-Generated Text
Siino, Marco
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 239 - 245
[9] SemEval-2024 Task 8: Weighted Layer Averaging RoBERTa for Black-Box Machine-Generated Text Detection
Datta, Ayan
Chandramania, Aryan
Mamidi, Radhika
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1623 - 1626
[10] MasonTigers at SemEval-2024 Task 8: Performance Analysis of Transformer-based Models on Machine-Generated Text Detection
Puspo, Sadiya Sayara Chowdhury
Raihan, Md Nishat
Goswami, Dhiman
Bin Emran, Al Nahian
Ganguly, Amrita
Uzuner, Ozlem
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1364 - 1372

← 1 2 3 4 5 →