Team QUST at SemEval-2024 Task 8: A Comprehensive Study of Monolingual and Multilingual Approaches for Detecting AI-generated Text

被引:0
|
作者
Xu, Xiaoman [1 ]
Li, Xiangrun [1 ]
Wang, Taihang [1 ]
Tian, Jianxiang [1 ]
Jiang, Ye [1 ]
机构
[1] Qingdao Univ Sci & Technol, Coll Informat Sci & Technol, Qingdao, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the participation of team QUST in Task 8 SemEval 2024. We first performed data augmentation and cleaning on the dataset to enhance model training efficiency and accuracy. In the monolingual task, we evaluated traditional deep-learning methods, multiscale positive-unlabeled framework (MPU), fine-tuning, adapters and ensemble methods. Then, we selected the top-performing models based on their accuracy from the monolingual models and evaluated them in subtasks A and B. The final model construction employed a stacking ensemble that combined fine-tuning with MPU. Our system achieved 8th (scored 8th in terms of accuracy, officially ranked 13th) place in the official test set in multilingual settings of subtask A. We release our system code at:https://github.com/warmth27/SemEval2024_QUST
引用
收藏
页码:463 / 470
页数:8
相关论文
共 50 条
  • [31] Sharif-MGTD at SemEval-2024 Task 8: A Transformer-Based Approach to Detect Machine Generated Text
    Ebrahimi, Seyedeh Fatemeh
    Azari, Karim Akhavan
    Iravani, Amirmasoud
    Qazvini, Arian
    Sadeghi, Pouya
    Taghavi, Zeinab Sadat
    Sameti, Hossein
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 565 - 572
  • [32] Team MGTD4ADL at SemEval-2024 Task 8: Leveraging (Sentence) Transformer Models with Contrastive Learning for Identifying Machine-Generated Text
    Chen, Huixin
    Buessing, Jan
    Ruegamer, David
    Nie, Ercong
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1711 - 1718
  • [33] Team jelarson at SemEval 2024 Task 8: Predicting Boundary Line Between Human and Machine Generated Text
    Larson, Joseph
    Tyers, Francis
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 477 - 484
  • [34] NCL-UoR at SemEval-2024 Task 8: Fine-tuning Large Language Models for Multigenerator, Multidomain, and Multilingual Machine-Generated Text Detection
    Xiong, Feng
    Markchom, Thanet
    Zheng, Ziwei
    Jung, Subin
    Ojha, Varun
    Liang, Huizhi
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 163 - 169
  • [35] MasonTigers at SemEval-2024 Task 8: Performance Analysis of Transformer-based Models on Machine-Generated Text Detection
    Puspo, Sadiya Sayara Chowdhury
    Raihan, Md Nishat
    Goswami, Dhiman
    Bin Emran, Al Nahian
    Ganguly, Amrita
    Uzuner, Ozlem
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1364 - 1372
  • [36] AISPACE at SemEval-2024 task 8: A Class-balanced Soft-voting System for Detecting Multi-generator Machine-generated Text
    Gu, Renhua
    Meng, Xiangfeng
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1476 - 1481
  • [37] HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?
    Dipta, Shubhashis Roy
    Shahriar, Sadat
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 485 - 491
  • [38] TueSents at SemEval-2024 Task 8: Predicting the Shift from Human Authorship to Machine-generated Output in a Mixed Text
    Pickard, Valentin
    Do, Hoa
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 829 - 832
  • [39] SemEval-2024 Task 8: Weighted Layer Averaging RoBERTa for Black-Box Machine-Generated Text Detection
    Datta, Ayan
    Chandramania, Aryan
    Mamidi, Radhika
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1623 - 1626
  • [40] SuteAlbastre at SemEval-2024 Task 4: Predicting Propaganda Techniques in Multilingual Memes using Joint Text and Vision Transformers
    Anghelina, Ion-Marian
    Buta, Gabriel-Sebastian
    Enache, Alexandru
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 443 - 449