Team QUST at SemEval-2024 Task 8: A Comprehensive Study of Monolingual and Multilingual Approaches for Detecting AI-generated Text

被引:0
|
作者
Xu, Xiaoman [1 ]
Li, Xiangrun [1 ]
Wang, Taihang [1 ]
Tian, Jianxiang [1 ]
Jiang, Ye [1 ]
机构
[1] Qingdao Univ Sci & Technol, Coll Informat Sci & Technol, Qingdao, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the participation of team QUST in Task 8 SemEval 2024. We first performed data augmentation and cleaning on the dataset to enhance model training efficiency and accuracy. In the monolingual task, we evaluated traditional deep-learning methods, multiscale positive-unlabeled framework (MPU), fine-tuning, adapters and ensemble methods. Then, we selected the top-performing models based on their accuracy from the monolingual models and evaluated them in subtasks A and B. The final model construction employed a stacking ensemble that combined fine-tuning with MPU. Our system achieved 8th (scored 8th in terms of accuracy, officially ranked 13th) place in the official test set in multilingual settings of subtask A. We release our system code at:https://github.com/warmth27/SemEval2024_QUST
引用
收藏
页码:463 / 470
页数:8
相关论文
共 50 条
  • [1] CUNLP at SemEval-2024 Task 8: Classify Human and AI Generated Text
    Pranjal, Aggarwal
    Deepanshu, Sachdeva
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1 - 6
  • [2] Team QUST at SemEval-2023 Task 3: A Comprehensive Study of Monolingual and Multilingual Approaches for Detecting Online News Genre, Framing and Persuasion Techniques
    Jiang, Ye
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 300 - 306
  • [3] Team MLab at SemEval-2024 Task 8: Analyzing Encoder Embeddings for Detecting LLM-generated Text
    Li, Kevin
    Hasanaliyev, Kenan
    Zhu, Sally
    Altshuler, George
    Eberts, Alden
    Chen, Eric
    Wang, Kate
    Xia, Emily
    Browne, Eli
    Chen, Ian
    Eren, Umut
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1463 - 1467
  • [4] SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
    Wang, Yuxia
    Mansurov, Jonibek
    Ivanov, Petar
    Su, Jinyan
    Shelmanov, Artem
    Tsvigun, Akim
    Afzal, Osama Mohammed
    Mahmoud, Tarek
    Puccetti, Giovanni
    Arnold, Thomas
    Whitehouse, Chenxi
    Aji, Alham Fikri
    Habash, Nizar
    Gurevych, Iryna
    Nakov, Preslav
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 2057 - 2079
  • [5] Team AT at SemEval-2024 Task 8: Machine-Generated Text Detection with Semantic Embeddings
    Wei, Yuchen
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 492 - 496
  • [6] Mast Kalandar at SemEval-2024 Task 8: On the Trail of Textual Origins: RoBERTa-BiLSTM Approach to Detect AI-Generated Text
    Bafna, Jainit Sushil
    Mittal, Hardik
    Sethia, Suyash
    Shrivastava, Manish
    Mamidi, Radhika
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1627 - 1633
  • [7] Team Innovative at SemEval-2024 Task 8: Multigenerator, Multidomain, and Multilingual Black-Box Machine-Generated Text Detection
    Sharma, Surbhi
    Mansuri, Irfan
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1172 - 1176
  • [8] Mast Kalandar at SemEval-2024 Task 8: On the Trail of Textual Origins: RoBERTa-BiLSTM Approach to Detect AI-Generated Text
    Bafna, Jainit Sushil
    Mittal, Hardik
    Sethia, Suyash
    Shrivastava, Manish
    Mamidi, Radhika
    arXiv,
  • [9] I2C-Huelva at SemEval-2024 Task 8: Boosting AI-Generated Text Detection with Multimodal Models and Optimized Ensembles
    Pena, Alberto Rodero
    Vazquez, Jacinto Mata
    Alvarez, Victoria Pachon
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 845 - 852
  • [10] FI Group at SemEval-2024 Task 8: A Syntactically Motivated Architecture for Multilingual Machine-Generated Text Detection
    Ben-Fares, Maha
    Zaratiana, Urchade
    Hernandez, Simon D.
    Holat, Pierre
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1166 - 1171