AlphaIntellect at SemEval-2024 Task 6: Detection of Hallucinations in Generated Text

被引:0
|
作者
Choudhury, Sohan [1 ]
Saha, Priyam [2 ]
Ray, Subharthi [2 ]
Das, Shankha Shubhra [2 ]
Das, Dipankar [2 ]
机构
[1] KIIT, Bhubaneswar, India
[2] Jadavpur Univ, Kolkata, India
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One major issue in natural language generation (NLG) models is detecting hallucinations (semantically inaccurate outputs). This study investigates a hallucination detection system designed for three distinct NLG tasks: definition modeling, paraphrase generation, and machine translation. The system uses feedforward neural networks for classification and SentenceTransformer models for similarity scores and sentence embeddings. Even though the SemEval-2024 benchmark is showing good results, there is still room for improvement. Promising paths towards improving performance include considering multi-task learning methods, including strategies for handling out-of-domain data and minimizing bias, and investigating sophisticated architectures.
引用
收藏
页码:952 / 958
页数:7
相关论文
共 50 条
  • [31] Werkzeug at SemEval-2024 Task 8: LLM-Generated Text Detection via Gated Mixture-of-Experts Fine-Tuning
    Wu, Youlin
    Wang, Kaichun
    Ma, Kai
    Yang, Liang
    Lin, Hongfei
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 547 - 552
  • [32] RKadiyala at SemEval-2024 Task 8: Black-Box Word-Level Text Boundary Detection in Partially Machine Generated Texts
    Kadiyala, Ram Mohan Rao
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 511 - 519
  • [33] Pollice Verso at SemEval-2024 Task 6: The Roman Empire Strikes Back
    Kobs, Konstantin
    Pfister, Jan
    Hotho, Andreas
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1529 - 1536
  • [34] GeminiPro at SemEval-2024 Task 9: BrainTeaser on Gemini
    Choi, Kyu-Hyun
    Na, Eung-Hoon
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1602 - 1606
  • [35] AILS-NTUA at SemEval-2024 Task 6: Efficient model tuning for hallucination detection and analysis
    Griogoriadou, Natalia
    Lymperaiou, Maria
    Filandrianos, Giorgos
    Stamou, Giorgos
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1549 - 1560
  • [36] UMUTeam at SemEval-2024 Task 6: Leveraging Zero-Shot Learning for Detecting Hallucinations and Related Observable Overgeneration Mistakes
    Pan, Ronghao
    Antonio Garcia-Diaz, Jose
    Bernal-Beltran, Tomas
    Valencia-Garcia, Rafael
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 675 - 681
  • [37] IUSTNLPLAB at SemEval-2024 Task 4: Multilingual Detection of Persuasion Techniques in Memes
    Osoolian, Mohammad
    Monazzah, Erfan Moosavi
    Eetemadi, Sauleh
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1092 - 1096
  • [38] ShefCDTeam at SemEval-2024 Task 4: A Text-to-Text Model for Multi-Label Classification
    Gibbons, Meredith
    Mi, Maggie
    Villavicencio, Aline
    Song, Xingyi
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1860 - 1867
  • [39] I2C-Huelva at SemEval-2024 Task 8: Boosting AI-Generated Text Detection with Multimodal Models and Optimized Ensembles
    Pena, Alberto Rodero
    Vazquez, Jacinto Mata
    Alvarez, Victoria Pachon
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 845 - 852
  • [40] HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?
    Dipta, Shubhashis Roy
    Shahriar, Sadat
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 485 - 491