AlphaIntellect at SemEval-2024 Task 6: Detection of Hallucinations in Generated Text

被引：0

作者：

Choudhury, Sohan ^{[1
]}

Saha, Priyam ^{[2
]}

Ray, Subharthi ^{[2
]}

Das, Shankha Shubhra ^{[2
]}

Das, Dipankar ^{[2
]}

机构：

[1] KIIT, Bhubaneswar, India

[2] Jadavpur Univ, Kolkata, India

来源：

PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024 | 2024年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

One major issue in natural language generation (NLG) models is detecting hallucinations (semantically inaccurate outputs). This study investigates a hallucination detection system designed for three distinct NLG tasks: definition modeling, paraphrase generation, and machine translation. The system uses feedforward neural networks for classification and SentenceTransformer models for similarity scores and sentence embeddings. Even though the SemEval-2024 benchmark is showing good results, there is still room for improvement. Promising paths towards improving performance include considering multi-task learning methods, including strategies for handling out-of-domain data and minimizing bias, and investigating sophisticated architectures.

引用

页码：952 / 958

页数：7

共 50 条

[31] Werkzeug at SemEval-2024 Task 8: LLM-Generated Text Detection via Gated Mixture-of-Experts Fine-Tuning
Wu, Youlin
Wang, Kaichun
Ma, Kai
Yang, Liang
Lin, Hongfei
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 547 - 552
[32] RKadiyala at SemEval-2024 Task 8: Black-Box Word-Level Text Boundary Detection in Partially Machine Generated Texts
Kadiyala, Ram Mohan Rao
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 511 - 519
[33] Pollice Verso at SemEval-2024 Task 6: The Roman Empire Strikes Back
Kobs, Konstantin
Pfister, Jan
Hotho, Andreas
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1529 - 1536
[34] GeminiPro at SemEval-2024 Task 9: BrainTeaser on Gemini
Choi, Kyu-Hyun
Na, Eung-Hoon
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1602 - 1606
[35] AILS-NTUA at SemEval-2024 Task 6: Efficient model tuning for hallucination detection and analysis
Griogoriadou, Natalia
Lymperaiou, Maria
Filandrianos, Giorgos
Stamou, Giorgos
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1549 - 1560
[36] UMUTeam at SemEval-2024 Task 6: Leveraging Zero-Shot Learning for Detecting Hallucinations and Related Observable Overgeneration Mistakes
Pan, Ronghao
Antonio Garcia-Diaz, Jose
Bernal-Beltran, Tomas
Valencia-Garcia, Rafael
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 675 - 681
[37] IUSTNLPLAB at SemEval-2024 Task 4: Multilingual Detection of Persuasion Techniques in Memes
Osoolian, Mohammad
Monazzah, Erfan Moosavi
Eetemadi, Sauleh
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1092 - 1096
[38] ShefCDTeam at SemEval-2024 Task 4: A Text-to-Text Model for Multi-Label Classification
Gibbons, Meredith
Mi, Maggie
Villavicencio, Aline
Song, Xingyi
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1860 - 1867
[39] I2C-Huelva at SemEval-2024 Task 8: Boosting AI-Generated Text Detection with Multimodal Models and Optimized Ensembles
Pena, Alberto Rodero
Vazquez, Jacinto Mata
Alvarez, Victoria Pachon
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 845 - 852
[40] HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?
Dipta, Shubhashis Roy
Shahriar, Sadat
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 485 - 491

← 1 2 3 4 5 →