MAMET at SemEval-2024 Task 7: Supervised Enhanced Reasoning Agent Model

Cited by: 0

Authors:

Kalantari, Mahmood [1]

Feghhi, Mehdi [1]

Alamooti, Taha Khany [1]

Affiliation:

[1] Iran Univ Sci & Technol, Tehran, Iran

Source:

PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024 | 2024

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Combining language understanding with numerical reasoning poses a formidable challenge in natural language processing (NLP). Our study addresses NumEval, focusing on numeral-aware language understanding and generation using the QP, QQA, and QNLI datasets. We fine-tune the Orca2 model in both normal and Chain-of-Thought modes, with prompt tuning, to improve accuracy. Contrary to our initial conjectures, the results reveal disparities in model performance, even though standard training methodologies yield commendable accuracy rates. The core contribution of this work is its account of the interplay between dataset sequencing and model performance. We expected that fine-tuning the model on the QP and QNLI datasets in turn would yield a general model with good accuracy on all three datasets. This goal was not achieved, and to reach it we introduce our structure.


Pages: 1058-1063

Page count: 6
