MAMET at SemEval-2024 Task 7: Supervised Enhanced Reasoning Agent Model

Cited by: 0

Authors:

Kalantari, Mahmood [1]

Feghhi, Mehdi [1]

Alamooti, Taha Khany [1]

Affiliations:

[1] Iran Univ Sci & Technol, Tehran, Iran

Source:

PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024 | 2024

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

At the intersection of language understanding and numerical reasoning lies a formidable challenge for natural language processing (NLP). Our study addresses NumEval, focusing on numeral-aware language understanding and generation using the QP, QQA and QNLI datasets. We use the Orca2 model, fine-tuning it in both normal and Chain-of-Thought modes with prompt tuning to enhance accuracy. Contrary to our initial conjectures, our findings reveal disparities in model performance, even though standard training methodologies yield commendable accuracy rates. The core contribution of this work is its account of the interplay between dataset sequencing and model performance. We expected that fine-tuning the model on the QP and QNLI datasets in turn would yield a general model with good accuracy on all three datasets. However, this goal was not achieved, and to achieve it we introduce our structure.


Pages: 1058-1063

Page count: 6

50 entries in total

[1] Noot Noot at SemEval-2024 Task 7: Numerical Reasoning and Headline Generation
Bahad, Sankalp
Bhaskar, Yash
Krishnamurthy, Parameswari
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024: 913-917
[2] NumDecoders at SemEval-2024 Task 7: FlanT5 and GPT enhanced with CoT for Numerical Reasoning
Gongora, H. Andres Gonzalez
Hossain, Md Zobaer
Junaed, Jahedul Alam
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024: 1260-1268
[3] VHA at SemEval-2024 Task 7: Bridging Numerical Reasoning and Headline Generation for Enhanced Language Models
Harinieswari, V
Srimathi, T.
Vaishnavi, R.
Aarthi, S.
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024: 821-828
[4] SemEval-2024 Task 5: Argument Reasoning in Civil Procedure
Held, Lena
Habernal, Ivan
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024: 2027-2038
[5] Mistral at SemEval-2024 Task 5: Mistral 7B for Argument Reasoning in Civil Procedure
Siino, Marco
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024: 155-162
[6] ZXQ at SemEval-2024 Task 7: Fine-tuning GPT-3.5-Turbo for Numerical Reasoning
Qian, Zhen
Xu, Xiaofei
Zhang, Xiuzhen
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024: 218-223
[7] DeepPavlov at SemEval-2024 Task 3: Multimodal Large Language Models in Emotion Reasoning
Belikova, Julia
Kosenko, Dmitrii
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024: 1747-1757
[8] Team NP_PROBLEM at SemEval-2024 Task 7: Numerical Reasoning in Headline Generation with Preference Optimization
Rajpoot, Pawan Kumar
Chukamphaeng, Nut
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024: 716-720
[9] SSN_Semeval10 at SemEval-2024 Task 10: Emotion Discovery and Reasoning its Flip in Conversations
Rajesh, Antony A.
Abirami, Supriya A.
Aravindan, Chandrabose
Kumar, Senthil B.
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024: 553-557
[10] LinguisTech at SemEval-2024 Task 10: Emotion Discovery and Reasoning its Flip in Conversation
Alexandru, Mihaela
Ciocoiu, Calina-Georgiana
Maniga, Ioana
Ungureanu, Octavian
Gifu, Daniela
Trandabat, Diana
PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024: 412-419
