MAMET at SemEval-2024 Task 7: Supervised Enhanced Reasoning Agent Model

被引:0
|
作者
Kalantari, Mahmood [1 ]
Feghhi, Mehdi [1 ]
Alamooti, Taha Khany [1 ]
机构
[1] Iran Univ Sci & Technol, Tehran, Iran
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the intersection of language understanding and numerical reasoning, a formidable challenge arises in natural language processing (NLP). Our study delves into the realm of NumEval, focusing on numeral-aware language understanding and generation using the QP, QQA and QNLI datasets(1). We harness the potential of the Orca2 model, Fine-tuning it in both normal and Chain-of-Thought modes with prompt tuning to enhance accuracy. Despite initial conjectures, our findings reveal intriguing disparities in model performance. While standard training methodologies yield commendable accuracy rates. The core contribution of this work lies in its elucidation of the intricate interplay between dataset sequencing and model performance. We expected to achieve a general model with the Fine Tuning model on the QP and QNLI datasets respectively, which has good accuracy in all three datasets. However, this goal was not achieved, and in order to achieve this goal, we introduce our structure 1.
引用
收藏
页码:1058 / 1063
页数:6
相关论文
共 50 条
  • [21] IASBS at SemEval-2024 Task 10: Delving into Emotion Discovery and Reasoning in Code-Mixed Conversations
    Tareh, Mehrzad
    Mohandesi, Aydin
    Ansari, Ebrahim
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1229 - 1238
  • [22] CRCL at SemEval-2024 Task 2: Simple prompt optimizations
    Brutti-Mairesse, Clement
    Verlingue, Loic
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 437 - 442
  • [23] INGEOTEC at SemEval-2024 Task 10: Bag of Words Classifiers
    Graff, Mario
    Tellez, Eric S.
    Paredes, Mireya
    Moctezuma, Daniela
    Ortiz-Bejar, Jose
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1115 - 1120
  • [24] SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense
    Jiang, Yifan
    Ilievski, Filip
    Ma, Kaixin
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1994 - 2008
  • [25] KnowComp at SemEval-2024 Task 9: Conceptualization-Augmented Prompting with Large Language Models for Lateral Reasoning
    Wang, Weiqi
    Xu, Baixuan
    Shi, Haochen
    Bai, Jiaxin
    Hu, Qi
    Song, Yangqiu
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1639 - 1645
  • [26] Lisbon Computational Linguists at SemEval-2024 Task 2: Using A Mistral-7B Model and Data Augmentation
    Guimaraes, Artur
    Martins, Bruno
    Magalhaes, Joao
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1280 - 1287
  • [27] DaVinci at SemEval-2024 Task 9: Few-shot prompting GPT-3.5 for Unconventional Reasoning
    Mathur, Suyash Vardhan
    Jindal, Akshett Rai
    Shrivastava, Manish
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1212 - 1216
  • [28] 0x.Yuan at SemEval-2024 Task 5: Enhancing Legal Argument Reasoning with Structured Prompts
    Lu, Yu-An
    Kao, Hung-Yu
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 385 - 390
  • [29] OPDAI at SemEval-2024 Task 6: Small LLMs can Accelerate Hallucination Detection with Weakly Supervised Data
    Wei, Chengcheng
    Chen, Ze
    Fang, Songtan
    He, Jiarong
    Gao, Max
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 721 - 729
  • [30] AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning
    Ghashami, Mina
    Mishra, Soumya Smruti
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1436 - 1442