MAMET at SemEval-2024 Task 7: Supervised Enhanced Reasoning Agent Model

Cited by: 0
Authors
Kalantari, Mahmood [1 ]
Feghhi, Mehdi [1 ]
Alamooti, Taha Khany [1 ]
Affiliations
[1] Iran Univ Sci & Technol, Tehran, Iran
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
At the intersection of language understanding and numerical reasoning lies a formidable challenge for natural language processing (NLP). Our study addresses NumEval, focusing on numeral-aware language understanding and generation using the QP, QQA and QNLI datasets. We fine-tune the Orca2 model in both normal and Chain-of-Thought modes, with prompt tuning, to enhance accuracy. Despite initial conjectures, our findings reveal intriguing disparities in model performance, even though standard training methodologies yield commendable accuracy rates. The core contribution of this work lies in its elucidation of the interplay between dataset sequencing and model performance. We expected that fine-tuning the model on the QP and QNLI datasets in turn would yield a general model with good accuracy on all three datasets. However, this goal was not achieved, and in order to achieve it, we introduce our structure 1.
Pages: 1058-1063
Page count: 6