Long Context Question Answering via Supervised Contrastive Learning

被引:0
|
作者
Caciularu, Avi [1 ]
Dagan, Ido [1 ]
Goldberger, Jacob [2 ]
Cohan, Arman [3 ,4 ]
机构
[1] Bar Ilan Univ, Comp Sci Dept, Ramat Gan, Israel
[2] Bar Ilan Univ, Fac Engn, Ramat Gan, Israel
[3] Allen Inst AI, Seattle, WA USA
[4] Univ Washington, Paul G Allen Sch Comp Sci, Seattle, WA 98195 USA
基金
以色列科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Long-context question answering (QA) tasks require reasoning over a long document or multiple documents. Addressing these tasks often benefits from identifying a set of evidence spans (e.g., sentences), which provide supporting evidence for answering the question. In this work, we propose a novel method for equipping long-context QA models with an additional sequence-level objective for better identification of the supporting evidence. We achieve this via an additional contrastive supervision signal in finetuning, where the model is encouraged to explicitly discriminate supporting evidence sentences from negative ones by maximizing question-evidence similarity. The proposed additional loss exhibits consistent improvements on three different strong long-context transformer models, across two challenging question answering benchmarks - Hot-potQA and QAsper.(1)
引用
收藏
页码:2872 / 2879
页数:8
相关论文
共 50 条
  • [11] Robust video question answering via contrastive cross-modality representation learning
    Xun YANG
    Jianming ZENG
    Dan GUO
    Shanshan WANG
    Jianfeng DONG
    Meng WANG
    [J]. Science China(Information Sciences), 2024, 67 (10) : 211 - 226
  • [12] Contrastive Video Question Answering via Video Graph Transformer
    Xiao, Junbin
    Zhou, Pan
    Yao, Angela
    Li, Yicong
    Hong, Richang
    Yan, Shuicheng
    Chua, Tat-Seng
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13265 - 13280
  • [13] Multi-level Contrastive Learning for Commonsense Question Answering
    Fang, Quntian
    Huang, Zhen
    Zhang, Ziwen
    Hu, Minghao
    Hu, Biao
    Wang, Ankun
    Li, Dongsheng
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2023, 2023, 14120 : 318 - 331
  • [14] PGCL: Prompt guidance and self-supervised contrastive learning-based method for Visual Question Answering
    Gao, Ling
    Zhang, Hongda
    Liu, Yiming
    Sheng, Nan
    Feng, Haotian
    Xu, Hao
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
  • [15] Supervised Transfer Learning for Product Information Question Answering
    Tuan Manh Lai
    Trung Bui
    Lipka, Nedim
    Li, Sheng
    [J]. 2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 1109 - 1114
  • [16] Knowledge Graph Question Answering based on Contrastive Learning and Feature Transformation
    Hu, Xinrong
    Huang, Jingjing
    Liu, Junping
    Zhu, Qiang
    Yang, Jie
    [J]. 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 608 - 615
  • [17] Contrastive Representation Learning for Conversational Question Answering over Knowledge Graphs
    Kacupaj, Endri
    Singh, Kuldeep
    Maleshkova, Maria
    Lehmann, Jens
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 925 - 934
  • [18] HCCL: Hierarchical Counterfactual Contrastive Learning for Robust Visual Question Answering
    Laboratory of Cognition and Decision Intelligence for Complex Systems, Institute of Automation, Chinese Academy of Sciences, China
    不详
    [J]. ACM Trans. Multimedia Comput. Commun. Appl., 2024, 10
  • [19] ALSA: Adversarial Learning of Supervised Attentions for Visual Question Answering
    Liu, Yun
    Zhang, Xiaoming
    Zhao, Zhiyun
    Zhang, Bo
    Cheng, Lei
    Li, Zhoujun
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (06) : 4520 - 4533
  • [20] elBERto: Self-supervised commonsense learning for question answering
    Zhan, Xunlin
    Li, Yuan
    Dong, Xiao
    Liang, Xiaodan
    Hu, Zhiting
    Carin, Lawrence
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 258