MaxSimE: Explaining Transformer-based Semantic Similarity via Contextualized Best Matching Token Pairs

被引:0
|
作者
Brito, Eduardo [1 ]
Iser, Henri
机构
[1] Fraunhofer Inst Intelligent Anal & Informat Syst, St Augustin, Germany
关键词
explainable search; semantic similarity; ad-hoc explanations; neural models; trustworthy information retrieval;
D O I
10.1145/3539618.3592017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current semantic search approaches rely on black-box language models, such as BERT, which limit their interpretability and transparency. In this work, we propose MaxSimE, an explanation method for language models applied to measure semantic similarity. Our approach is inspired by the explainable-by-design ColBERT architecture and generates explanations by matching contextualized query tokens to the most similar tokens from the retrieved document according to the cosine similarity of their embeddings. Unlike existing post-hoc explanation methods, which may lack fidelity to the model and thus fail to provide trustworthy explanations in critical settings, we demonstrate that MaxSimE can generate faithful explanations under certain conditions and how it improves the interpretability of semantic search results on ranked documents from the LoTTe benchmark, showing its potential for trustworthy information retrieval.
引用
收藏
页码:2154 / 2158
页数:5
相关论文
共 16 条
  • [1] Scale-aware token-matching for transformer-based object detector
    Jung, Aecheon
    Hong, Sungeun
    Hyun, Yoonsuk
    PATTERN RECOGNITION LETTERS, 2024, 185 : 197 - 202
  • [2] Semantic Parameter Matching in Web APIs with Transformer-based Question Answering
    Kotstein, Sebastian
    Decker, Christian
    2023 IEEE INTERNATIONAL CONFERENCE ON SERVICE-ORIENTED SYSTEM ENGINEERING, SOSE, 2023, : 114 - 123
  • [3] A Cognitive Study on Semantic Similarity Analysis of Large Corpora: A Transformer-based Approach
    Nemani, Praneeth
    Vollala, Satyanarayana
    2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [4] Measurement of Semantic Textual Similarity in Clinical Texts: Comparison of Transformer-Based Models
    Yang, Xi
    He, Xing
    Zhang, Hansi
    Ma, Yinghan
    Bian, Jiang
    Wu, Yonghui
    JMIR MEDICAL INFORMATICS, 2020, 8 (11)
  • [5] Semantic-Constraint Matching for transformer-based weakly supervised object localization
    Cao, Yiwen
    Su, Yukun
    Wang, Wenjun
    Liu, Yanxia
    Wu, Qingyao
    PATTERN RECOGNITION, 2025, 158
  • [6] A Transformer-based Adaptive Prototype Matching Network for Few-Shot Semantic Segmentation
    Chen, Sihan
    Chen, Yadang
    Zheng, Yuhui
    Yang, Zhi-Xin
    Wu, Enhua
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 659 - 667
  • [7] Experimental study on short-text clustering using transformer-based semantic similarity measure
    Abdalgader, Khaled
    Matroud, Atheer A.
    Hossin, Khaled
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [8] Experimental study on short-text clustering using transformer-based semantic similarity measure
    Abdalgader K.
    Matroud A.A.
    Hossin K.
    PeerJ Computer Science, 2024, 10
  • [9] TSD-CAM: transformer-based self distillation with CAM similarity for weakly supervised semantic segmentation
    Yan, Lingyu
    Chen, Jiangfeng
    Tang, Yuanyan
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (02)
  • [10] A Semantic Guidance and Transformer-Based Matching Method for UAVs and Satellite Images for UAV Geo-Localization
    Zhuang, Jiedong
    Chen, Xuruoyan
    Dai, Ming
    Lan, Wenbo
    Cai, Yongheng
    Zheng, Enhui
    IEEE ACCESS, 2022, 10 : 34277 - 34287