SMAT: An Attention-Based Deep Learning Solution to the Automation of Schema Matching

Cited by: 4
Authors
Zhang, Jing [1]
Shin, Bonggun [2]
Choi, Jinho D. [1]
Ho, Joyce C. [1]
Affiliations
[1] Emory Univ, Atlanta, GA 30329 USA
[2] Deargen Inc, Seoul, South Korea
Funding
U.S. National Science Foundation
Keywords
Schema-level matching; Natural language processing; Attention over attention
DOI
10.1007/978-3-030-82472-3_19
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Schema matching aims to identify correspondences among the attributes of database schemas. It is frequently considered the most challenging and decisive stage in many contemporary web semantics and database systems. Low-quality algorithmic matchers fail to provide improvement, while manual annotation consumes extensive human effort. Further complications arise from data privacy in certain domains such as healthcare, where only schema-level matching should be used to prevent data leakage. To address this problem, we propose SMAT, a new deep learning model based on state-of-the-art natural language processing techniques that obtains semantic mappings between source and target schemas using only attribute names and descriptions. SMAT avoids directly encoding domain knowledge about the source and target systems, which allows it to be deployed more easily across different sites. We also introduce a new benchmark dataset, OMAP, based on real-world schema-level mappings from the healthcare domain. Our extensive evaluation on various benchmark datasets demonstrates the potential of SMAT to help automate schema-level matching tasks.
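To make the task in the abstract concrete, the following is a minimal sketch of schema-level matching, where each attribute is represented only by its name and description and no row data is read. It is not SMAT: the attention-based model is swapped for a naive token-overlap (Jaccard) baseline purely to show the input and output shape. The `Attribute` class, the `match_schemas` helper, the 0.25 threshold, and the sample attributes are all hypothetical and do not come from the paper or from OMAP.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Attribute:
    """One schema attribute, described only by schema-level text."""
    table: str
    name: str
    description: str


def tokens(attr: Attribute) -> set[str]:
    """Lowercased word tokens from the attribute name and description."""
    text = f"{attr.name} {attr.description}".lower()
    for ch in "_-.,;:()/":
        text = text.replace(ch, " ")
    return set(text.split())


def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard overlap of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def match_schemas(source, target, threshold=0.25):
    """Pair each source attribute with its best-scoring target attribute.

    A pair is kept only if its score reaches the (hypothetical) threshold.
    """
    matches = []
    for s in source:
        best = max(target, key=lambda t: jaccard(tokens(s), tokens(t)))
        score = jaccard(tokens(s), tokens(best))
        if score >= threshold:
            matches.append((s, best, score))
    return matches


# Hypothetical source and target schemas, invented for illustration only.
source = [
    Attribute("patients", "dob", "date of birth of the patient"),
    Attribute("patients", "gender", "gender of the patient"),
]
target = [
    Attribute("person", "birth_datetime", "date and time of birth of the person"),
    Attribute("person", "gender_concept_id",
              "concept identifier for the gender of the person"),
    Attribute("visit_occurrence", "visit_start_date", "start date of the visit"),
]

matches = match_schemas(source, target)
for s, t, score in matches:
    print(f"{s.table}.{s.name} -> {t.table}.{t.name} (score={score:.2f})")
```

In this toy data, common words such as "of" and "the" contribute much of the overlap, which hints at why purely lexical matchers are fragile and why a learned semantic model is attractive for this task.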
Pages: 260-274
Page count: 15