Open Knowledge Graphs Canonicalization using Variational Autoencoders

被引:0
|
作者
Dash, Sarthak [1 ]
Rossiello, Gaetano [1 ]
Bagchi, Sugato [1 ]
Mihindukulasooriya, Nandana [1 ]
Gliozzo, Alfio [1 ]
机构
[1] Thomas J Watson Res Ctr, IBM Res AI, Yorktown Hts, NY 10598 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Noun phrases and Relation phrases in open knowledge graphs are not canonicalized, leading to an explosion of redundant and ambiguous subject-relation-object triples. Existing approaches to solve this problem take a two-step approach. First, they generate embedding representations for both noun and relation phrases, then a clustering algorithm is used to group them using the embeddings as features. In this work, we propose Canonicalizing Using Variational Autoencoders (CUVA)1, a joint model to learn both embeddings and cluster assignments in an end-to-end approach, which leads to a better vector representation for the noun and relation phrases. Our evaluation over multiple benchmarks shows that CUVA outperforms the existing state-of-the-art approaches. Moreover, we introduce CANONICNELL, a novel dataset to evaluate entity canonicalization systems.
引用
收藏
页码:10379 / 10394
页数:16
相关论文
共 50 条
  • [1] Relation Canonicalization in Open Knowledge Graphs: A Quantitative Analysis
    Lomaeva, Maria
    Jain, Nitisha
    SEMANTIC WEB: ESWC 2022 SATELLITE EVENTS, 2022, 13384 : 21 - 25
  • [2] Towards Practical Open Knowledge Base Canonicalization
    Wu, Tien-Hsuan
    Wu, Zhiyong
    Kao, Ben
    Yin, Pengcheng
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 883 - 892
  • [3] GraphVAE: Towards Generation of Small Graphs Using Variational Autoencoders
    Simonovsky, Martin
    Komodakis, Nikos
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 412 - 422
  • [4] Joint Open Knowledge Base Canonicalization and Linking
    Liu, Yinan
    Shen, Wei
    Wang, Yuanfei
    Wang, Jianyong
    Yang, Zhenglu
    Yuan, Xiaojie
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 2253 - 2261
  • [5] Open knowledge base canonicalization with multi-task learning
    Liu, Bingchen
    Peng, Huang
    Zeng, Weixin
    Zhao, Xiang
    Liu, Shijun
    Pan, Li
    Li, Xin
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2024, 27 (05):
  • [6] Multi-View Clustering for Open Knowledge Base Canonicalization
    Shen, Wei
    Yang, Yang
    Liu, Yinan
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 1578 - 1588
  • [7] Generating Long Financial Report using Conditional Variational Autoencoders with Knowledge Distillation
    Ren, Yunpeng
    Wang, Ziao
    Wang, Yiyuan
    Zhang, Xiaofeng
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15879 - 15880
  • [8] Generating Long Financial Report Using Conditional Variational Autoencoders With Knowledge Distillation
    Wang Z.
    Ren Y.
    Zhang X.
    Wang Y.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (04): : 1669 - 1680
  • [9] Multi-level feature interaction for open knowledge base canonicalization
    Sui, Xuhui
    Zhang, Ying
    Song, Kehui
    Zhou, Baohang
    Yuan, Xiaojie
    KNOWLEDGE-BASED SYSTEMS, 2024, 303
  • [10] SPEECH DEREVERBERATION USING VARIATIONAL AUTOENCODERS
    Baby, Deepak
    Bourlard, Herve
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5784 - 5788