Open Knowledge Graphs Canonicalization using Variational Autoencoders

Cited by: 0
Authors
Dash, Sarthak [1]
Rossiello, Gaetano [1]
Bagchi, Sugato [1]
Mihindukulasooriya, Nandana [1]
Gliozzo, Alfio [1]
Affiliations
[1] Thomas J Watson Res Ctr, IBM Res AI, Yorktown Hts, NY 10598 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Noun phrases and relation phrases in open knowledge graphs are not canonicalized, leading to an explosion of redundant and ambiguous subject-relation-object triples. Existing approaches address this problem in two steps: first, they generate embedding representations for both noun and relation phrases; then, a clustering algorithm groups the phrases using the embeddings as features. In this work, we propose Canonicalizing Using Variational Autoencoders (CUVA), a joint model that learns both embeddings and cluster assignments end to end, which leads to better vector representations for the noun and relation phrases. Our evaluation over multiple benchmarks shows that CUVA outperforms the existing state-of-the-art approaches. Moreover, we introduce CANONICNELL, a novel dataset for evaluating entity canonicalization systems.
Pages: 10379-10394
Number of pages: 16
Related papers
50 in total
  • [11] Energy disaggregation using variational autoencoders
    Langevin, Antoine
    Carbonneau, Marc-Andre
    Cheriet, Mohamed
    Gagnon, Ghyslain
    ENERGY AND BUILDINGS, 2022, 254
  • [12] MULCE: Multi-level Canonicalization with Embeddings of Open Knowledge Bases
    Wu, Tien-Hsuan
    Kao, Ben
    Wu, Zhiyong
    Feng, Xiyang
    Song, Qianli
    Chen, Cheng
    WEB INFORMATION SYSTEMS ENGINEERING, WISE 2020, PT I, 2020, 12342 : 315 - 327
  • [13] Canonicalization of Open Knowledge Bases with Side Information from the Source Text
    Lin, Xueling
    Chen, Lei
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 950 - 961
  • [14] Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders
    Ma, Tengfei
    Chen, Jie
    Xiao, Cao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [15] Open-Set Recognition with Gaussian Mixture Variational Autoencoders
    Cao, Alexander
    Luo, Yuan
    Klabjan, Diego
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 6877 - 6884
  • [16] Blind Channel Equalization using Variational Autoencoders
    Caciularu, Avi
    Burshtein, David
    2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2018,
  • [17] SRVAE: Super Resolution using Variational Autoencoders
    Heydari, A. Ali
    Mehmood, Asif
    PATTERN RECOGNITION AND TRACKING XXXI, 2020, 11400
  • [18] DoS and DDoS mitigation using Variational Autoencoders
    Bårli, Eirik Molde
    Yazidi, Anis
    Viedma, Enrique Herrera
    Haugerud, Hårek
    Computer Networks, 2021, 199
  • [19] Modelling urban networks using Variational Autoencoders
    Kempinska, Kira
    Murcio, Roberto
    APPLIED NETWORK SCIENCE, 2019, 4 (01)