EAE: Enzyme Knowledge Graph Adaptive Embedding

被引:0
|
作者
Du Z. [1 ]
Zhang Y. [1 ]
Meng X. [1 ]
Wang Q. [1 ]
机构
[1] School of Information, Renmin University of China, Beijing
来源
Meng, Xiaofeng (xfmeng@ruc.edu.cn) | 1600年 / Science Press卷 / 54期
基金
中国国家自然科学基金;
关键词
Embedding; Enzyme; Imbalance; Inhomogeneous; Nonco-mmuting; Specific-domain knowledge graph (SKG);
D O I
10.7544/issn1000-1239.2017.20170638
中图分类号
学科分类号
摘要
In recent years a drastic rise in constructing Web-scale knowledge graph (KG) has appeared and the deal with practical problems falls back on KG. Embedding learning of entities and relations has become a popular method to perform machine learning on relational data such as KG. Based on embedding representation, knowledge analysis, inference, fusion, completion and even decision-making could be promoted. Constructing and embedding open-domain knowledge graph (OKG) has mushroomed, which greatly promots the intelligentization of big data in open domain. Meanwhile, specific-domain knowledge graph (SKG) has become an important resource for smart applications in specific domain. However, SKG is developing and its embedding is still in the embryonic stage. This is mainly because there is a germination in SKG due to the difference for data distributions between OKG and SKG. More specifically: 1) In OKG, such as WordNet and Freebase, sparsity of head and tail entities are nearly equal, but in SKG, such as Enzyme KG and NCI-PID, inhomogeneous is more popular. For example, the tail entities are about 1 000 times more than head ones in the enzyme KG of microbiology area. 2) Head and tail entities can be commuted in OKG, but they are noncommuting in SKG because most of relations are attributes. For example, entity "Obama" can be a head entity or a tail entity, but the head entity "enzyme" is always in the head position in the enzyme KG. 3) Breadth of relation has a small skew in OKG while imbalance in SKG. For example, a enzyme entity can link 31 809 x-gene entities in the enzyme KG. Based on observation, we propose a novel approach EAE to deal with the 3 issues. We evaluate our approach on link prediction and triples classification tasks. Experimental results show that our approach outperforms Trans(E, H, R, D and TransSparse) significantly, and achieves state-of the-art performance. © 2017, Science Press. All right reserved.
引用
收藏
页码:2674 / 2686
页数:12
相关论文
共 31 条
  • [1] Meng X., Du Z., Research on the big data fusion: Issues and challenges, Journal of Computer Research and Development, 53, 2, pp. 231-246, (2016)
  • [2] Liu Q., Li Y., Duan H., Et al., Knowledge graph construction techniques, Journal of Computer Research and Development, 53, 3, pp. 582-600, (2016)
  • [3] Walter S., Unger C., Cimiano P., DBlexipedia: A nucleus for a multilingual lexical Semantic Web, Proc of the 3rd Int Workshop on NLP and DBpedia, Co-located with the 14th Int Semantic Web Conf (ISWC'15), pp. 87-92, (2015)
  • [4] Vrandecic D., Krotzsch M., Wikidata: A free collaborative knowledgebase, Communications of the ACM, 57, 10, pp. 78-85, (2014)
  • [5] Bollacker K., Evans C., Paritosh P., Et al., Freebase: A collaboratively created graph database for structuring human knowledge, Proc of the 2008 ACM SIGMOD Int Conf on Management of Data (SIGMOD'08), pp. 1247-1250, (2008)
  • [6] Suchanek F.M., Kasneci G., Weikum G., Yago: A core of semantic knowledge, Proc of the 16th Int Conf on World Wide Web (WWW'07), pp. 697-706, (2007)
  • [7] Wu W., Li H., Wang H., Et al., Probase: A probabilistic taxonomy for text understanding, Proc of the 2012 ACM SIGMOD Int Conf on Management of Data(SIGMOD'12), pp. 481-492, (2012)
  • [8] Dong X., Gabrilovich E., Heitz G., Et al., Knowledge vault: A Web-scale approach to probabilistic knowledge fusion, Proc of the 20th ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining(SIGKDD'14), pp. 601-610, (2014)
  • [9] Neelakantan A., Roth B., McCallum A., Compositional vector space models for knowledge base completion, Proc of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th Int Joint Conf on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL'15), pp. 156-166, (2015)
  • [10] Yang B., Yih W., He X., Et al., Embedding entities and relations for learning and inference in knowledge bases