Semantic integration of tree-structured data using dimension graphs

被引:0
|
作者
Dalamagas, T [1 ]
Theodoratos, D
Koufopoulos, A
Liu, IT
机构
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, GR-15773 Zografos, Greece
[2] New Jersey Inst Technol, Dept Comp Sci, Newark, NJ 07102 USA
来源
JOURNAL ON DATA SEMANTICS IV | 2005年 / 3730卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, huge volumes of Web data are organized or exported in tree-structured form. Popular examples of such structures are product catalogs of e-market stores, taxonomies of thematic categories, XML data encodings, etc. Even for a single knowledge domain, name mismatches, structural differences and structural inconsistencies raise difficulties when many data sources need to be integrated and queried in a uniform way. In this paper, we present a method for semantically integrating tree-structured data. We introduce dimensions which are sets of semantically related nodes in tree structures. Based on dimensions, we suggest dimension graphs. Dimension graphs can be automatically extracted from trees and abstract their structural information. They are semantically rich constructs that provide query guidance to pose queries, assist query evaluation and support integration of tree-structured data. We design a query language to query tree-structured data. The language allows full, partial or no specification of the structure of the underlying tree-structured data used to issue queries. Thus, queries in our language are not restricted by the structure of the trees. We provide necessary and sufficient conditions for checking query satisfiability and we present a technique for evaluating satisfiable queries. Finally, we conducted several experiments to compare our method for integrating tree-structured data with one that does not exploit dimension graphs. Our results demonstrate the superiority of our approach.
引用
收藏
页码:250 / 279
页数:30
相关论文
共 50 条
  • [31] Exploiting Common Patterns for Tree-Structured Data
    Wang, Zhiyi
    Chen, Shimin
    SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 883 - 896
  • [32] Tree-structured Curriculum Learning based on Semantic Similarity of Text
    Han, Sanggyu
    Myaeng, Sung-Hyon
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 971 - 976
  • [33] Bloom Filter Tree for Fast Search of Tree-Structured Data
    Wang, Mengyu
    Zhu, Ying
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2015, : 18 - 23
  • [34] Efficient Model Evidence Computation in Tree-structured Factor Graphs
    Nguyen, Hoang M. H.
    Van Erp, Bart
    Senoz, Ismail
    de Vries, Bert
    2022 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2022, : 126 - 131
  • [35] SAR image data compression using a tree-structured wavelet transform
    Zeng, ZH
    Cumming, IG
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2001, 39 (03): : 546 - 552
  • [36] TREE-STRUCTURED PROGRAMS
    WEGNER, E
    COMMUNICATIONS OF THE ACM, 1973, 16 (11) : 704 - 705
  • [37] Tree-structured classifiers
    Loh, Wei-Yin
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (03): : 364 - 369
  • [38] Simple and efficient Hash sketching for tree-structured data
    Wu, Wei
    Jiang, Mi
    Luo, Chuan
    Li, Fangfang
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 267
  • [39] STEED: An Analytical Database System for TrEE-structured Data
    Wang, Zhiyi
    Zhou, Dongyan
    Chen, Shimin
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (12): : 1897 - 1900
  • [40] Scaling Similarity Joins over Tree-Structured Data
    Tang, Yu
    Cai, Yilun
    Mamoulis, Nikos
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (11): : 1130 - 1141