Towards Flexible Similarity Analysis of XML Data

Cited by: 1
Authors
Almendros-Jimenez, Jesus M. [1]
Cuzzocrea, Alfredo [2, 3]
Affiliations
[1] Univ Almeria, Dept Informat, Almeria, Spain
[2] Univ Trieste, DIA Dept, Trieste, Italy
[3] ICAR CNR, Trieste, Italy
DOI
10.1007/978-3-319-26138-6_61
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Supporting similarity analysis of XML data is a major problem in the data fusion research area. Several approaches have been proposed in the literature, but their lack of flexibility remains a hard challenge, especially in modern Cloud Computing environments. Motivated by this, we propose SemSynX, a novel technique for supporting similarity analysis of XML data via semantic and syntactic heterogeneity/homogeneity detection. SemSynX retrieves several similarity scores over input XML documents, thus enabling flexible management and "customization" of similarity tools over XML data. In particular, the proposed technique is highly customizable: it permits the specification of thresholds for the requested degree of similarity for paths and values, as well as for the degree of relevance for path and value matching. Selection of paths and semantics-based comparison of label content are also supported. It is thus possible to "adjust" the similarity analysis depending on the nature of the input XML documents.
Pages: 573-576
Page count: 4