Similarity Algorithm Based on Weighted Hierarchical Structure of XML Document

被引:0
|
作者
Sun, Xia [1 ]
Cheng, Hong-Bin [1 ]
Wang, Hai-Jun [2 ]
机构
[1] Changshu Inst Technol, Sch Comp Sci, Changshu, Jiangsu, Peoples R China
[2] Hubei Univ Educ, Dept Comp Sci, Wuhan, Hubei, Peoples R China
关键词
XML; similarity; hierarchical structure;
D O I
10.1109/ICIE.2009.78
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A similarity algorithm based on weighted hierarchical structure of MM.. document is brought forward. The algorithm can calculate the similarity among XML documents efficiently according to hierarchical structure. It can be powerful enough to distinguish the similar structural documents. Experimental results prove that the algorithm reduces the complexity and has fairly high performance. The approach presented in this paper can be used in many applications, such as clustering, structural extracting and change checking of XML documents, etc.
引用
收藏
页码:143 / +
页数:2
相关论文
共 50 条
  • [1] On the use of hierarchical information in sequential mining-based XML document similarity computation
    Leung, HP
    Chung, FL
    Chan, SCF
    KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 7 (04) : 476 - 498
  • [2] On the use of hierarchical information in sequential mining-based XML document similarity computation
    Ho-pong Leung
    Fu-lai Chung
    Stephen Chi-fai Chan
    Knowledge and Information Systems, 2005, 7 : 476 - 498
  • [3] XML document similarity measure in terms of the structure and contents
    Kim, Woosaeng
    PROCEEDINGS OF THE 2ND WSEAS INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: MODERN TOPICS OF COMPUTER SCIENCE, 2008, : 205 - 212
  • [4] Approximate XML structure validation based on document-grammar tree similarity
    Tekli, Joe
    Chbeir, Richard
    Traina, Agma J. M.
    Traina, Caetano, Jr.
    Fileto, Renato
    INFORMATION SCIENCES, 2015, 295 : 258 - 302
  • [5] Semantic-based similarity computation for XML document
    Song, In-sang
    Paik, Ju-ryun
    Kim, Ung-mo
    MUE: 2007 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2007, : 796 - +
  • [6] An Improved Algorithm of Similarity Based on Clustering in XML
    Wang, Puqing
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 837 - 841
  • [7] An Improved Cosine Similarity Algorithm Based on Document Similarity
    Lee, Ming
    Zhao, Heji
    INTERNATIONAL SYMPOSIUM ON FUZZY SYSTEMS, KNOWLEDGE DISCOVERY AND NATURAL COMPUTATION (FSKDNC 2014), 2014, : 196 - 204
  • [8] Estimation of Structural Similarity of XML Document Based on Frequency and Path
    Ren Xueli
    Dai Yubiao
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT, COMPUTER AND SOCIETY, 2016, 37 : 272 - 275
  • [9] Hierarchical Document Clustering based on Cosine Similarity measure
    Popat, Shraddha K.
    Deshmukh, Pramod B.
    Metre, Vishakha A.
    2017 1ST INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND INFORMATION MANAGEMENT (ICISIM), 2017, : 153 - 159
  • [10] Structure Based XML Document Clustering: A Review
    Thulasi, A.
    Remya, K. T. V.
    Raju, G.
    2017 INTERNATIONAL CONFERENCE ON INFOCOM TECHNOLOGIES AND UNMANNED SYSTEMS (TRENDS AND FUTURE DIRECTIONS) (ICTUS), 2017, : 543 - 547