Path bitmap indexing for retrieval of XML documents

Cited by: 0
Authors
Lee, Jae-Min [1]
Hwang, Byung-Yeon [1]
Affiliation
[1] Catholic Univ, Dept Comp Engn, Seoul, South Korea
Source
MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE | 2006 / Vol. 3885
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Path-based indexing methods, such as three-dimensional bitmap indexing, have been used to collect and retrieve similar XML documents. In these methods, the path is the fundamental unit for constructing the index. When the document structure changes, the path extracted before the change and the path extracted after it are treated as entirely different, regardless of the degree of change. As a result, path-based indexing methods usually perform poorly when retrieving and clustering similar documents. A method that can detect similar paths is therefore needed for effective collection and retrieval of XML documents. In this paper, a new path construction similarity, which measures the similarity between paths, is defined, and a path bitmap indexing method is proposed to load and extract similar paths effectively. The proposed method extracts a representative path from each group of similar paths. The paths are clustered using the representative paths, and the XML documents are in turn clustered using the clustered paths. This solves the problem of the existing three-dimensional bitmap indexing. In the performance evaluation, the proposed method showed better clustering accuracy than existing methods.
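The idea in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the formula for the path construction similarity, so the measure below (longest common subsequence of element names divided by the longer path length) is only an assumed stand-in. The greedy clustering, the threshold value, and the names `path_similarity`, `cluster_paths`, and `build_bitmap` are likewise hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Tuple

Path = Tuple[str, ...]  # an XML path as a tuple of element names


def lcs_length(a: Path, b: Path) -> int:
    """Length of the longest common subsequence of element names."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def path_similarity(a: Path, b: Path) -> float:
    """Assumed stand-in measure: LCS length over the longer path length."""
    if not a or not b:
        return 0.0
    return lcs_length(a, b) / max(len(a), len(b))


def cluster_paths(paths: List[Path], threshold: float) -> Dict[Path, Path]:
    """Greedy clustering: map each path to the first representative it resembles."""
    representatives: List[Path] = []
    assignment: Dict[Path, Path] = {}
    for p in paths:
        for rep in representatives:
            if path_similarity(p, rep) >= threshold:
                assignment[p] = rep
                break
        else:
            # No existing representative is close enough: p starts a new cluster.
            representatives.append(p)
            assignment[p] = p
    return assignment


def build_bitmap(docs: Dict[str, List[Path]], threshold: float = 0.7):
    """Return (representatives, {doc_id: bit row}), one bit per representative path."""
    all_paths: List[Path] = []
    seen = set()
    for paths in docs.values():
        for p in paths:
            if p not in seen:
                seen.add(p)
                all_paths.append(p)
    assignment = cluster_paths(all_paths, threshold)
    reps = list(dict.fromkeys(assignment.values()))  # keep first-seen order
    column = {rep: i for i, rep in enumerate(reps)}
    bitmap = {}
    for doc_id, paths in docs.items():
        row = [0] * len(reps)
        for p in paths:
            row[column[assignment[p]]] = 1
        bitmap[doc_id] = row
    return reps, bitmap
```

Under these assumptions, a document whose path is `book/chapter/title` and a document whose path has changed to `book/part/chapter/title` set the same bit, because the two paths share one representative. An index keyed on exact paths would treat them as unrelated.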
Pages: 329-339
Number of pages: 11