Mining frequent patterns from XML data

Cited by: 0
Authors
Win, Chit Nilar [1]
Hla, Khin Haymar Saw [1]
Affiliations
[1] Univ Comp Studies, Yangon, Myanmar
Keywords
XML data; XQuery; FP-growth method;
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The web is rich with information. However, the data contained in the web is not well organized, which makes obtaining useful information from the web a difficult task. The successful development of eXtensible Markup Language (XML) as a standard to represent semistructured data makes the data contained in the web more readable, and the task of mining useful information from the web becomes feasible. XML has become very popular for representing semistructured data and is a standard for data exchange over the web. Mining XML data from the web is becoming increasingly important. Previous studies adopt an Apriori-like candidate set generation approach, but candidate set generation is still costly. We propose that extracting association rules from XML documents without any preprocessing or postprocessing using the XML query language XQuery is possible, and we analyze the XQuery implementation of the efficient FP-tree based mining method, FP-growth, for mining the complete set of frequent patterns by pattern fragment growth. FP-tree based mining adopts a pattern fragment growth method to avoid the costly generation of a large number of candidate sets, and a partition-based, divide-and-conquer method is used. The divide-and-conquer method divides the problem into a number of subproblems and conquers the subproblems by solving them recursively. If the subproblem sizes are small enough, however, it just solves the subproblems in a straightforward manner and then combines the solutions to the subproblems into the solution for the original problem. In addition, we suggest features that need to be added to XQuery in order to make the implementation of FP-growth more efficient.
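As a rough illustration of the pattern fragment growth idea described in the abstract, the following is a minimal FP-growth sketch. It is written in Python rather than XQuery and is not the authors' implementation; the names `Node`, `build_tree`, `fp_growth`, and `mine`, as well as the sample baskets, are hypothetical and introduced only for this example. Each recursive call works on a smaller conditional pattern base, which is the divide-and-conquer step, and no candidate sets are generated.

```python
from collections import defaultdict


class Node:
    """One FP-tree node (hypothetical helper for this sketch)."""

    def __init__(self, item, parent):
        self.item = item
        self.parent = parent
        self.count = 0
        self.children = {}


def build_tree(transactions, min_support):
    """Build an FP-tree from (items, count) pairs.

    Returns a header table (item -> list of nodes) and the support
    of every frequent item.
    """
    freq = defaultdict(int)
    for items, count in transactions:
        for item in set(items):
            freq[item] += count
    freq = {i: c for i, c in freq.items() if c >= min_support}

    root = Node(None, None)
    header = defaultdict(list)
    for items, count in transactions:
        # Keep frequent items only, most frequent first (ties by name).
        kept = sorted((i for i in set(items) if i in freq),
                      key=lambda i: (-freq[i], i))
        node = root
        for item in kept:
            child = node.children.get(item)
            if child is None:
                child = Node(item, node)
                node.children[item] = child
                header[item].append(child)
            child.count += count
            node = child
    return header, freq


def fp_growth(transactions, min_support, suffix=frozenset()):
    """Return {frozenset(pattern): support} by recursive pattern growth."""
    header, freq = build_tree(transactions, min_support)
    patterns = {}
    for item, nodes in header.items():
        pattern = suffix | {item}
        patterns[pattern] = freq[item]
        # Conditional pattern base: the prefix path above each node.
        base = []
        for node in nodes:
            path = []
            parent = node.parent
            while parent.item is not None:
                path.append(parent.item)
                parent = parent.parent
            if path:
                base.append((path, node.count))
        # Divide and conquer: mine the smaller conditional database.
        patterns.update(fp_growth(base, min_support, pattern))
    return patterns


def mine(baskets, min_support):
    """Convenience wrapper: every basket counts once."""
    return fp_growth([(b, 1) for b in baskets], min_support)


# Hypothetical sample data, not taken from the paper.
baskets = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"], ["a"]]
result = mine(baskets, 2)
```

With a minimum support of 2, `result` maps each frequent itemset (as a `frozenset`) to its support count; the three single items and the three pairs qualify, while the full triple does not.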
Pages: 208-212
Number of pages: 5