Incremental mining of the schema of semistructured data

被引:0
|
作者
Aoying Zhou
Wen Jin
Shuigeng Zhou
Weining Qian
Zenping Tian
机构
[1] Fudan University,Department of Computer Science
关键词
data mining; incremental mining; semistructured data; schema; algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Semistructured data are specified in lack of any fixed and rigid schema, even though typically some implicit structure appears in the data. The huge amounts of on-line applications make it important and imperative to mine the schema of semistructured data, both for the users (e.g., to gather useful information and facilitate querying) and for the systems (e.g., to optimize access). The critical problem is to discover the hidden structure in the semistructured data. Current methods in extracting Web data structure are either in a general way independent of application background, or bound in some concrete environment such as HTML, XML etc. But both face the burden of expensive cost and difficulty in keeping along with the frequent and complicated variances of Web data. In this paper, the problem of incremental mining of schema for semistructured data after the update of the raw data is discussed. An algorithm for incrementally mining the schema of semistructured data is provided, and some experimental results are, also given, which show that incremental mining for semistructured data is more efficient than non-incremental mining.
引用
收藏
页码:241 / 248
页数:7
相关论文
共 50 条
  • [11] Mining schemas in semistructured data using fuzzy decision trees
    Sun, W
    Liu, DX
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2005, 3495 : 606 - 607
  • [12] Automatic wrapper system for semistructured documents based on data mining
    Rancea, I. (irina.rancea@gmail.com), 2012, (74):
  • [13] Mining is-part-of association patterns from semistructured data
    Wang, K
    Liu, HQ
    KNOWLEDGE MANAGEMENT & INTELLIGENT ENTERPRISES, 2001, : 189 - 204
  • [14] An overview of data mining in heterogeneous schema integration
    Dao, S
    Perry, B
    WESCON - 96, CONFERENCE PROCEEDINGS, 1996, : 478 - 483
  • [16] NF-SS: A Normal Form for Semistructured Schema
    Wu, XY
    Ling, TW
    Lee, SY
    Lee, ML
    Dobbie, G
    CONCEPTUAL MODELING FOR NEW INFORMATION SYSTEMS TECHNOLOGIES, 2002, 2465 : 292 - 305
  • [17] Extracting Local Schema from Semistructured Data Based on Graph-Oriented Semantic Model
    王腾蛟
    唐世渭
    杨冬青
    刘云峰
    林斌
    Journal of Computer Science and Technology, 2001, (06) : 560 - 566
  • [18] Extracting local schema from semistructured data based on graph-oriented semantic model
    Tengjiao Wang
    Shiwei Tang
    Dongqing Yang
    Yunfeng Liu
    Bin Lin
    Journal of Computer Science and Technology, 2001, 16 : 560 - 566
  • [19] Incremental schema integration for data wrangling via knowledge graphs
    Flores, Javier
    Rabbani, Kashif
    Nadal, Sergi
    Gomez, Cristina
    Romero, Oscar
    Jamin, Emmanuel
    Dasiopoulou, Stamatia
    SEMANTIC WEB, 2024, 15 (03) : 793 - 830
  • [20] Incremental Schema Mapping
    Anam, Sarawat
    Kim, Yang Sok
    Liu, Qing
    KNOWLEDGE MANAGEMENT AND ACQUISITION FOR SMART SYSTEMS AND SERVICES, PKAW 2014, 2014, 8863 : 69 - 83