Incremental mining of the schema of semistructured data

Authors
Aoying Zhou
Wen Jin
Shuigeng Zhou
Weining Qian
Zenping Tian
Affiliation
[1] Department of Computer Science, Fudan University
Keywords
data mining; incremental mining; semistructured data; schema; algorithm
DOI
Not available
Abstract
Semistructured data lack a fixed, rigid schema, even though some implicit structure typically appears in the data. The huge number of online applications makes it important, even imperative, to mine the schema of semistructured data, both for users (e.g., to gather useful information and facilitate querying) and for systems (e.g., to optimize access). The critical problem is to discover the hidden structure in the semistructured data. Current methods for extracting the structure of Web data are either general and independent of the application background, or bound to a concrete environment such as HTML or XML. Both kinds are expensive and have difficulty keeping up with the frequent and complicated changes in Web data. This paper discusses the problem of incrementally mining the schema of semistructured data after the raw data have been updated. An algorithm for incrementally mining the schema of semistructured data is provided, along with experimental results showing that incremental mining of semistructured data is more efficient than non-incremental mining.
Pages: 241-248
Number of pages: 7