Incremental mining of the schema of semistructured data

被引:0
|
作者
Aoying Zhou
Wen Jin
Shuigeng Zhou
Weining Qian
Zenping Tian
机构
[1] Fudan University,Department of Computer Science
关键词
data mining; incremental mining; semistructured data; schema; algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Semistructured data are specified in lack of any fixed and rigid schema, even though typically some implicit structure appears in the data. The huge amounts of on-line applications make it important and imperative to mine the schema of semistructured data, both for the users (e.g., to gather useful information and facilitate querying) and for the systems (e.g., to optimize access). The critical problem is to discover the hidden structure in the semistructured data. Current methods in extracting Web data structure are either in a general way independent of application background, or bound in some concrete environment such as HTML, XML etc. But both face the burden of expensive cost and difficulty in keeping along with the frequent and complicated variances of Web data. In this paper, the problem of incremental mining of schema for semistructured data after the update of the raw data is discussed. An algorithm for incrementally mining the schema of semistructured data is provided, and some experimental results are, also given, which show that incremental mining for semistructured data is more efficient than non-incremental mining.
引用
收藏
页码:241 / 248
页数:7
相关论文
共 50 条
  • [21] Incremental schema mapping
    Anam, Sarawat
    Kim, Yang Sok
    Liu, Qing
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8863 : 69 - 83
  • [22] Extracting local schema from semistructured data based on graph-oriented semantic model
    Wang, TJ
    Tang, SW
    Yang, DQ
    Liu, YF
    Lin, B
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2001, 16 (06) : 560 - 566
  • [23] Mining Schema Knowledge from Linked Data on the Web
    Mehri, Razieh
    Valtchev, Petko
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2017): 10TH INTERNATIONAL CONFERENCE, KSEM 2017, MELBOURNE, VIC, AUSTRALIA, AUGUST 19-20, 2017, PROCEEDINGS, 2017, 10412 : 261 - 273
  • [24] Complexity and a method of extracting a database schema over semistructured documents
    Suzuki, N
    Sato, Y
    Hayase, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2002, E85D (06) : 940 - 949
  • [25] Mining Backbone Literals in Incremental SAT A New Kind of Incremental Data
    Ivrii, Alexander
    Ryvchin, Vadim
    Strichman, Ofer
    THEORY AND APPLICATIONS OF SATISFIABILITY TESTING - SAT 2015, 2015, 9340 : 88 - 103
  • [26] Incremental concept learning for bounded data mining
    Case, J
    Jain, S
    Lange, S
    Zeugmann, T
    INFORMATION AND COMPUTATION, 1999, 152 (01) : 74 - 110
  • [27] Classical and Incremental Classification in Data Mining Process
    Al-Hegami, Ahmed Sultan
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (12): : 179 - 187
  • [28] A novel incremental approach for stream data mining
    Aboalsamh, Hatim A.
    AEJ - Alexandria Engineering Journal, 2009, 48 (04): : 419 - 426
  • [29] Constraint based filtering for Incremental Data Mining
    Borah, Malaya Dutta
    Jindal, Rajni
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 801 - 806
  • [30] The interestingness and robustness of knowledge in incremental data mining
    Wang, LH
    Zhang, BF
    Wu, GF
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 1203 - 1206