Clustering DTDs: An interactive two-level approach

被引:0
|
作者
Aoying Zhou
Weining Qian
Hailei Qian
Long Zhang
Yuqi Liang
Wen Jin
机构
[1] Fudan University,Department of Computer Science, Laboratory for Intelligent Information Processing
[2] Simon Fraser University,Department of Computer Science
关键词
clustering, XML (eXtensible Markup Language); DTD (Document Type Definition);
D O I
暂无
中图分类号
学科分类号
摘要
XML (eXtensible Markup Language) is a standard which is widely applied in data representation and data exchange. However, as an important concept of XML, DTD (Document Type Definition) is not taken full advantage in current applications. In this paper, a new method for clustering DTDs is presented, and it can be used in XML document clustering. The two-level method clusters the elements in DTDs and clusters DTDs separately. Element clustering forms the first level and provides element clusters, which are the generalization of relevant elements. DTD clustering utilizes the generalized information and forms the second level in the whole clustering process. The two-level method has the following advantages: 1) It takes into consideration both the content and the structure within DTDs; 2) The generalized information about elements is more useful than the separated words in the vector model; 3) The two-level method facilitates the searching of outliers. The experiments show that this method is able to categorize the relevant DTDs effectively.
引用
收藏
页码:807 / 819
页数:12
相关论文
共 50 条
  • [1] Clustering DTDs: An interactive two-level approach
    Zhou, AY
    Qian, WI
    Qian, HL
    Zhang, L
    Liang, YQ
    Jin, W
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2002, 17 (06) : 807 - 819
  • [2] A two-level method for clustering DTDs
    Qian, WN
    Zhang, L
    Liang, YQ
    Qian, HL
    Jin, W
    [J]. WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2000, 1846 : 41 - 52
  • [3] Interactive two-level multiobjective decision method
    Xia, Hongsheng
    [J]. Chinese Journal of Systems Engineering and Electronics, 1994, 5 (04):
  • [4] An Interactive Two-Level Multiobjective Decision Method
    Xia Hongsheng
    [J]. Journal of Systems Engineering and Electronics, 1994, (04) : 50 - 56
  • [5] Interactive Two-Level WEBSOM for Organizational Exploration
    Honkela, Timo
    Knapek, Michael
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2013, 2013, 8131 : 579 - 585
  • [6] A two-level clustering approach for multidimensional transfer function specification in volume visualization
    Cai, Lile
    Nguyen, Binh P.
    Chui, Chee-Kong
    Ong, Sim-Heng
    [J]. VISUAL COMPUTER, 2017, 33 (02): : 163 - 177
  • [7] A New Two-Level Clustering Approach for Situations Management in Distributed Smart Environments
    Mounir, Achouri
    Adel, Alti
    Makhlouf, Derdour
    Sebastien, Laborie
    Philippe, Roose
    [J]. INTERNATIONAL JOURNAL OF AMBIENT COMPUTING AND INTELLIGENCE, 2019, 10 (02) : 91 - 111
  • [8] A two-level clustering approach for multidimensional transfer function specification in volume visualization
    Lile Cai
    Binh P. Nguyen
    Chee-Kong Chui
    Sim-Heng Ong
    [J]. The Visual Computer, 2017, 33 : 163 - 177
  • [9] Two-level Hierarchical Clustering Analysis and Application
    HU Hui-rong
    [J]. 厦门大学学报(自然科学版), 2002, (S1) : 283 - 284
  • [10] A Two-Level Keyphrase Extraction Approach
    Ali, Chedi Bechikh
    Wang, Rui
    Haddad, Hatem
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT II, 2015, 9042 : 390 - 401