Automatic maintenance of category hierarchy

被引:9
|
作者
Hai Zhuge [1 ]
Lei He
机构
[1] Univ Chinese Acad Sci, Chinese Acad Sci, Key Lab Intelligent Informat Proc, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
Category; Category hierarchy; Classification; Clustering; Maintenance; Resource Space Model; MODEL;
D O I
10.1016/j.future.2016.06.038
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Category hierarchy is an abstraction mechanism for efficiently managing large-scale resources. In an open environment, a category hierarchy will inevitably become inappropriate for managing resources that constantly change with unpredictable pattern. An inappropriate category hierarchy will mislead the management of resources. The increasing dynamicity and scale of online resources increase the requirement of automatically maintaining category hierarchy. Previous studies about category hierarchy mainly focus on either the generation of category hierarchy or the classification of resources under a pre-defined category hierarchy. The automatic maintenance of category hierarchy has been neglected. Making abstraction among categories and measuring the similarity between categories are two basic behaviours to generate a category hierarchy. Humans are good at making abstraction but limited in ability to calculate the similarities between large-scale resources. Computing models are good at calculating the similarities between large-scale resources but limited in ability to make abstraction. To take both advantages of human view and computing ability, this paper proposes a two-phase approach to automatically maintaining category hierarchy within two scales by detecting the internal pattern change of categories. The global phase clusters resources to generate a reference category hierarchy and gets similarity between categories to detect inappropriate categories in the initial category hierarchy. The accuracy of the clustering approaches in generating category hierarchy determines the rationality of the global maintenance. The local phase detects topical changes and then adjusts inappropriate categories with three local operations. The global phase can quickly target inappropriate categories top-down and carry out cross-branch adjustment, which can also accelerate the local-phase adjustments. The local phase detects and adjusts the local-range inappropriate categories that are not adjusted in the global phase. By incorporating the two complementary phase adjustments, the approach can significantly improve the topical cohesion and accuracy of category hierarchy. A new measure is proposed for evaluating category hierarchy considering not only the balance of the hierarchical structure but also the accuracy of classification. Experiments show that the proposed approach is feasible and effective to adjust inappropriate category hierarchy. The proposed approach can be used to maintain the category hierarchy for managing various resources in dynamic application environment. It also provides an approach to specialize the current online category hierarchy to organize resources with more specific categories. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [1] Automatic Maintenance of the Category Hierarchy
    He, Lei
    Sun, Xiaoping
    [J]. 2013 NINTH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2013, : 218 - 221
  • [2] Category Hierarchy Maintenance: a Data-Driven Approach
    Yuan, Quan
    Cong, Gao
    Sun, Aixin
    Lin, Chin-Yew
    Magnenat-Thalmann, Nadia
    [J]. SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 791 - 800
  • [3] Automatic category theme identification and hierarchy generation for Chinese text categorization
    Yang, HC
    Lee, CH
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2005, 25 (01) : 47 - 67
  • [4] Automatic Category Theme Identification and Hierarchy Generation for Chinese Text Categorization
    Hsin-Chang Yang
    Chung-Hong Lee
    [J]. Journal of Intelligent Information Systems, 2005, 25 : 47 - 67
  • [5] Automatic E-mail Classification Using Dynamic Category Hierarchy and Semantic Features
    Park, Sun
    An, Dong Un
    [J]. IETE TECHNICAL REVIEW, 2010, 27 (06) : 478 - 492
  • [6] Venn and the art of category maintenance
    Suits, B
    [J]. JOURNAL OF THE PHILOSOPHY OF SPORT, 2004, 31 (01) : 1 - 14
  • [7] Generating category hierarchy for classifying large corpora
    Fukumoto, F
    Suzuki, Y
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (04) : 1543 - 1554
  • [8] A Mental Model Approach for Category Hierarchy Maintenance on Sellers' Self-input Items in E-commerce Websites
    Wu, Peng
    He, Daqing
    Song, Jiang
    [J]. 2016 11TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2016,
  • [9] AUTOMATIC MAINTENANCE
    SHATTOW, M
    [J]. DATAMATION, 1990, 36 (09): : 14 - 14
  • [10] Automatic memory hierarchy characterization
    Coleman, CL
    Davidson, JW
    [J]. ISPASS: 2001 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2001, : 103 - 110