Category Hierarchy Maintenance: a Data-Driven Approach

被引:0
|
作者
Yuan, Quan [1 ]
Cong, Gao [1 ]
Sun, Aixin [1 ]
Lin, Chin-Yew [2 ]
Magnenat-Thalmann, Nadia [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
Category Hierarchy; Hierarchy Maintenance; Classification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Category hierarchies often evolve at a much slower pace than the documents reside in. With newly available documents kept adding into a hierarchy, new topics emerge and documents within the same category become less topically cohesive. In this paper, we propose a novel automatic approach to modifying a given category hierarchy by redistributing its documents into more topically cohesive categories. The modification is achieved with three operations (namely, sprout, merge, and assign) with reference to an auxiliary hierarchy for additional semantic information; the auxiliary hierarchy covers a similar set of topics as the hierarchy to be modified. Our user study shows that the modified category hierarchy is semantically meaningful. As an extrinsic evaluation, we conduct experiments on document classification using real data from Yahoo! Answers and AnswerBag hierarchies, and compare the classification accuracies obtained on the original and the modified hierarchies. Our experiments show that the proposed method achieves much larger classification accuracy improvement compared with several baseline methods for hierarchy modification.
引用
收藏
页码:791 / 800
页数:10
相关论文
共 50 条
  • [31] Data-Driven Precision Implementation Approach
    Cullen, Laura
    Hanrahan, Kirsten
    Tucker, Sharon J.
    Gallagher-Ford, Lynn
    [J]. AMERICAN JOURNAL OF NURSING, 2019, 119 (08) : 60 - 63
  • [32] Curriculum Design - A Data-Driven Approach
    Chang, Jung-Kuei
    Tsao, Nai-Lung
    Kuo, Chin-Hwa
    Hsu, Hui-Huang
    [J]. PROCEEDINGS 2016 5TH IIAI INTERNATIONAL CONGRESS ON ADVANCED APPLIED INFORMATICS IIAI-AAI 2016, 2016, : 492 - 496
  • [33] Controller implementability: a data-driven approach
    Padoan, Alberto
    Coulson, Jeremy
    Dorfler, Florian
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 6098 - 6103
  • [34] A data-driven approach to nonlinear elasticity
    Nguyen, Lu Trong Khiem
    Keip, Marc-Andre
    [J]. COMPUTERS & STRUCTURES, 2018, 194 : 97 - 115
  • [35] Data-Driven Systematic Evaluation of Preventive Maintenance Performance
    Sigsgaard, Kristoffer, V
    Agergaard, Julie K.
    Mortensen, Niels-Henrik
    Soleymani, Iman
    [J]. 67TH ANNUAL RELIABILITY & MAINTAINABILITY SYMPOSIUM (RAMS 2021), 2021,
  • [36] Data-Driven Application Maintenance: Experience from the Trenches
    Misra, Janardan
    Sengupta, Shubhashis
    Rawat, Divya
    Savagaonkar, Milind
    Podder, Sanjay
    [J]. 2017 IEEE/ACM 4TH INTERNATIONAL WORKSHOP ON SOFTWARE ENGINEERING RESEARCH AND INDUSTRIAL PRACTICE (SER&IP 2017), 2017, : 48 - 54
  • [37] Saliency Aggregation: A Data-driven Approach
    Mai, Long
    Niu, Yuzhen
    Liu, Feng
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1131 - 1138
  • [38] Data-driven Development and Maintenance of Soft-Sensors
    Abonyi, Janos
    Farsang, Barbara
    Kulcsar, Tibor
    [J]. 2014 IEEE 12TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI), 2014, : 239 - 244
  • [39] A Survey on Data-Driven Predictive Maintenance for the Railway Industry
    Davari, Narjes
    Veloso, Bruno
    Costa, Gustavo de Assis
    Pereira, Pedro Mota
    Ribeiro, Rita P.
    Gama, Joao
    [J]. SENSORS, 2021, 21 (17)
  • [40] Data-driven preventive maintenance for a heterogeneous machine portfolio
    Deprez, Laurens
    Antonio, Katrien
    Arts, Joachim
    Boute, Robert
    [J]. OPERATIONS RESEARCH LETTERS, 2023, 51 (02) : 163 - 170