Experiments in term weighting for novelty mining

被引:10
|
作者
Tsai, Flora S. [1 ]
Kwee, Agus T. [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
关键词
Novelty mining; Novelty detection; Term weighting; Binary; Term frequency; Inverse document frequency; Threshold; Novelty dataset; SENTENCE; METRICS;
D O I
10.1016/j.eswa.2011.04.218
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Obtaining new information in a short time is becoming crucial in today's economy. A lot of information both offline or online is easily acquired, exacerbating the problem of information overload. Novelty mining detects documents/sentences that contain novel or new information and presents those results directly to users (Tang, Tsai, & Chen, 2010). Many methods and algorithms for novelty mining have previously been studied, but none have compared and discussed the impact of term weighting on the evaluation measures. This paper performed experiments to recommend the best term weighting function for both document and sentence-level novelty mining. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:14094 / 14101
页数:8
相关论文
共 50 条
  • [1] Novelty and Primacy: A Long Term Estimator for Online Experiments
    Sadeghi, Soheil
    Gupta, Somit
    Gramatovici, Stefan
    Lu, Jiannan
    Ai, Hao
    Zhang, Ruhan
    TECHNOMETRICS, 2022, 64 (04) : 524 - 534
  • [2] BursT: A Dynamic Term Weighting Scheme for Mining Microblogging Messages
    Lee, Chung-Hong
    Wu, Chih-Hong
    Chien, Tzan-Feng
    ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT III, 2011, 6677 : 548 - 557
  • [3] Information services for novelty mining
    Tsai, Flora S.
    Kwee, Agus T.
    KNOWLEDGE ENGINEERING REVIEW, 2014, 29 (02): : 234 - 247
  • [4] Chinese Categorization and Novelty Mining
    Tsai, Flora S.
    Zhang, Yi
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6635 : 284 - 295
  • [5] Evaluation of novelty metrics for sentence-level novelty mining
    Tsai, Flora S.
    Tang, Wenyin
    Chan, Kap Luk
    INFORMATION SCIENCES, 2010, 180 (12) : 2359 - 2374
  • [6] Novelty and the 1919 Eclipse Experiments
    Hudson, RG
    STUDIES IN HISTORY AND PHILOSOPHY OF MODERN PHYSICS, 2003, 34B (01): : 107 - 129
  • [7] Mining dynamic databases by weighting
    Zhang, Shichao
    Liu, Li
    Acta Cybernetica, 2003, 16 (01): : 179 - 205
  • [8] Multilingual sentence categorization and novelty mining
    Zhang, Yi
    Tsai, Flora S.
    Kwee, Agus Trisnajaya
    INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (05) : 667 - 675
  • [9] Redundancy and novelty mining in the business blogosphere
    Tsai, Flora S.
    Chan, Kap Luk
    LEARNING ORGANIZATION, 2010, 17 (06): : 490 - +
  • [10] PPNW: personalized pairwise novelty loss weighting for novel recommendation
    Lo, Kachun
    Ishigaki, Tsukasa
    KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (05) : 1117 - 1148