Trend-based Document Clustering for Sensitive and Stable Topic Detection

被引:0
|
作者
Sato, Yoshihide [1 ]
Kawashima, Harumi [2 ]
Okuda, Hidenori [2 ]
Oku, Masahiro [2 ]
机构
[1] NTT Corp, NTT West Corp, 1-1 Hikarino Oka, Yokosuka, Kanagawa 2390847, Japan
[2] NTT Corp, NTT Cyber Solut Labs, Yokosuka, Kanagawa 2390847, Japan
关键词
trend; clustering; gradient model; word frequency;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The ability to detect new topics and track them is important given the huge amounts of documents. This paper introduces a trend-based document clustering algorithm for analyzing them. Its key characteristic; is that it gives scores to words on the basis of the fluctuation in word frequency. The algorithm generates clusters in a practical time, with O(n) processing cost due to preliminary calculation of document distances. The attribute allows the user to settle on the best level of granularity for identifying topics. Experiments prove that our algorithm can gather relevant documents with F measure of 63.0% on average from the beginning to the end of topic lifetime and it largely surpasses other algorithms.
引用
收藏
页码:331 / +
页数:2
相关论文
共 50 条
  • [21] Trend-based feature selection in molecular descriptor space
    Haghighatlari, Mojtaba
    Hachmann, Johannes
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 252
  • [22] Trend-based load balancer for a distributed Web system
    Andreolini, Mauro
    Casolari, Sara
    Colajanni, Michele
    PROCEEDINGS OF MASCOTS '07: 15TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS, 2007, : 288 - 294
  • [23] Clustering Based Topic Events Detection on Text Stream
    Li, Chunshan
    Ye, Yunming
    Zhang, Xiaofeng
    Chu, Dianhui
    Deng, Shengchun
    Xu, Xiaofei
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT 1, 2014, 8397 : 42 - 52
  • [24] Topic Detection based on Group Average Hierarchical Clustering
    Gao, Ni
    Gao, Ling
    He, Yiyue
    Wang, Hai
    Sun, Qian
    2013 INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2013, : 88 - 92
  • [25] SNS TREND-BASED TV PROGRAM RECOMMENDATION SCHEME
    Kim, Daeyong
    Kim, Daehoon
    Rho, Seungmin
    Hwang, Eenjun
    ELECTRONIC PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2013,
  • [26] Topic Detection from Microblog Based on Text Clustering and Topic Model Analysis
    Huang, Siqi
    Yang, Yitao
    Li, Huakang
    Sun, Guozi
    2014 ASIA-PACIFIC SERVICES COMPUTING CONFERENCE (APSCC), 2014, : 88 - 92
  • [27] Building Topic/Trend Detection System based on Slow Intelligence
    Shih, Chia-Chun
    Peng, Ting-Chun
    16TH INTERNATIONAL CONFERENCE ON DISTRIBUTED MULTIMEDIA SYSTEMS (DMS 2010), 2010, : 53 - 56
  • [28] Validating trend-based endpoints for neuroprotection trials in glaucoma
    Montesano, Giovanni
    Garway-Heath, David
    Rabiolo, Alessandro
    Ometto, Giovanni
    Crabb, David
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2023, 64 (08)
  • [29] An Improvement of PAA on Trend-Based Approximation for Time Series
    Zhang, Chunkai
    Chen, Yingyang
    Yin, Ao
    Qin, Zhen
    Zhang, Xing
    Zhang, Keli
    Jiang, Zoe L.
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT II, 2018, 11335 : 248 - 262
  • [30] A trend-based approach for situation awareness in power systems
    Wang, Tao
    Zhang, Shang
    Gu, Xueping
    INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2017, 27 (12):