Discovering event episodes from sequences of online news articles: A time-adjoining frequent itemset-based clustering method

被引:4
|
作者
Lee, Yen-Hsien [1 ]
Hu, Paul Jen-Hwa [2 ]
Zhu, Hongquan [3 ]
Chen, Hsin-Wei [4 ]
机构
[1] Natl Chiayi Univ, Dept Management Informat Syst, Chiayi, Taiwan
[2] Univ Utah, Dept Operat & Informat Syst, David Eccles Sch Business, Salt Lake City, UT 84112 USA
[3] Southwest Jiaotong Univ, Dept Finance, Sch Econ & Management, Chengdu, Peoples R China
[4] AdvancedTEK Int Corp, Taipei, Taiwan
关键词
Event episode discovery; Retrospective event detection; Event evolution; Temporal frequent itemset-based clustering; DECISION-SUPPORT; INFORMATION; TRENDS;
D O I
10.1016/j.im.2020.103348
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Firms perform environmental surveillance to identify important events and their developments. To alleviate the stringent information processing and analysis requirements, automated methods are needed to discover from online news articles distinct episodes (stages) of an important event. We propose a time-adjoining frequent itemset-based method that incorporates essential temporal characteristics of news articles for event episode discovery. With a corpus of 1468 news articles that pertain to 248 episodes of 53 different events, we empirically evaluate the proposed method and include several prevalent techniques as benchmarks. The results show that our method outperforms the benchmark techniques consistently and significantly, attaining the cluster recall, cluster precision, and F-measure values at 0.706, 0.593, and 0.584, respectively.
引用
收藏
页数:13
相关论文
共 5 条
  • [1] A text summary-based method to detect new events from streams of online news articles
    Lee, Yen-Hsien
    Wei, Chih-Ping
    Hu, Paul Jen-Hwa
    Wu, Pao-Feng
    Jiang, How
    INFORMATION & MANAGEMENT, 2022, 59 (06)
  • [2] HCP miner: an efficient heuristic-based clustering method for discovering colossal frequent patterns from high dimensional databases
    Reddy, T. Sreenivasula
    Sathya, R.
    Nuka, Mallikhanjuna Rao
    INTERNATIONAL JOURNAL OF ENGINEERING SYSTEMS MODELLING AND SIMULATION, 2023, 14 (04) : 186 - 196
  • [3] A Grid-Based Clustering Method For Mining Frequent Trips From Large-Scale, Event-Based Telematics Datasets
    Cao, Qing
    Bouqata, Bouchra
    Mackenzie, Patricia D.
    Messier, Daniel
    Salvo, Josheph J.
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2996 - 3001
  • [4] Discovering e-commerce user groups from online comments: An emotional correlation analysis-based clustering method
    Ke, Jia
    Wang, Ying
    Fan, Mingyue
    Chen, Xiaojun
    Zhang, Wenlong
    Gou, Jianping
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 113
  • [5] Enhancing COVID-19 Epidemic Forecasting Accuracy by Combining Real-time and Historical Data From Multiple Internet-Based Sources: Analysis of Social Media Data, Online News Articles, and Search Queries
    Li, Jingwei
    Huang, Wei
    Sia, Choon Ling
    Chen, Zhuo
    Wu, Tailai
    Wang, Qingnan
    JMIR PUBLIC HEALTH AND SURVEILLANCE, 2022, 8 (06):