dhCM: Dynamic and Hierarchical Event Categorization and Discovery for Social Media Stream

被引:1
|
作者
Guo, Jinjin [1 ]
Gong, Zhiguo [1 ]
Cao, Longbing [2 ]
机构
[1] Univ Macau, Taipa, Macao, Peoples R China
[2] Univ Technol Sydney, Sydney, NSW 2007, Australia
关键词
Hierarchical categorization; document stream; online inference; Bayesian nonparametrics; kernel estimation; event categorization; event discovery;
D O I
10.1145/3470888
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The online event discovery in social media based documents is useful, such as for disaster recognition and intervention. However, the diverse events incrementally identified from social media streams remain accumulated, ad hoc, and unstructured. They cannot assist users in digesting the tremendous amount of information and finding their interested events. Further, most of the existing work is challenged by jointly identifying incremental events and dynamically organizing them in an adaptive hierarchy. To address these problems, this article proposes dynamic and hierarchical Categorization Modeling (dhCM) for social media stream. Instead of manually dividing the timeframe, a multimodal event miner exploits a density estimation technique to continuously capture the temporal influence between documents and incrementally identify online events in textual, temporal, and spatial spaces. At the same time, an adaptive categorization hierarchy is formed to automatically organize the documents into proper categories at multiple levels of granularities. In a nonparametric manner, dhCM accommodates the increasing complexity of data streams with automatically growing the categorization hierarchy over adaptive growth. A sequential Monte Carlo algorithm is used for the online inference of the dhCM parameters. Extensive experiments show that dhCM outperforms the state-of-the-art models in terms of term coherence, category abstraction and specialization, hierarchical affinity, and event categorization and discovery accuracy.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Dynamic User Attribute Discovery on Social Media
    Huang, Xiu
    Yang, Yang
    Hu, Yue
    Shen, Fumin
    Shao, Jie
    WEB TECHNOLOGIES AND APPLICATIONS, PT I, 2016, 9931 : 256 - 267
  • [2] Spatial Event Forecasting in Social Media With Geographically Hierarchical Regularization
    Zhao, Liang
    Wang, Junxiang
    Chen, Feng
    Lu, Chang-Tien
    Ramakrishnan, Naren
    PROCEEDINGS OF THE IEEE, 2017, 105 (10) : 1953 - 1970
  • [3] Feature selection for event discovery in social media: A comparative study
    Zhao, Jie
    Wang, Xueya
    Jin, Peiquan
    COMPUTERS IN HUMAN BEHAVIOR, 2015, 51 : 903 - 909
  • [4] Modeling event stream hierarchies with hierarchical event models
    Rox, Jonas
    Ernst, Rolf
    2008 DESIGN, AUTOMATION AND TEST IN EUROPE, VOLS 1-3, 2008, : 449 - 454
  • [5] Advanced hierarchical event-stream model
    Albers, Karsten
    Bodmann, Frank
    Slomka, Frank
    ECRTS 2008: PROCEEDINGS OF THE 20TH EUROMICRO CONFERENCE ON REAL-TIME SYSTEMS, 2008, : 211 - 220
  • [6] Shade interest points for dynamic stream object categorization
    Suresh Babu, S.
    Mandhala, Venkata Naresh
    Chinnam, Siva Koteswara Rao
    Kim, Tai-Hoon
    International Journal of Multimedia and Ubiquitous Engineering, 2015, 10 (04): : 63 - 70
  • [7] METHODOLOGY FOR DYNAMIC LEARNING RESOURCES DISCOVERY AND RETRIEVAL FROM SOCIAL MEDIA
    Petreski, Hristijan
    Tsekeridou, Sofia
    Prasad, Neeli R.
    Tan, Zhen-Hua
    EDULEARN14: 6TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2014, : 2045 - 2052
  • [8] Sub-event discovery and retrieval during natural hazards on social media data
    Wu, Qunhui
    Ma, Shilong
    Liu, Yunzhen
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2016, 19 (02): : 277 - 297
  • [9] Sub-event discovery and retrieval during natural hazards on social media data
    Qunhui Wu
    Shilong Ma
    Yunzhen Liu
    World Wide Web, 2016, 19 : 277 - 297
  • [10] Dynamic hierarchical triangulation of a clustered data stream
    Skala, J.
    Kolingerova, I.
    COMPUTERS & GEOSCIENCES, 2011, 37 (08) : 1092 - 1101