A New Social Media Topic Mining Method Based on Co-word Network

Cited by: 0
Authors
Wang Y. [1, 2, 3]
Fu X. [1]
Li M. [1]
Affiliations
[1] State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan
[2] Collaborative Innovation Center of Geospatial Technology, Wuhan
[3] Faculty of Geomatics, East China University of Technology, Nanchang
Funding
National Natural Science Foundation of China
Keywords
Co-word network; Louvain community detection; Social media; Topic mining
DOI
10.13203/j.whugis20180225
Abstract
In-depth mining of the text data in social media supports efficient spatiotemporal analysis. This paper proposes a new social media topic mining method based on co-word networks and community detection. The method uses term frequency-inverse document frequency (TF-IDF) analysis to identify the key words of each message automatically. Based on whether two microblogs share key words, we introduce the microblog co-word network, in which each microblog is a node. The Louvain community detection algorithm is then applied to this network to group the microblogs into topic clusters. The method is unsupervised, and its advantage is that the number of clusters does not need to be specified in advance. Experiments demonstrate that the proposed method outperforms the commonly used latent Dirichlet allocation (LDA) model in both precision and recall. Taking microblogs collected during the 2012 Beijing rainstorm as a case study, the method is used for in-depth mining and spatiotemporal analysis of the microblog dataset. The results demonstrate that the proposed method is effective in real-world applications. © 2018, Research and Development Office of Wuhan University. All rights reserved.
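The pipeline described in the abstract (TF-IDF key words, a co-word network with one node per microblog, then Louvain community detection) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the whitespace tokenizer, the `top_k` and `min_df` parameters, the edge weight (number of shared key words), and the toy English messages are all hypothetical, and the paper's Chinese microblogs would need a proper word segmenter. Louvain is taken from `networkx.community.louvain_communities`, which requires a recent NetworkX release.

```python
import math
from collections import Counter
from itertools import combinations

import networkx as nx


def tfidf_keywords(token_lists, top_k=3, min_df=2):
    """Return one set of key words per message, ranked by TF-IDF."""
    n_docs = len(token_lists)
    # Document frequency: number of messages that contain each term.
    df = Counter(term for tokens in token_lists for term in set(tokens))
    keywords = []
    for tokens in token_lists:
        tf = Counter(tokens)
        scores = {
            term: (count / len(tokens)) * math.log(n_docs / df[term])
            for term, count in tf.items()
            # Assumption: a term found in a single message cannot link
            # messages, so it is skipped (hypothetical min_df filter).
            if df[term] >= min_df
        }
        # Highest score first; ties broken alphabetically for determinism.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        keywords.append(set(ranked[:top_k]))
    return keywords


def build_coword_network(keywords):
    """Nodes are messages; an edge joins two messages that share key words."""
    graph = nx.Graph()
    graph.add_nodes_from(range(len(keywords)))
    for i, j in combinations(range(len(keywords)), 2):
        shared = keywords[i] & keywords[j]
        if shared:
            # Assumption: weight the edge by the number of shared key words.
            graph.add_edge(i, j, weight=len(shared))
    return graph


def cluster_microblogs(messages, top_k=3, seed=0):
    """Group message indices into topic clusters with Louvain."""
    # Assumption: lowercase whitespace tokenization stands in for segmentation.
    token_lists = [m.lower().split() for m in messages]
    graph = build_coword_network(tfidf_keywords(token_lists, top_k=top_k))
    communities = nx.community.louvain_communities(
        graph, weight="weight", seed=seed
    )
    return sorted(sorted(c) for c in communities)


# Toy messages (invented for illustration, not from the paper's dataset).
messages = [
    "heavy rain flooding subway station",
    "rain flooding road closed",
    "flooding rain rescue team",
    "concert tickets sale tonight",
    "concert tonight stage lights",
    "tickets concert fans queue",
]
clusters = cluster_microblogs(messages)
print(clusters)
```

No cluster count is passed anywhere in the sketch, which mirrors the advantage the abstract claims over LDA; the number of topics falls out of the community structure. On real data the pairwise edge construction is quadratic in the number of messages, so an inverted index from key word to messages would likely be needed.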
Pages: 2287-2294
Page count: 7