A Graph Based Clustering Technique for Tweet Summarization

被引:0
|
作者
Dutta, Soumi [1 ]
Ghatak, Sujata [1 ]
Roy, Moumita [1 ]
Ghosh, Saptarshi [2 ]
Das, Asit Kumar [2 ]
机构
[1] Inst Engn & Management, Comp Sci & Engn, Kolkata 700091, India
[2] Indian Inst Engn Sci & Technol Shibpur, Comp Sci & Technol, Howrah 711103, India
关键词
Twitter; tweet summarization; WordNet; graph clustering; Online Social Network;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Twitter is a very popular online social networking site, where hundreds of millions of tweets are posted every day by millions of users. Twitter is now considered as one of the fastest and most popular communication mediums, and is frequently used to keep track of recent events or news-stories. Whereas tweets related to a particular event / news-story can easily be found using keyword matching, many of the tweets are likely to contain semantically identical information. If a user wants to keep track of an event / news-story, it is difficult for him to have to read all the tweets containing identical or redundant information. Hence, it is desirable to have good techniques to summarize large number of tweets. In this work, we propose a graph-based approach for summarizing tweets, where a graph is first constructed considering the similarity among tweets, and community detection techniques are then used on the graph to cluster similar tweets. Finally, a representative tweet is chosen from each cluster to be included into the summary. The similarity among tweets is measured using various features including features based on WordNet synsets which help to capture the semantic similarity among tweets. The proposed approach achieves better performance than Sumbasic, an existing summarization technique.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Tweet Analytics and Tweet Summarization using Graph Mining
    Naik, Apeksha P.
    Bojewar, Sachin
    2017 INTERNATIONAL CONFERENCE OF ELECTRONICS, COMMUNICATION AND AEROSPACE TECHNOLOGY (ICECA), VOL 1, 2017, : 17 - 21
  • [2] A Genetic Algorithm based tweet clustering Technique
    Dutta, Soumi
    Ghatak, Sujata
    Ghosh, Saptarshi
    Das, Asit K.
    2017 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2017,
  • [3] Graph Based Technique for Hindi Text Summarization
    Kumar, K. Vimal
    Yadav, Divakar
    Sharma, Arun
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 1, 2015, 339 : 301 - 310
  • [4] Graph Summarization Technique Based on Edit Behavior Coding
    Wang X.
    Dong Y.-H.
    Pan J.-F.
    Chen H.-H.
    Qian J.-B.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (12): : 2434 - 2443
  • [5] Tweet Timeline Generation via Graph-Based Dynamic Greedy Clustering
    Fan, Feifan
    Qiang, Runwei
    Lv, Chao
    Zhao, Wayne Xin
    Yang, Jianwu
    INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2015, 2015, 9460 : 304 - 316
  • [6] Clustering cliques for graph-based summarization of the biomedical research literature
    Zhang, Han
    Fiszman, Marcelo
    Shin, Dongwook
    Wilkowski, Bartlomiej
    Rindflesch, Thomas C.
    BMC BIOINFORMATICS, 2013, 14
  • [7] Clustering cliques for graph-based summarization of the biomedical research literature
    Han Zhang
    Marcelo Fiszman
    Dongwook Shin
    Bartlomiej Wilkowski
    Thomas C Rindflesch
    BMC Bioinformatics, 14
  • [8] Clustering based Semantic Data Summarization Technique: A New Approach
    Ahmed, Mohiuddin
    Mahmood, Abdun Naser
    PROCEEDINGS OF THE 2014 9TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2014, : 1780 - 1785
  • [9] A Tweet Summarization Method Based on Maximal Association Rules
    Huyen Trang Phan
    Ngoc Thanh Nguyen
    Hwang, Dosam
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2018, PT I, 2018, 11055 : 373 - 382
  • [10] Tweet-Biased Summarization
    Yulianti, Evi
    Huspi, Sharin
    Sanderson, Mark
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2016, 67 (06) : 1289 - 1300