A Graph Based Clustering Technique for Tweet Summarization

被引:0
|
作者
Dutta, Soumi [1 ]
Ghatak, Sujata [1 ]
Roy, Moumita [1 ]
Ghosh, Saptarshi [2 ]
Das, Asit Kumar [2 ]
机构
[1] Inst Engn & Management, Comp Sci & Engn, Kolkata 700091, India
[2] Indian Inst Engn Sci & Technol Shibpur, Comp Sci & Technol, Howrah 711103, India
关键词
Twitter; tweet summarization; WordNet; graph clustering; Online Social Network;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Twitter is a very popular online social networking site, where hundreds of millions of tweets are posted every day by millions of users. Twitter is now considered as one of the fastest and most popular communication mediums, and is frequently used to keep track of recent events or news-stories. Whereas tweets related to a particular event / news-story can easily be found using keyword matching, many of the tweets are likely to contain semantically identical information. If a user wants to keep track of an event / news-story, it is difficult for him to have to read all the tweets containing identical or redundant information. Hence, it is desirable to have good techniques to summarize large number of tweets. In this work, we propose a graph-based approach for summarizing tweets, where a graph is first constructed considering the similarity among tweets, and community detection techniques are then used on the graph to cluster similar tweets. Finally, a representative tweet is chosen from each cluster to be included into the summary. The similarity among tweets is measured using various features including features based on WordNet synsets which help to capture the semantic similarity among tweets. The proposed approach achieves better performance than Sumbasic, an existing summarization technique.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Sentences clustering based automatic summarization
    Wang, JH
    Zhou, SG
    Hu, YF
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 57 - 62
  • [32] Sentences clustering based automatic summarization
    Wang, Jianhui
    Zhou, Shuigeng
    Hu, Yunfa
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2004, 17 (03):
  • [33] On Summarization and Timeline Generation for Evolutionary Tweet Streams
    Wang, Zhenhua
    Shou, Lidan
    Chen, Ke
    Chen, Gang
    Mehrotra, Sharad
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (05) : 1301 - 1315
  • [34] Optimization Framework Model For Retrospective Tweet Summarization
    Chellal, Abdelhamid
    Boughanem, Mohand
    33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2018, : 704 - 711
  • [35] Graph based Tweet Entity Linking using DBpedia
    Kalloubi, Fahd
    Nfaoui, El Habib
    El Beqqali, Omar
    2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 501 - 506
  • [36] Hashtag Graph based Topic Model for Tweet Mining
    Wang, Yuan
    Liu, Jie
    Qu, Jishi
    Huang, Yalou
    Chen, Jimeng
    Feng, Xia
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 1025 - 1030
  • [37] Sumblr: Continuous Summarization of Evolving Tweet Streams
    Shou, Lidan
    Wang, Zhenhua
    Chen, Ke
    Chen, Gang
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 533 - 542
  • [38] Unsupervised graph-clustering learning framework for financial news summarization
    Wang, Jun
    Tan, Jinghua
    Jin, Hanlei
    Qi, Shuo
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 719 - 726
  • [39] Multi-Video Summarization Using Complex Graph Clustering and Mining
    Shao, Jian
    Jiang, Dongming
    Wang, Mengru
    Chen, Hong
    Yao, Lu
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2010, 7 (01) : 85 - 97
  • [40] Implementation of Vocabulary Based Summarization of Graph (VoG) on the Web Graph
    Sa'adah, Siti
    Rahmat, Kemas S. W.
    Hartomo, Satrio Adityo
    2019 IEEE INTERNATIONAL CONFERENCE ON SIGNALS AND SYSTEMS (ICSIGSYS), 2019, : 156 - 159