A Graph Based Clustering Technique for Tweet Summarization

Cited by: 0
Authors
Dutta, Soumi [1]
Ghatak, Sujata [1]
Roy, Moumita [1]
Ghosh, Saptarshi [2]
Das, Asit Kumar [2]
Affiliations
[1] Inst Engn & Management, Comp Sci & Engn, Kolkata 700091, India
[2] Indian Inst Engn Sci & Technol Shibpur, Comp Sci & Technol, Howrah 711103, India
Keywords
Twitter; tweet summarization; WordNet; graph clustering; Online Social Network;
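DOI
Not available
Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract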
Twitter is a very popular online social networking site, where hundreds of millions of tweets are posted every day by millions of users. Twitter is now considered one of the fastest and most popular communication media, and is frequently used to keep track of recent events or news stories. Although tweets related to a particular event or news story can easily be found using keyword matching, many of these tweets are likely to contain semantically identical information. A user who wants to keep track of an event or news story would find it difficult to read all the tweets containing identical or redundant information. Hence, it is desirable to have good techniques for summarizing a large number of tweets. In this work, we propose a graph-based approach for summarizing tweets, in which a graph is first constructed from the similarity among tweets, and community detection techniques are then applied to the graph to cluster similar tweets. Finally, a representative tweet is chosen from each cluster for inclusion in the summary. The similarity among tweets is measured using various features, including features based on WordNet synsets, which help to capture the semantic similarity among tweets. The proposed approach achieves better performance than SumBasic, an existing summarization technique.
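The three steps named in the abstract (similarity graph, clustering, one representative per cluster) can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' method: similarity is plain Jaccard overlap of word sets instead of WordNet-based features, connected components stand in for a community detection algorithm, the representative is the tweet with the largest total edge weight, and the names `tokens`, `jaccard`, `summarize`, and the `threshold` value are hypothetical.

```python
from itertools import combinations


def tokens(tweet):
    """Lowercase word set for a tweet (a deliberately crude tokenizer)."""
    return set(tweet.lower().split())


def jaccard(a, b):
    """Jaccard overlap of two token sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def summarize(tweets, threshold=0.5):
    """Return one representative tweet per cluster of similar tweets."""
    toks = [tokens(t) for t in tweets]
    n = len(tweets)

    # Step 1: similarity graph. An edge joins two tweets whose overlap
    # meets the (assumed) threshold; the edge weight is the similarity.
    weight = {}
    neighbours = {i: set() for i in range(n)}
    for i, j in combinations(range(n), 2):
        s = jaccard(toks[i], toks[j])
        if s >= threshold:
            weight[(i, j)] = s
            neighbours[i].add(j)
            neighbours[j].add(i)

    # Step 2: clusters are connected components, a simple stand-in for
    # the community detection step described in the abstract.
    seen, clusters = set(), []
    for start in range(n):
        if start in seen:
            continue
        stack, component = [start], []
        seen.add(start)
        while stack:
            node = stack.pop()
            component.append(node)
            for nb in neighbours[node] - seen:
                seen.add(nb)
                stack.append(nb)
        clusters.append(sorted(component))

    # Step 3: the representative is the member with the largest total
    # edge weight; ties go to the tweet that appears first in the input.
    def strength(i):
        return sum(weight[(min(i, j), max(i, j))] for j in neighbours[i])

    return [tweets[max(c, key=strength)] for c in clusters]
```

Calling `summarize` on a list of tweet strings returns a shorter list with one tweet per cluster, in order of each cluster's first appearance in the input. Raising `threshold` splits the input into more, tighter clusters, so the summary grows longer.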
Pages: 6