Summarizing Weibo with Topics Compression

Authors
Litvak, Marina [1 ]
Vanetik, Natalia [1 ]
Li, Lei [2 ]
Affiliations
[1] Shamoon Engn Coll, Dept Software Engn, Beer Sheva, Israel
[2] Beijing Univ Posts & Telecommun, Dept Comp Sci, Beijing, Peoples R China
DOI
10.1007/978-3-319-77116-8_39
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Extractive text summarization aims to select a small subset of sentences that best preserves the content and meaning of the original document. In this paper we describe an unsupervised approach to extractive summarization. It combines hierarchical topic modeling (TM) with the Minimal Description Length (MDL) principle and applies them to the Chinese language. Our summarizer strives to extract the information that best describes the text's topics in terms of MDL. The model is applied to the NLPCC 2015 Shared Task of Weibo-Oriented Chinese News Summarization [1], where Chinese news articles were summarized with the goal of creating short, meaningful messages for Weibo (Sina Weibo is a Chinese microblogging website and one of the most popular sites in China) [2]. The experimental results show that our approach outperforms the other summarizers from the NLPCC 2015 competition.
Pages: 522-534
Page count: 13