Recent advances in document summarization

Cited by: 0
Authors
Jin-ge Yao
Xiaojun Wan
Jianguo Xiao
Affiliations
[1] Peking University,Institute of Computer Science and Technology
[2] Peking University,The MOE Key Laboratory of Computational Linguistics
Keywords
Document summarization; Natural language generation; Natural language processing; Text mining
DOI: not available
Abstract
Automatic document summarization aims to generate short summaries of long documents. A good summary should cover the most important information in the original document or cluster of documents while being coherent, non-redundant, and grammatically readable. Numerous approaches to automatic summarization have been developed to date. In this paper we give a self-contained, broad overview of progress in document summarization over the last 5 years. Specifically, we emphasize significant recent contributions that represent the state of the art in document summarization, including modern sentence extraction approaches that improve concept coverage, information diversity, and content coherence; summarization frameworks that integrate sentence compression; and more abstractive systems that can produce completely new sentences. In addition, we review progress on document summarization in domains, genres, and applications that differ from traditional settings. We also point out some of the latest trends and highlight a few possible future directions.
Pages: 297-336
Page count: 39