Recent advances in document summarization

被引:0
|
作者
Jin-ge Yao
Xiaojun Wan
Jianguo Xiao
机构
[1] Peking University,Institute of Computer Science and Technology
[2] Peking University,The MOE Key Laboratory of Computational Linguistics
来源
关键词
Document summarization; Natural language generation; Natural language processing; Text mining;
D O I
暂无
中图分类号
学科分类号
摘要
The task of automatic document summarization aims at generating short summaries for originally long documents. A good summary should cover the most important information of the original document or a cluster of documents, while being coherent, non-redundant and grammatically readable. Numerous approaches for automatic summarization have been developed to date. In this paper we give a self-contained, broad overview of recent progress made for document summarization within the last 5 years. Specifically, we emphasize on significant contributions made in recent years that represent the state-of-the-art of document summarization, including progress on modern sentence extraction approaches that improve concept coverage, information diversity and content coherence, as well as attempts from summarization frameworks that integrate sentence compression, and more abstractive systems that are able to produce completely new sentences. In addition, we review progress made for document summarization in domains, genres and applications that are different from traditional settings. We also point out some of the latest trends and highlight a few possible future directions.
引用
收藏
页码:297 / 336
页数:39
相关论文
共 50 条
  • [1] Recent advances in document summarization
    Yao, Jin-ge
    Wan, Xiaojun
    Xiao, Jianguo
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 53 (02) : 297 - 336
  • [2] Recent advances in automatic speech summarization
    Furui, Sadaoki
    [J]. 2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 16 - 21
  • [3] Interactive Document Summarization
    Said, Raoufdine
    Guille, Adrien
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT V, 2024, 14612 : 177 - 181
  • [4] Advances in Code Summarization
    Desai, Utkarsh
    Sridhara, Giriprasad
    Tamilselvam, Srikanth
    [J]. 2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2021), 2021, : 330 - 331
  • [5] Multiple Data Document Summarization
    Kishore, V. V. Krishna
    Singh, Pramod Kumar
    [J]. 2017 CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (CICT), 2017,
  • [6] A Topological Collapse for Document Summarization
    Guan, Hui
    Tang, Wen
    Krim, Hamid
    Keiser, James
    Rindos, Andrew
    Sazdanovic, Radmila
    [J]. 2016 IEEE 17TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (SPAWC), 2016,
  • [7] Multimodal news document summarization
    Javed, Hira
    Akhtar, Nadeem
    Beg, M. M. Sufyan
    [J]. JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2024, 45 (04): : 959 - 968
  • [8] On the Abstractiveness of Neural Document Summarization
    Zhang, Fangfang
    Yao, Jin-ge
    Yan, Rui
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 785 - 790
  • [9] Document Summarization with Latent Queries
    Xu, Yumo
    Lapata, Mirella
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 623 - 638
  • [10] A Brief Review of Document Image Retrieval Methods: Recent Advances
    Alaei, Fahimeh
    Alaei, Alireza
    Blumenstein, Michael
    Pal, Umapada
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3500 - 3507