BottleSum: Unsupervised and Self-supervised Sentence Summarization using the Information Bottleneck Principle

Authors
West, Peter [1]
Holtzman, Ari [1, 2]
Buys, Jan [1]
Choi, Yejin [1, 2]
Affiliations
[1] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA
[2] Allen Inst Artificial Intelligence, Seattle, WA USA
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The principle of the Information Bottleneck (Tishby et al., 1999) is to produce a summary of information X optimized to predict some other relevant information Y. In this paper, we propose a novel approach to unsupervised sentence summarization by mapping the Information Bottleneck principle to a conditional language modelling objective: given a sentence, our approach seeks a compressed sentence that can best predict the next sentence. Our iterative algorithm under the Information Bottleneck objective searches gradually shorter subsequences of the given sentence while maximizing the probability of the next sentence conditioned on the summary. Using only pretrained language models with no direct supervision, our approach can efficiently perform extractive sentence summarization over a large corpus. Building on our unsupervised extractive summarization (BottleSumEx), we then present a new approach to self-supervised abstractive summarization (BottleSumSelf), where a transformer-based language model is trained on the output summaries of our unsupervised method. Empirical results demonstrate that our extractive method outperforms other unsupervised models on multiple automatic metrics. In addition, we find that our self-supervised abstractive model outperforms unsupervised baselines (including our own) by human evaluation along multiple attributes.
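The iterative search described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the names `shorter_candidates`, `toy_relevance`, and `compress`, the beam size, and the span limit are all hypothetical, and the word-overlap score is only a stand-in for the next-sentence probability that the paper obtains from a pretrained language model. The sketch repeatedly deletes short spans of words and keeps the shortest subsequences whose relevance to the next sentence does not fall below that of the full sentence.

```python
from typing import Callable, List, Sequence, Tuple

Tokens = Tuple[str, ...]


def shorter_candidates(tokens: Tokens, max_span: int = 2) -> List[Tokens]:
    """Subsequences formed by deleting one contiguous span of 1..max_span words."""
    out = []
    for span in range(1, max_span + 1):
        for start in range(len(tokens) - span + 1):
            cand = tokens[:start] + tokens[start + span:]
            if cand:  # never propose the empty summary
                out.append(cand)
    return out


def toy_relevance(summary: Sequence[str], next_sentence: Sequence[str]) -> float:
    """Hypothetical stand-in for log p(next_sentence | summary):
    the number of distinct next-sentence words that the summary retains."""
    return float(len(set(summary) & set(next_sentence)))


def compress(
    sentence: Sequence[str],
    next_sentence: Sequence[str],
    score: Callable[[Sequence[str], Sequence[str]], float] = toy_relevance,
    beam_size: int = 3,
    max_span: int = 2,
) -> Tokens:
    """Search gradually shorter subsequences without losing relevance (assumed criterion)."""
    source, nxt = tuple(sentence), tuple(next_sentence)
    floor = score(source, nxt)  # relevance of the uncompressed sentence
    best, beam = source, [source]
    while beam:
        # Expand every beam member and keep only candidates that stay at or above the floor.
        pool = {
            cand
            for parent in beam
            for cand in shorter_candidates(parent, max_span)
            if score(cand, nxt) >= floor
        }
        # Prefer shorter candidates; break ties lexicographically for determinism.
        beam = sorted(pool, key=lambda c: (len(c), c))[:beam_size]
        if beam and len(beam[0]) < len(best):
            best = beam[0]
    return best


sentence = "the hurricane made landfall in florida on tuesday morning".split()
next_sentence = "officials in florida said the hurricane caused flooding".split()
summary = compress(sentence, next_sentence)
print(" ".join(summary))  # prints: the hurricane in florida
```

To move closer to the setting in the abstract, the `score` argument could be replaced by a function that returns the log-probability of the next sentence under a pretrained language model conditioned on the candidate summary; the search loop itself would not need to change.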
Pages: 3752-3761
Number of pages: 10