BottleSum: Unsupervised and Self-supervised Sentence Summarization using the Information Bottleneck Principle

被引:0
|
作者
West, Peter [1 ]
Holtzman, Ari [1 ,2 ]
Buys, Jan [1 ]
Choi, Yejin [1 ,2 ]
机构
[1] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA
[2] Allen Inst Artificial Intelligence, Seattle, WA USA
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The principle of the Information Bottleneck (Tishby et al., 1999) is to produce a summary of information X optimized to predict some other relevant information Y. In this paper, we propose a novel approach to unsupervised sentence summarization by mapping the Information Bottleneck principle to a conditional language modelling objective: given a sentence, our approach seeks a compressed sentence that can best predict the next sentence. Our iterative algorithm under the Information Bottleneck objective searches gradually shorter subsequences of the given sentence while maximizing the probability of the next sentence conditioned on the summary. Using only pretrained language models with no direct supervision, our approach can efficiently perform extractive sentence summarization over a large corpus. Building on our unsupervised extractive summarization (BottleSumEx), we then present a new approach to self-supervised abstractive summarization (BottleSumSelf), where a transformer-based language model is trained on the output summaries of our unsupervised method. Empirical results demonstrate that our extractive method outperforms other unsupervised models on multiple automatic metrics. In addition, we find that our selfsupervised abstractive model outperforms unsupervised baselines (including our own) by human evaluation along multiple attributes.
引用
收藏
页码:3752 / 3761
页数:10
相关论文
共 50 条
  • [21] Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation
    Kreuk, Felix
    Keshet, Joseph
    Adi, Yossi
    INTERSPEECH 2020, 2020, : 3700 - 3704
  • [22] Self-Supervised Surgical Tool Segmentation using Kinematic Information
    Rocha, Crisian da Costa
    Padoy, Nicolas
    Rosa, Benoit
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 8720 - 8726
  • [23] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
    Wang, Xinghao
    He, Junliang
    Wang, Pengyu
    Zhou, Yunhua
    Sun, Tianxiang
    Qiu, Xipeng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19180 - 19188
  • [24] A Self-Supervised Representation Learning of Sentence Structure for Authorship Attribution
    Jafariakinabad, Fereshteh
    Hua, Kien A.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (04)
  • [25] ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
    Yan, Yuanmeng
    Li, Rumei
    Wang, Sirui
    Zhang, Fuzheng
    Wu, Wei
    Xu, Weiran
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 5065 - 5075
  • [26] RobustEmbed: Robust Sentence Embeddings Using Self-Supervised Contrastive Pre-Training
    Asl, Javad Rafiei
    Blanco, Eduardo
    Takabi, Daniel
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 4587 - 4603
  • [27] Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios
    S. Y. Kung
    Yuhui Luo
    Man-Wai Mak
    Journal of Signal Processing Systems, 2010, 61 : 3 - 20
  • [28] Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios
    Kung, S. Y.
    Luo, Yuhui
    Mak, Man-Wai
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2010, 61 (01): : 3 - 20
  • [29] Exploring complementary information of self-supervised pretext tasks for unsupervised video pre-training
    Zhou, Wei
    Hou, Yi
    Ouyang, Kewei
    Zhou, Shilin
    IET COMPUTER VISION, 2022, 16 (03) : 255 - 265
  • [30] A sentence sentiment classification method based on Self-supervised and Self-attention
    Xiao, Jianqiong
    Zhou, Zhiyong
    PROCEEDINGS OF 2020 IEEE 5TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2020), 2020, : 1139 - 1143