Unsupervised Deep Keyphrase Generation

被引:0
|
作者
Shen, Xianjie [1 ]
Wang, Yinghan [2 ]
Meng, Rui [3 ]
Shang, Jingbo [1 ]
机构
[1] Univ Calif San Diego, La Jolla, CA 92093 USA
[2] Amazon Com Inc, Seattle, WA USA
[3] Salesforce Res, Palo Alto, CA USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Keyphrase generation aims to summarize long documents with a collection of salient phrases. Deep neural models have demonstrated remarkable success in this task, with the capability of predicting keyphrases that are even absent from a document. However, such abstractiveness is acquired at the expense of a substantial amount of annotated data. In this paper, we present a novel method for keyphrase generation, AutoKeyGen, without the supervision of any annotated doc-keyphrase pairs. Motivated by the observation that an absent keyphrase in a document may appear in other places, in whole or in part, we construct a phrase bank by pooling all phrases extracted from a corpus. With this phrase bank, we assign phrase candidates to new documents by a simple partial matching algorithm, and then we rank these candidates by their relevance to the document from both lexical and semantic perspectives. Moreover, we bootstrap a deep generative model using these top-ranked pseudo keyphrases to produce more absent candidates. Extensive experiments demonstrate that AutoKeyGen outperforms all unsupervised baselines and can even beat a strong supervised method in certain cases.
引用
收藏
页码:11303 / 11311
页数:9
相关论文
共 50 条
  • [1] Deep Keyphrase Generation
    Meng, Rui
    Zhao, Sanqiang
    Han, Shuguang
    He, Daqing
    Brusilovsky, Peter
    Chi, Yu
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 582 - 592
  • [2] Unsupervised Open-domain Keyphrase Generation
    Lam Thanh Do
    Akash, Pritom Saha
    Chang, Kevin Chen-Chuan
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 10614 - 10627
  • [3] Hyperbolic Deep Keyphrase Generation
    Zhang, Yuxiang
    Yang, Tianyu
    Jiang, Tao
    Li, Xiaoli
    Wang, Suge
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT II, 2023, 13714 : 521 - 536
  • [4] Exclusive Hierarchical Decoding for Deep Keyphrase Generation
    Chen, Wang
    Chan, Hou Pong
    Li, Piji
    King, Irwin
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1095 - 1105
  • [5] KGAgent: Learning a Deep Reinforced Agent for Keyphrase Generation
    Yao, Yu
    Yang, Peng
    Zhao, Guangzhen
    Yin, Guoshun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1928 - 1940
  • [6] Deep Keyphrase Generation with a Convolutional Sequence to Sequence Model
    Yong Zhang
    Yang Fang
    Xiao Weidong
    2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 1477 - 1485
  • [7] TripleRank: An unsupervised keyphrase extraction algorithm
    Li, Tuohang
    Hu, Liang
    Li, Hongtu
    Sun, Chengyu
    Li, Shuai
    Chi, Ling
    KNOWLEDGE-BASED SYSTEMS, 2021, 219 (219)
  • [8] Unsupervised keyphrase extraction for search ontologies
    Gulla, Jon Atle
    Borch, Hans Olaf
    Ingvaldsen, Jon Espen
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2006, 3999 : 25 - 36
  • [9] HTKG: Deep Keyphrase Generation with Neural Hierarchical Topic Guidance
    Zhang, Yuxiang
    Jiang, Tao
    Yang, Tianyu
    Li, Xiaoli
    Wang, Suge
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1044 - 1054
  • [10] Unsupervised Keyphrase Extraction for Web Pages
    Haarman, Tim
    Zijlema, Bastiaan
    Wiering, Marco
    MULTIMODAL TECHNOLOGIES AND INTERACTION, 2019, 3 (03)