SAKE: A Graph-Based Keyphrase Extraction Method Using Self-attention

Cited by: 0
Authors
Zhu, Ping [1]
Gong, Chuanyang [1]
Wei, Zhihua [1]
Affiliations
[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China
Keywords
Keywords extraction; Self attention; Pre-trained model;
DOI
10.1007/978-3-031-08530-7_28
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Keyphrase extraction is a text analysis technique that automatically extracts the most used and most important words and expressions from a text. It helps summarize the content of texts and recognize the main topics discussed. The majority of the existing techniques are mainly domain-specific, which require application domain knowledge and employ higher-order statistical methods. Supervised keyphrase extraction requires a large amount of labeled training data and has poor generalization ability outside the training data domain. Unsupervised systems have poor accuracy, and often do not generalize well. This paper proposes an unsupervised graph-based keyphrase extraction model that incorporates the words' self-attention score. Specifically, the proposed approach identifies the importance of each source word based on a word graph built by the self-attention layer in the Transformer and further introduces a new mechanism to capture the relationships between words in different sentences. The experimental results show that the proposed approach achieves remarkable improvements over the state-of-the-art models.
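The graph-ranking idea in the abstract can be illustrated with a minimal sketch. It makes several assumptions that are not taken from the paper: the attention matrix is a hand-made toy rather than the output of a Transformer layer, the function name `rank_words` is hypothetical, and a standard PageRank-style iteration stands in for the paper's scoring, which the abstract does not specify. The cross-sentence mechanism mentioned in the abstract is not reproduced.

```python
from typing import Dict, List


def rank_words(tokens: List[str], attention: List[List[float]],
               damping: float = 0.85, iterations: int = 100) -> Dict[str, float]:
    """Score words with a PageRank-style walk over an attention-weighted graph.

    attention[i][j] is assumed to be how much token i attends to token j.
    This is an illustrative stand-in, not the scoring used in the paper.
    """
    n = len(tokens)
    if n == 0:
        return {}

    # Edge i -> j carries the attention token i pays to token j; self-loops are dropped.
    weights = [[0.0 if i == j else attention[i][j] for j in range(n)]
               for i in range(n)]
    out_sums = [sum(row) for row in weights]

    scores = [1.0 / n] * n
    for _ in range(iterations):
        # Score held by tokens with no outgoing edges is spread uniformly.
        dangling = sum(scores[i] for i in range(n) if out_sums[i] == 0.0)
        new_scores = []
        for j in range(n):
            inflow = sum(scores[i] * weights[i][j] / out_sums[i]
                         for i in range(n) if out_sums[i] > 0.0)
            new_scores.append((1.0 - damping) / n
                              + damping * (inflow + dangling / n))
        scores = new_scores

    # Repeated tokens are merged into one score per word.
    word_scores: Dict[str, float] = {}
    for token, score in zip(tokens, scores):
        word_scores[token] = word_scores.get(token, 0.0) + score
    return word_scores


# Toy attention matrix (assumed values): the two content words attend
# mostly to each other, and little attention flows to "the".
tokens = ["graph", "ranking", "the"]
attention = [
    [0.0, 0.9, 0.1],
    [0.8, 0.0, 0.2],
    [0.5, 0.5, 0.0],
]
scores = rank_words(tokens, attention)
for word in sorted(scores, key=scores.get, reverse=True):
    print(word, round(scores[word], 3))
```

In a real pipeline the matrix would presumably come from a pre-trained model's self-attention layer, and candidate phrases would then be scored from their constituent words; both steps are left out here.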
Pages: 339-350
Page count: 12
Related papers
50 in total
  • [1] Automatic Keyphrase Extraction using Graph-based Methods
    Mothe, Josiane
    Ramiandrisoa, Faneva
    Rasolomanana, Michael
    33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2018, : 728 - 730
  • [2] A Dependency Graph-Based Keyphrase Extraction Method Using Anti-patterns
    Batsuren, Khuyagbaatar
    Batbaatar, Erdenebileg
    Munkhdalai, Tsendsuren
    Li, Meijing
    Namsrai, Oyun-Erdene
    Ryu, Keun Ho
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2018, 14 (05): : 1254 - 1271
  • [3] Graph-based Keyphrase Extraction Using Word and Document Embeddings
    Zu, Xian
    Xie, Fei
    Liu, Xiaojian
    11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 70 - 76
  • [4] A Graph-based Approach of Automatic Keyphrase Extraction
    Yan Ying
    Tan Qingping
    Xie Qinzheng
    Zeng Ping
    Li Panpan
    ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 107 : 248 - 255
  • [5] Keyphrase Generation Based on Self-Attention Mechanism
    Yang, Kehua
    Wang, Yaodong
    Zhang, Wei
    Yao, Jiqing
    Le, Yuquan
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 61 (02): : 569 - 581
  • [6] Keyphrase extraction using graph-based statistical approach with NLP patterns
    Mehta, Siddhesh
    Karwa, Rushikesh
    Chavan, Rahul
    Khatavkar, Vaibhav
    Joshi, Amit
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2024, 49 (02):
  • [7] A Keyphrase Graph-Based Method for Document Similarity Measurement
    Huynh, ThanhThuong T.
    TruongAn PhamNguyen
    Do, Nhon, V
    ENGINEERING LETTERS, 2022, 30 (02) : 692 - 710
  • [8] SAMRank: Unsupervised Keyphrase Extraction using Self-Attention Map in BERT and GPT-2
    Kang, Byungha
    Shin, Youhyun
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 10188 - 10201
  • [9] A Noun-Centric Keyphrase Extraction Model: Graph-Based Approach
    Abimbola, Rilwan O.
    Awoyelu, Iyabo O.
    Hunsu, Folasade O.
    Akinyemi, Bodunde O.
    Aderounmu, Ganiyu A.
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2022, 13 (06) : 578 - 589
  • [10] NE-Rank: A Novel Graph-based Keyphrase Extraction in Twitter
    Bellaachia, Abdelghani
    Al-Dhelaan, Mohammed
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 372 - 379