An unsupervised keyphrase extraction model by incorporating structural and semantic information

被引:0
|
作者
Linkai Luo
Longmin Zhang
Hong Peng
机构
[1] Xiamen University,Department of Automation
来源
关键词
Keyphrase extraction; Unsupervised model; Structural information; Semantic information; Graph-based model;
D O I
暂无
中图分类号
学科分类号
摘要
We proposed an unsupervised keyphrase extraction model that incorporates the structural information and the semantic information of a document. The structural information refers to the directed graph that is composed of keyphrase candidates and topics. The weight between two candidates is computed by their relative distance in the document and the positions of the corresponding sentences. Graph ranking algorithm is then applied to get the structural scores of the candidates. Then, the semantic score is obtained by the similarity between candidate and all sentences. The final score of a candidate is the sum of the structural score and the semantic score. The top N candidates with the highest scores are selected as the recommended keyphrases. The comparison experiments on three widely used datasets show that our model achieves the best results in the long documents and a competitive result in the short document. It indicates that our model is effective and is superior to the state-of-the-art unsupervised models.
引用
收藏
页码:77 / 83
页数:6
相关论文
共 50 条
  • [1] An unsupervised keyphrase extraction model by incorporating structural and semantic information
    Luo, Linkai
    Zhang, Longmin
    Peng, Hong
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, 2020, 9 (01) : 77 - 83
  • [2] Contextual topic discovery using unsupervised keyphrase extraction and hierarchical semantic graph model
    Du, Hung
    Thudumu, Srikanth
    Giardina, Antonio
    Vasa, Rajesh
    Mouzakis, Kon
    Jiang, Li
    Chisholm, John
    Bista, Sanat
    [J]. JOURNAL OF BIG DATA, 2023, 10 (01)
  • [3] Contextual topic discovery using unsupervised keyphrase extraction and hierarchical semantic graph model
    Hung Du
    Srikanth Thudumu
    Antonio Giardina
    Rajesh Vasa
    Kon Mouzakis
    Li Jiang
    John Chisholm
    Sanat Bista
    [J]. Journal of Big Data, 10
  • [4] Improved Automatic Keyphrase Extraction by Using Semantic Information
    Wang, XiaoLing
    Mu, DeJun
    Fang, Jun
    [J]. INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL 1, PROCEEDINGS, 2008, : 1061 - 1065
  • [5] Glocal: Incorporating Global Information in Local Convolution for Keyphrase Extraction
    Prasad, Animesh
    Kan, Min-Yen
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1837 - 1846
  • [6] KeyVector: Unsupervised Keyphrase Extraction Using Weighted Topic via Semantic Relatedness
    Toleu, Alymzhan
    Tolegen, Gulmira
    Mussabayev, Rustam
    [J]. COMPUTACION Y SISTEMAS, 2019, 23 (03): : 861 - 869
  • [7] TripleRank: An unsupervised keyphrase extraction algorithm
    Li, Tuohang
    Hu, Liang
    Li, Hongtu
    Sun, Chengyu
    Li, Shuai
    Chi, Ling
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 219 (219)
  • [8] Unsupervised keyphrase extraction for search ontologies
    Gulla, Jon Atle
    Borch, Hans Olaf
    Ingvaldsen, Jon Espen
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2006, 3999 : 25 - 36
  • [9] Unsupervised Keyphrase Extraction for Web Pages
    Haarman, Tim
    Zijlema, Bastiaan
    Wiering, Marco
    [J]. MULTIMODAL TECHNOLOGIES AND INTERACTION, 2019, 3 (03)
  • [10] Combination of Unsupervised Keyphrase Extraction Algorithms
    Zhu, Zede
    Li, Miao
    Chen, Lei
    Yang, Zhenxin
    Chen, Sheng
    [J]. 2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 33 - 36