A Graph-Based Keyword Extraction Method for Academic Literature Knowledge Graph Construction

Authors
Zhang, Lin [1]
Li, Yanan [1]
Li, Qinru [1]
Affiliation
[1] Dalian Maritime Univ, Sch Maritime Econ & Management, Dalian 116026, Peoples R China
Keywords
keyword extraction; TextRank; word embedding; text statistical features; academic literature knowledge graph
DOI
10.3390/math12091349
Chinese Library Classification (CLC) number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
This paper constructs an academic literature knowledge graph based on the relationships between documents, to facilitate the storage and study of academic literature data. Keywords are an important type of node in the knowledge graph. Because some documents lack keywords, which hinders knowledge graph construction, an improved keyword extraction algorithm called TP-CoGlo-TextRank is proposed that uses word frequency, word position, word co-occurrence frequency, and a word embedding model. Combining word frequency and position in the document distinguishes the importance of words. Introducing the GloVe word-embedding model brings knowledge external to the documents into the TextRank algorithm; combined with the word co-occurrence frequency within the documents, this lets the word-adjacency relationship be transferred non-uniformly. Finally, the highest-scoring words are combined into phrases if they are adjacent in the original text. Experiments verify the validity of the TP-CoGlo-TextRank algorithm. On this basis, the Neo4j graph database is used to store and display the academic literature knowledge graph, providing data support for research tasks such as text clustering, automatic summarization, and question-answering systems.
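The abstract does not give the algorithm's formulas, so the following is only a minimal sketch of the general idea it describes, not the authors' TP-CoGlo-TextRank. It assumes a node prior built from word frequency and first position, edge weights built from co-occurrence counts scaled by embedding cosine similarity, and a final step that merges adjacent top-scoring words into phrases. The function names, the weighting formulas, and the toy 2-D vectors (standing in for GloVe embeddings) are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def rank_words(tokens, embeddings, window=2, damping=0.85, iters=50):
    """Weighted TextRank-style scores; all weighting choices are assumptions."""
    n = len(tokens)
    freq = Counter(tokens)
    first = {}
    for i, w in enumerate(tokens):
        first.setdefault(w, i)
    # Assumed prior: relative frequency plus a bonus for appearing early.
    prior = {w: freq[w] / n + (1 - first[w] / n) for w in freq}
    total = sum(prior.values())
    prior = {w: p / total for w, p in prior.items()}

    # Symmetric co-occurrence counts inside a sliding window.
    cooc = defaultdict(Counter)
    for i, w in enumerate(tokens):
        for j in range(i + 1, min(i + window + 1, n)):
            v = tokens[j]
            if v != w:
                cooc[w][v] += 1
                cooc[v][w] += 1

    # Assumed edge weight: co-occurrence count scaled by embedding similarity,
    # so score is passed to neighbours non-uniformly.
    weight = {
        w: {v: c * (1 + max(0.0, cosine(embeddings[w], embeddings[v])))
            for v, c in nbrs.items()}
        for w, nbrs in cooc.items()
    }
    out_sum = {w: sum(nbrs.values()) for w, nbrs in weight.items()}

    score = dict(prior)
    for _ in range(iters):
        score = {
            w: (1 - damping) * prior[w] + damping * sum(
                score[v] * weight[v][w] / out_sum[v] for v in weight.get(w, {}))
            for w in freq
        }
    return score


def merge_phrases(tokens, score, top_k=3):
    """Join top-k words into phrases when they are adjacent in the text."""
    top = set(sorted(score, key=score.get, reverse=True)[:top_k])
    phrases, run = set(), []
    for w in tokens + [None]:  # None is a sentinel that flushes the last run
        if w in top:
            run.append(w)
        else:
            if run:
                phrases.add(" ".join(run))
            run = []
    return phrases


# Toy input: hypothetical tokens and 2-D vectors standing in for GloVe.
tokens = "knowledge graph keyword extraction builds knowledge graph nodes".split()
embeddings = {
    "knowledge": [1.0, 0.2], "graph": [0.9, 0.3], "keyword": [0.2, 1.0],
    "extraction": [0.3, 0.9], "builds": [0.5, 0.5], "nodes": [0.8, 0.4],
}
scores = rank_words(tokens, embeddings)
print(merge_phrases(tokens, scores))
```

In this sketch the prior replaces the uniform teleport term of plain TextRank, and the similarity-scaled weights replace its uniform edge weights; a real implementation would load pretrained GloVe vectors and filter candidate words (for example by part of speech) before ranking.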
Pages: 25