An Unsupervised Approach for Keyphrase Extraction Using Within-Collection Resources

被引:0
|
作者
Li, Teng-Fei [1 ]
Hu, Liang [1 ]
Chu, Jian-Feng [1 ]
Li, Hong-Tu [1 ]
Chi, Ling [1 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130000, Jilin, Peoples R China
关键词
Phrase extraction; graph-based ranking; topic-based clustering; within-collection resource; NLP;
D O I
10.1109/ACCESS.2019.2938213
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is hard to select and read suitable documents due to the rapidly growing number of scholarly documents. Keyphrases can be considered as the gist of a document so that a researcher can select the documents that they want using keyphrase queries. However, there are also many scholarly documents without any keyphrases tagged by the authors or other researchers. Automatic keyphrase extraction can help researchers to quickly extract keyphrases. This paper proposed an unsupervised approach for keyphrase extraction using graph-based ranking and topic-based clustering under the assumption that we only use the within-collection resources. We use graph-based ranking to describe the relevance between two words and topic-based clustering to embed semantical information into words. In this paper, we assume that each word has its own meaning, and each meaning can be considered as a topic, though we know nothing about these meanings. We use topic-based clustering to assign the "correct meaning" to the "correct word". In addition, by taking the relevance among phrases into consideration and only using within-collection resources, we can use the graph-based ranking in our approach. The edges in a graph that are built for phrases can describe the hidden relevance between two phrases, and the weights that are set for edges can measure the connection between two phrases. Then, after using the position feature, our approach consists of an enhanced graphbased ranking and a topic-based clustering. The experiments are run on four datasets: KDD, WWW, GSN and ACM. The results indicate that our approach has better performance than the state-of-the-art methods.
引用
收藏
页码:126088 / 126097
页数:10
相关论文
共 50 条
  • [41] Keyphrases Concentrated Area Identification from Academic Articles as Feature of Keyphrase Extraction: A New Unsupervised Approach
    Miah, Mohammad Badrul Alam
    Awang, Suryanti
    Azad, Md Saiful
    Rahman, Md Mustafizur
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (01) : 788 - 796
  • [42] Document-level Keyphrase Extraction Approach using Neighborhood Knowledge
    Li C.-L.
    Long J.-H.
    Tang Z.-L.
    Zhou T.
    [J]. Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2021, 50 (04): : 551 - 557
  • [43] Unsupervised Topic-Oriented Keyphrase Extraction and Its Application to Croatian
    Saratlija, Josip
    Snajder, Jan
    Basic, Bojana Dalbelo
    [J]. TEXT, SPEECH AND DIALOGUE, TSD 2011, 2011, 6836 : 340 - 347
  • [44] AdaptiveUKE: Towards adaptive unsupervised keyphrase extraction with gated topic modeling
    Liu, Qi
    Ke, Wenjun
    Yuan, Xiaoguang
    Yang, Yuting
    Zhao, Hua
    Wang, Peng
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 250
  • [45] Keyphrase Extraction Using Knowledge Graphs
    Shi W.
    Zheng W.
    Yu J.X.
    Cheng H.
    Zou L.
    [J]. Data Science and Engineering, 2017, 2 (4) : 275 - 288
  • [46] A Two-Level Keyphrase Extraction Approach
    Ali, Chedi Bechikh
    Wang, Rui
    Haddad, Hatem
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT II, 2015, 9042 : 390 - 401
  • [47] A LDA-based approach to keyphrase extraction
    Department of Automation, University of Science and Technology of China, Hefei
    230026, China
    不详
    230031, China
    [J]. Zhongnan Daxue Xuebao (Ziran Kexue Ban), 6 (2142-2148):
  • [48] A SUPERVISED LEARNING APPROACH FOR AUTOMATIC KEYPHRASE EXTRACTION
    Abulaish, Muhammad
    Anwar, Tarique
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (11): : 7579 - 7601
  • [49] Turkish keyphrase extraction using KEA
    Pala, Nagehan
    Cicekli, Ilyas
    [J]. 2007 22ND INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2007, : 192 - 196
  • [50] Keyphrase Extraction Using Knowledge Graphs
    Shi, Wei
    Zheng, Weiguo
    Yu, Jeffrey Xu
    Cheng, Hong
    Zou, Lei
    [J]. WEB AND BIG DATA, APWEB-WAIM 2017, PT I, 2017, 10366 : 132 - 148