A Keyword Extraction Method Based on Learning to Rank

被引:0
|
作者
Cai, Xianggao [1 ]
Cao, Shujin [1 ]
机构
[1] Sun Yat Sen Univ, Informat Management Sch, Guangzhou, Guangdong, Peoples R China
关键词
keyword extraction; learning to rank; SVMRank; TF-IDF; TextRank; LDA;
D O I
10.1109/SKG.2017.00040
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The extraction of keywords from document text is a hot research area. As machine learning techniques have been applied to many fields successfully, this study aims to explore how to optimize keyword extraction using Support Vector Machine for Ranking (SVMRank). Firstly, we constructed some features for each candidate word segmented from a document by employing the output rank of certain traditional extraction algorithms, such as TF-IDF, Text Rank, and LDA. Secondly, we labeled each candidate with an important rank through artificial auxiliary. Finally, we built up a SVMRank model to learn how to rank the candidates. The most important advantage of this approach is that it can integrate the advantages of other keyword extraction methods and overcome their shortcomings. The experiment results show that the SVMRank approach would improve the extraction precision and recall by 6% and 5%, respectively.
引用
收藏
页码:194 / 197
页数:4
相关论文
共 50 条
  • [1] ListBM: A Learning-to-Rank Method for XML Keyword Search
    Gao, Nmg
    Deng, Zhi-Hong
    Xiang, Yong-Qing
    Hang, Yu
    [J]. FOCUSED RETRIEVAL AND EVALUATION, 2010, 6203 : 81 - 87
  • [2] Implementing Graph Based Rank on Online News Media Keyword Extraction
    Syafiandini, Arida Ferti
    Mustika, Hani Febri
    Manik, Lindung Parningotan
    Rianto, Yan
    Akbar, Zaenal
    [J]. 2019 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS AND ITS APPLICATIONS (IC3INA), 2019, : 108 - 113
  • [3] Keyword extraction method based on complex network
    Electric Power Research Institute of Yunnan Power Grid, Kunming
    650217, China
    不详
    710000, China
    不详
    650091, China
    [J]. Int. J. Inf. Commun. Technol, 2024, 4 (323-335):
  • [4] Selectivity-Based Keyword Extraction Method
    Beliga, Slobodan
    Mestrovic, Ana
    Martincic-Ipsic, Sanda
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2016, 12 (03) : 1 - 26
  • [5] An answer summarization method based on keyword extraction
    Fan, Qiaoqing
    Fang, Yu
    [J]. 2016 INTERNATIONAL CONFERENCE ON MEDICINE SCIENCES AND BIOENGINEERING (ICMSB2016), 2017, 8
  • [6] A Keyword Extraction Method Based on Lexical Chains
    Jiang, Xiao-yu
    [J]. ADVANCES IN COMPUTATION AND INTELLIGENCE, PROCEEDINGS, 2008, 5370 : 360 - 367
  • [7] A Patent Keyword Extraction Method Based on Corpus Classification
    Sun, Changjian
    Chen, Wentao
    Zhang, Zhen
    Zhang, Tian
    [J]. MATHEMATICS, 2024, 12 (07)
  • [8] Semantic-based keyword extraction method for document
    Jiang, Fang
    Li, Guohe
    Yun, Xue
    Yue, Xiang
    [J]. International Journal of Future Generation Communication and Networking, 2015, 8 (05): : 37 - 46
  • [9] Chinese Handwriting Identification Method Based on Keyword Extraction
    Chen, Rui
    Fang, Bin
    Wang, Patrick Shen-Pei
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (11)
  • [10] Learning to Rank based Gene Summary Extraction
    Shang, Yue
    Hao, Huihui
    Lin, Hongfei
    Wu, Jiajin
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,