Keyword Extraction Performance Analysis

被引:4
|
作者
Kumbhar, Abhishek [1 ]
Savargaonkar, Mayuresh [1 ]
Nalwaya, Aayush [1 ]
Bian, Chengqi [1 ]
Abouelenien, Mohamed [1 ]
机构
[1] Univ Michigan, Dearborn, MI 48128 USA
关键词
NLP; Keyword Extraction; Text Mining;
D O I
10.1109/MIPR.2019.00111
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a survey-cum-evaluation of methods for the comprehensive comparison of the task of keyword extraction using datasets of various sizes, forms, and genre. We use four different datasets which includes Amazon product data - Automotive, SemEval 2010, TMDB and Stack Exchange. Moreover, a subset of 100 Amazon product reviews is annotated and utilized for evaluation in this paper, to our knowledge, for the first time. Datasets are evaluated by five Natural Language Processing approaches (3 unsupervised and 2 supervised), which include TF-IDF, RAKE, TextRank, LDA and Shallow Neural Network. We use a ten-fold cross-validation scheme and evaluate the performance of the aforementioned approaches using recall, precision and F-score. Our analysis and results provide guidelines on the proper approaches to use for different types of datasets. Furthermore, our results indicate that certain approaches achieve improved performance with certain datasets due to inherent characteristics of the data.
引用
收藏
页码:550 / 553
页数:4
相关论文
共 50 条
  • [1] Performance Analysis of Keyword Extraction Algorithms Assessing Extractive Text Summarization
    Kumar, Akshi
    Sharma, Aditi
    Sharma, Sidhant
    Kashyap, Shashwat
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATIONS AND ELECTRONICS (COMPTELIX), 2017, : 408 - 414
  • [2] Data Fusion : Boosting Performance in Keyword Extraction
    Bohne, Thomas
    Borghoff, Uwe M.
    [J]. 2013 20TH ANNUAL IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON THE ENGINEERING OF COMPUTER BASED SYSTEMS (ECBS 2013), 2013, : 166 - 173
  • [3] General-use unsupervised keyword extraction model for keyword analysis
    Shin, Hunsik
    Lee, Hye Jin
    Cho, Sungzoon
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 233
  • [4] Performance Analysis of Different Keyword Extraction Algorithms for Emotion Recognition from Uyghur Text
    Imam, Seyyare
    Parhat, Rayilam
    Hamdulla, Askar
    Li, Zhijun
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 351 - 351
  • [5] Automatic keyword extraction by server log analysis
    Ding, C
    Zhou, J
    Chi, CH
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2005, 2005, 3806 : 605 - 606
  • [6] An Analysis on Different Document Keyword Extraction Methods
    Thushara, M. G.
    Anjali, S.
    Nair, Meera M.
    [J]. PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 933 - 937
  • [7] Evaluating the Performance of SOBEK Text Mining Keyword Extraction Algorithm
    Reategui, Eliseo
    Bigolin, Marcio
    Carniato, Michel
    dos Santos, Rafael Antunes
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, CD-MAKE 2022, 2022, 13480 : 233 - 243
  • [8] Keyword Extraction Algorithm Based on Principal Component Analysis
    Li, Chang-Jin
    Han, Hui-Jian
    [J]. INTELLIGENT COMPUTING AND INFORMATION SCIENCE, PT II, 2011, 135 : 503 - 508
  • [9] System Level Knowledge Analysis and Keyword Extraction in Neuroscience
    Di Maio, Paola
    [J]. BRAIN INFORMATICS, BI 2021, 2021, 12960 : 225 - 234
  • [10] Analysis of Text Collections for the Purposes of Keyword Extraction Task
    Vanyushkin, Alexander
    Graschenko, Leonid
    [J]. JOURNAL OF INFORMATION AND ORGANIZATIONAL SCIENCES, 2020, 44 (01) : 171 - 184