Diverse feature set based Keyphrase extraction and indexing techniques

被引:0
|
作者
Saurabh Sharma
Vishal Gupta
Mamta Juneja
机构
[1] University Institute of Engineering & Technology,
[2] Panjab University,undefined
来源
关键词
Keyphrase extraction; Word embedding; Keyphrase indexing; External knowledge; Free indexing; Natural language processing;
D O I
暂无
中图分类号
学科分类号
摘要
The internet changed the way that people communicate, and this has led to a vast amount of Text that is available in electronic format. It includes things like e-mail, technical and scientific reports, tweets, physician notes and military field reports. Providing key-phrases for these extensive text collections thus allows users to grab the essence of the lengthy contents quickly and helps to locate information with high efficiency. While designing a Keyword Extraction and Indexing system, it is essential to pick unique properties, called features. In this article, we proposed different unsupervised keyword extraction approaches, which is independent of the structure, size and domain of the documents. The proposed method relies on the novel and cognitive inspired set of standard, phrase, word embedding and external knowledge source features. The individual and selected feature results are reported through experimentation on four different datasets viz. SemEval, KDD, Inspec, and DUC. The selected (feature selection) and word embedding based features are the best features set to be used for keywords extraction and indexing among all mentioned datasets. That is the proposed distributed word vector with additional knowledge improves the results significantly over the use of individual features, combined features after feature selection and state-of-the-art. After successfully achieving the objective of developing various keyphrase extraction methods we also experimented it for document classification task.
引用
收藏
页码:4111 / 4142
页数:31
相关论文
共 50 条
  • [1] Diverse feature set based Keyphrase extraction and indexing techniques
    Sharma, Saurabh
    Gupta, Vishal
    Juneja, Mamta
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (03) : 4111 - 4142
  • [2] Learning Feature Representations for Keyphrase Extraction
    Florescu, Corina
    Jin, Wei
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8077 - 8078
  • [3] Thesaurus based automatic keyphrase indexing
    Medelyan, Olena
    Witten, Ian H.
    [J]. OPENING INFORMATION HORIZONS, 2006, : 296 - +
  • [4] Automatic Keyphrase Extraction Techniques: A Review
    Lim, Vicky Min-How
    Wong, Siew Fan
    Lim, Tong Ming
    [J]. 2013 IEEE SYMPOSIUM ON COMPUTERS AND INFORMATICS (ISCI 2013), 2013,
  • [5] Automatic Keyphrase Extraction with a Refined Candidate Set
    You, Wei
    Fontaine, Dominique
    Barthes, Jean-Paul
    [J]. 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2009, : 576 - 579
  • [6] A Keyphrase Extraction Method Based on Multi-feature Evaluation and Mask Mechanism
    Ma, Liwen
    Liu, Weifeng
    [J]. 2022 11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2022, : 164 - 170
  • [7] Unsupervised KeyPhrase Extraction Based on Multi-granular Semantics Feature Fusion
    Chen, Jie
    Hu, Hainan
    Zhao, Shu
    Zhang, Yanping
    [J]. ROUGH SETS, IJCRS 2023, 2023, 14481 : 299 - 310
  • [8] Keyphrase Distance Analysis Technique from News Articles as a Feature for Keyphrase Extraction: An Unsupervised Approach
    Miah, Mohammad Badrul Alam
    Awang, Suryanti
    Rahman, Md Mustafizur
    Hosen, A. S. M. Sanwar
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 995 - 1002
  • [9] Text Feature Extraction Based on Rough Set
    Cheng, Yiyuan
    Zhang, Ruiling
    Wang, Xiufeng
    Chen, Qiushuang
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 310 - 314
  • [10] EEG Feature Extraction Based on Rough Set
    Mu, Zhendong
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MANAGEMENT, EDUCATION, INFORMATION AND CONTROL, 2015, 125 : 1246 - 1249