Automatic keyphrases extraction from document using backpropagation

被引:0
|
作者
Wang, JB [1 ]
Peng, H [1 ]
Hu, JS [1 ]
机构
[1] S China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510641, Peoples R China
关键词
keyphrase extraction; backpropagation; information retrieval; natural language processing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic keyphrase extraction from documents is a task with many applications in information retrieval and natural language processing. Previously, Several keyphrase extraction methods have been proposed based on different techniques. In this paper a keyphrase extraction approach based on backpropagation is proposed. In order to determine whether a phrase is a keyphrase or not, the following features of a phrase in a given document are adopted: its term frequency TF and inverted document frequency IDF, whether or not it appears in the title or headings (subheadings) of the given document, and its distribution in the paragraphs of the given document. The algorithm is evaluated by the standard information retrieval metrics of precision and recall and human assessment. Experiment results show that this approach is competitive with other known methods.
引用
收藏
页码:3770 / 3774
页数:5
相关论文
共 50 条
  • [31] Automatic Extraction of a Document-oriented NoSQL Schema
    Abdelhedi, Fatma
    Brahim, Amal Ait
    Rajhi, Hela
    Ferhat, Rabah Tighilt
    Zurfluh, Gilles
    PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS (ICEIS 2021), VOL 1, 2021, : 192 - 199
  • [32] An automatic document processing system for medical data extraction
    Adamo, Francesco
    Attivissimo, Filippo
    Di Nisio, Attilio
    Spadavecchia, Maurizio
    MEASUREMENT, 2015, 61 : 88 - 99
  • [33] Towards a Linguistic Patterns for Arabic Keyphrases Extraction
    Sahmoudi, Issam
    Lachkar, Abdelmonaime
    2016 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR ORGANIZATIONS DEVELOPMENT (IT4OD), 2016,
  • [34] Extracting Keyphrases from Research Papers Using Word Embeddings
    Fan, Wei
    Liu, Huan
    Wang, Suge
    Zhang, Yuxiang
    Chang, Yaocheng
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT III, 2019, 11441 : 54 - 67
  • [35] Extracting Keyphrases from Research Papers Using Citation Networks
    Das Gollapalli, Sujatha
    Caragea, Cornelia
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 1629 - 1635
  • [36] DKPro Keyphrases: Flexible and Reusable Keyphrase Extraction Experiments
    Erbs, Nicolai
    Santos, Pedro Bispo
    Gurevych, Iryna
    Zesch, Torsten
    PROCEEDINGS OF 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, 2014, : 31 - 36
  • [37] Automatic Extraction of Apparent Semantic Structure from Text Contents of a Structural Calculation Document
    Kim, Bong-Geun
    Park, Sang Il
    Kim, Hyo-Jin
    Lee, Sang-Ho
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2010, 24 (03) : 313 - 324
  • [38] Keyword extraction from a single document using centrality measures
    Palshikar, Girish Keshav
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2007, 4815 : 503 - 510
  • [39] Text Extraction from Document Images using Edge Information
    Grover, Sachin
    Arora, Kushal
    Mitra, Suman K.
    2009 ANNUAL IEEE INDIA CONFERENCE (INDICON 2009), 2009, : 582 - +
  • [40] Automatic Extraction of Axioms from Wikipedia Using SPARQL
    Haidar-Ahmad, Lara
    Zouaq, Amal
    Gagnon, Michel
    SEMANTIC WEB, ESWC 2016, 2016, 9989 : 60 - 64