Classification of Keyphrases using Random Forest

被引:0
|
作者
Tovar Vidal, Mireya [1 ]
Flores Petlacalco, Gerardo [1 ]
Montes Rendon, Azucena [2 ]
Contreras Gonzalez, Meliza [1 ]
Cervantes Marquez, Ana Patricia [1 ]
机构
[1] Benemerita Univ Autonoma Puebla, Fac Comp Sci, Puebla, Mexico
[2] Inst Tecnol Tlalpan, TecNM, Mexico City, DF, Mexico
关键词
Keyphrases; Natural Language Processing; Machine Learning; Latent Semantic Analysis; LATENT SEMANTIC ANALYSIS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Keyphrases are words or phrases from a document that can describe its meaning. A keyphrase integrates the general idea of a document and implicitly contains the resources that the author used during the development of its research to achieve his goal. Therefore, there is a need to create classification models that allow the clustering of keyphrases according to their content for simplify reading. In this paper, keyphrases classification from scientific publications based on LSA and some classifying techniques is proposed and implemented. The aim is to create a classification model based on the extraction of features from the input corpus, without enriching it using external resources such as Wikipedia or online resources. Process, task, and material are the classes considered from Computer Science, Material Sciences, and Physics publications domains. Results show that Random Forest was found to be the best classification technique of keyphrases with 60% of measure-F-1.
引用
收藏
页码:506 / 511
页数:6
相关论文
共 50 条
  • [31] HABITAT CLASSIFICATION USING RANDOM FOREST BASED IMAGE ANNOTATION
    Torres, Mercedes
    Qiu, Guoping
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 1491 - 1495
  • [32] Automatic structure classification of small proteins using random forest
    Pooja Jain
    Jonathan D Hirst
    BMC Bioinformatics, 11
  • [33] High- resolution landcover classification using Random Forest
    Hayes, Matthew M.
    Miller, Scott N.
    Murphy, Melanie A.
    REMOTE SENSING LETTERS, 2014, 5 (02) : 112 - 121
  • [34] Integrated Pedestrian and Direction Classification using a Random Decision Forest
    Tao, Junli
    Klette, Reinhard
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 230 - 237
  • [35] Enzyme Function Classification using Protein Sequence Features and Random Forest
    Kumar, Chetan
    Li, Gang
    Choudhary, Alok
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 764 - 767
  • [36] An Approach for Sentiment Analysis Using Gini Index with Random Forest Classification
    Kaur, Manpreet
    COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 541 - 554
  • [37] Photometric classification of quasars from ALHAMBRA survey using random forest
    Arroquia-Cuadros, Benjamin
    Sanchez, Nestor
    Gomez, Vicent
    Blay, Pere
    Martinez-Badenes, Vicent
    Nieves-Seoane, Lorena
    ASTRONOMY & ASTROPHYSICS, 2023, 673
  • [38] Classification of Phishing Email Using Random Forest Machine Learning Technique
    Akinyelu, Andronicus A.
    Adewumi, Aderemi O.
    JOURNAL OF APPLIED MATHEMATICS, 2014,
  • [39] CLASSIFICATION OF LARGE MICROARRAY DATASETS USING FAST RANDOM FOREST CONSTRUCTION
    Manilich, Elena A.
    Oezsoyoglu, Z. Meral
    Trubachev, Valeriy
    Radivoyevitch, Tomas
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2011, 9 (02) : 251 - 267
  • [40] Raman spectroscopy based analysis of milk using random forest classification
    Amjad, Arslan
    Ullah, Rahat
    Khan, Saranjam
    Bilal, Muhammad
    Khan, Asifullah
    VIBRATIONAL SPECTROSCOPY, 2018, 99 : 124 - 129