Word synonym relationships for text analysis: A graph-based approach

被引:4
|
作者
Alrasheed, Hend [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Technol, Riyadh, Saudi Arabia
来源
PLOS ONE | 2021年 / 16卷 / 07期
关键词
KEYWORD EXTRACTION; AUTHORSHIP ATTRIBUTION; EMBEDDINGS;
D O I
10.1371/journal.pone.0255127
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Keyword extraction refers to the process of detecting the most relevant terms and expressions in a given text in a timely manner. In the information explosion era, keyword extraction has attracted increasing attention. The importance of keyword extraction in text summarization, text comparisons, and document categorization has led to an emphasis on graph-based keyword extraction techniques because they can capture more structural information compared to other classic text analysis methods. In this paper, we propose a simple unsupervised text mining approach that aims to extract a set of keywords from a given text and analyze its topic diversity using graph analysis tools. Initially, the text is represented as a directed graph using synonym relationships. Then, community detection and other measures are used to identify keywords in the text. The set of extracted keywords is used to assess topic diversity within the text and analyze its sentiment. The proposed approach relies on grouping semantically similar candidate words. This approach ensures that the set of extracted keywords is comprehensive. Differing from other graph-based keyword extraction approaches, the proposed method does not require user parameters during graph construction and word scoring. The proposed approach achieved significant results compared to other keyword extraction techniques.
引用
收藏
页数:26
相关论文
共 50 条
  • [31] PRIMROSe: A Graph-Based Approach for Enterprise Architecture Analysis
    Naranjo, David
    Sanchez, Mario
    Villalobos, Jorge
    [J]. ENTERPRISE INFORMATION SYSTEMS, ICEIS 2014, 2015, 227 : 434 - 452
  • [32] Graph-based Arabic text semantic representation
    Etaiwi, Wael
    Awajan, Arafat
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (03)
  • [33] Graph-Based Term Weighting for Text Categorization
    Malliaros, Fragkiskos D.
    Skianis, Konstantinos
    [J]. PROCEEDINGS OF THE 2015 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2015), 2015, : 1473 - 1479
  • [34] New Graph-Based Text Summarization Method
    alZahir, Saif
    Fatima, Qandeel
    Cenek, Martin
    [J]. 2015 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2015, : 396 - 401
  • [35] Graph-based Text Representation and Knowledge Discovery
    Jin, Wei
    Srihari, Rohini K.
    [J]. APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 807 - 811
  • [36] Graph-based abstractive biomedical text summarization
    Givchi, Azadeh
    Ramezani, Reza
    Baraani-Dastjerdi, Ahmad
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 132
  • [37] Graph-based biomedical text summarization: An itemset mining and sentence clustering approach
    Azadani, Mozhgan Nasr
    Ghadiri, Nasser
    Davoodijam, Ensieh
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 84 : 42 - 58
  • [38] Graph-based Text Classification by Contrastive Learning with Text-level Graph Augmentation
    Li, Ximing
    Wang, Bing
    Wang, Yang
    Wang, Meng
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (04)
  • [39] A Graph-based Approach to Word Sense Disambiguation. An Unsupervised Method Based on Semantic Relatedness
    Arab, Meysam
    Jahromi, Mansoor Zolghadri
    Fakhrahmad, Seyed Mostafa
    [J]. 2016 24TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2016, : 250 - 255
  • [40] Graph-based word sense disambiguation in Telugu language
    Koppula, Neeraja
    Rani, B. Padmaja
    Rao, Koppula Srinivas
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2019, 23 (01) : 55 - 60