Centrality-based Approach for Supervised Term Weighting

被引:0
|
作者
Shanavas, Niloofer [1 ]
Wang, Hui [1 ]
Lin, Zhiwei [1 ]
Hawe, Glenn [1 ]
机构
[1] Ulster Univ, Sch Comp & Math, Jordanstown, North Ireland
关键词
automatic text classification; graph-based text representation; supervised term weighting; node centrality; TEXT; SCHEME;
D O I
10.1109/ICDMW.2016.189
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The huge amount of text documents has made the manual organization of text data a tedious task. Automatic text classification helps to easily handle the large number of documents by organising them automatically into predefined classes. The effectiveness and efficiency of automatic text classification largely depends on the way text documents are represented. A text document is usually viewed as a bag of terms (or words) and represented as a vector using the vector space model where terms are assumed unordered and independent and term frequencies (or weights) are used in the representation. Graphs are another text representation scheme that considers the structure of terms in the text document which is important for natural language. Terms weighted on the basis of graph representation increase the performance of text classification. In this paper, we present a novel approach for graph-based supervised term weighting which considers information relevant for the classification task using node centrality in the co-occurrence graphs built from the labelled training documents. Our experimental evaluation of the proposed term weighting scheme on four benchmark datasets shows the scheme has consistently superior performance over the state-of-the-art term weighting methods for text classification.
引用
收藏
页码:1261 / 1268
页数:8
相关论文
共 50 条
  • [41] A Centrality-Based History Prediction Routing Protocol for Opportunistic Networks
    Bamrah, Amarpreet
    Woungang, Isaac
    Barolli, Leonard
    Dhurandher, Sanjay Kumar
    Carvalho, Glaucio H. S.
    Takizawa, Makoto
    PROCEEDINGS OF 2016 10TH INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS (CISIS), 2016, : 130 - 136
  • [43] Toward More Effective Centrality-Based Attacks on Network Topologies
    Zhang, Songwei
    Si, Weisheng
    Qiu, Tie
    Cao, Qing
    IEEE International Conference on Communications, 2020, 2020-June
  • [44] A Centrality-based Measure of User Privacy in Online Social Networks
    Pensa, Rugger G.
    Di Blasi, Gianpiero
    PROCEEDINGS OF THE 2016 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING ASONAM 2016, 2016, : 1438 - 1439
  • [45] Evaluation of Congestion Aware Social Metrics for Centrality-Based Routing
    Islam, Muhammad Arshad
    Iqbal, Muhammad Azhar
    Aleem, Muhammad
    Halim, Zahid
    Srivastava, Gautam
    Lin, Jerry Chun-Wei
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [46] FLAKE: Fuzzy Graph Centrality-based Automatic Keyword Extraction
    Jain, Amita
    Mittal, Kanika
    Vaisla, Kunwar Singh
    COMPUTER JOURNAL, 2022, 65 (04): : 926 - 939
  • [47] Weighting Based Approach for Semi-supervised Feature Selection
    Benabdeslem, Khalid
    Hindawi, Mohammed
    Makkhongkaew, Raywat
    NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 300 - 307
  • [48] Centrality-based and similarity-based neighborhood extension in graph neural networks
    Zohrabi, Mohammadjavad
    Saravani, Saeed
    Chehreghani, Mostafa Haghir
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (16): : 24638 - 24663
  • [49] Centrality-based Relation aware Heterogeneous Graph Neural Network
    Li, Yangding
    Fu, Shaobin
    Zeng, Yangyang
    Feng, Hao
    Peng, Ruoyao
    Wang, Jinghao
    Zhang, Shichao
    KNOWLEDGE-BASED SYSTEMS, 2024, 283
  • [50] Centrality-based Caching for Privacy in Information-Centric Networks
    Abani, Noor
    Gerla, Mario
    MILCOM 2016 - 2016 IEEE MILITARY COMMUNICATIONS CONFERENCE, 2016, : 1249 - 1254