RANDOM WALK TERM WEIGHTING FOR IMPROVED TEXT CLASSIFICATION

被引:24
|
作者
Hassan, Samer [1 ]
Mihalcea, Rada [1 ]
Banea, Carmen [1 ]
机构
[1] Univ North Texas, Dept Comp Sci & Engn, POB 311366, Denton, TX 76203 USA
关键词
Random walk models; graph-based algorithms; TextRank; term weighting; text classification;
D O I
10.1142/S1793351X07000263
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a new approach for estimating term weights in a document, and shows how the new weighting scheme can be used to improve the accuracy of a text classifier. The method uses term co-occurrence as a measure of dependency between word features. A random walk model is applied on a graph encoding words and co-occurrence dependencies, resulting in scores that represent a quantification of how a particular word feature contributes to a given context. Experiments performed on three standard classification datasets show that the new random walk based approach outperforms the traditional term frequency approach of feature weighting.
引用
收藏
页码:421 / 439
页数:19
相关论文
共 50 条
  • [1] Random-walk term weighting for improved text classification
    Hassan, Samer
    Mihalcea, Rada
    Banea, Carmen
    [J]. ICSC 2007: INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, PROCEEDINGS, 2007, : 242 - +
  • [2] An Effective Term Weighting Method Using Random Walk Model for Text Classification
    Islam, Md. Rafiqul
    Islam, Md. Rakibul
    [J]. 2008 11TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY: ICCIT 2008, VOLS 1 AND 2, 2008, : 433 - 436
  • [3] An improved method of term weighting for text classification
    Jiang, Hua
    Li, Ping
    Hu, Xin
    Wang, Shuyan
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 1, 2009, : 294 - 298
  • [4] An improved term weighting scheme for text classification
    Tang, Zhong
    Li, Wenqiang
    Li, Yan
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (09):
  • [5] An improved supervised term weighting scheme for text representation and classification
    Tang, Zhong
    Li, Wenqiang
    Li, Yan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 189
  • [6] Improved inverse gravity moment term weighting for text classification
    Dogan, Turgut
    Uysal, Alper Kursat
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 130 : 45 - 59
  • [7] Supervised term-category feature weighting for improved text classification
    Attieh, Joseph
    Tekli, Joe
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 261
  • [8] An improved term weighting method based on relevance frequency for text classification
    Li, Chuanxiao
    Li, Wenqiang
    Tang, Zhong
    Li, Song
    Xiang, Hai
    [J]. SOFT COMPUTING, 2023, 27 (07) : 3563 - 3579
  • [9] An improved term weighting method based on relevance frequency for text classification
    Chuanxiao Li
    Wenqiang Li
    Zhong Tang
    Song Li
    Hai Xiang
    [J]. Soft Computing, 2023, 27 : 3563 - 3579
  • [10] Adaptable Term Weighting Framework for Text Classification
    Huynh, Dat
    Dat Tran
    Ma, Wanli
    Sharma, Dharmendra
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT II, 2011, 6609 : 254 - 265