Smoothing Temporal Difference for Text Categorization

被引:0
|
作者
Fukumoto, Fumiyo [1 ]
Suzuki, Yoshimi [1 ]
机构
[1] Univ Yamanashi, Grad Fac Interdisciplinary Res, Kofu, Yamanashi, Japan
关键词
Temporal adaptation; Term smoothing; Text categorization; Transfer learning;
D O I
10.1007/978-3-319-28940-3_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses text categorization problem that training data may be derived from a different time period than test data. We present a method for text categorization that minimizes the impact of temporal effects by using term smoothing and transfer learning techniques. We first used a technique called Temporal-based Term Smoothing (TTS) to replace those time sensitive features with representative terms, then applied boosting based transfer learning algorithm called TrAda-Boost for categorization. The results using a 21-year Japanese Mainichi Newspaper corpus showed that integrating term smoothing and transfer learning improves overall performance, especially it is effective when the creation time period of the test data differs greatly from the training data.
引用
收藏
页码:203 / 214
页数:12
相关论文
共 50 条
  • [1] Smoothing LDA model for text categorization
    Li, Wenbo
    Sun, Le
    Feng, Yuanyong
    Zhang, Dakun
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 83 - +
  • [2] Analyzing the temporal sequences for text categorization
    Luo, X
    Zincir-Heywood, AN
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS, 2004, 3215 : 498 - 505
  • [3] A logistic regression-based smoothing method for Chinese text categorization
    Yen, Show-Jane
    Lee, Yue-Shi
    Ying, Jia-Ching
    Wu, Yu-Chieh
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (09) : 11581 - 11590
  • [4] Maximum scatter difference classifier and its application of text categorization
    Song, Fengxi
    Liu, Shuhai
    Yang, Jingyu
    Xia, Saifei
    [J]. Jisuanji Gongcheng/Computer Engineering, 2005, 31 (05): : 8 - 10
  • [5] Chinese text categorization based on the binary weighting model with non-binary smoothing
    Xue, D
    Sun, MS
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2003, 2633 : 408 - 419
  • [6] Using the absolute difference of term occurrence probabilities in binary text categorization
    Hakan Altınçay
    Zafer Erenel
    [J]. Applied Intelligence, 2012, 36 : 148 - 160
  • [7] Using the absolute difference of term occurrence probabilities in binary text categorization
    Altincay, Hakan
    Erenel, Zafer
    [J]. APPLIED INTELLIGENCE, 2012, 36 (01) : 148 - 160
  • [8] Temporal-based Feature Selection and Transfer Learning for Text Categorization
    Fukumoto, Fumiyo
    Suzuki, Yoshimi
    [J]. 2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), 2015, : 17 - 26
  • [9] Max-difference maximization criterion: a feature selection method for text categorization
    Lingbin Jin
    Li Zhang
    Lei Zhao
    [J]. Frontiers of Computer Science, 2023, 17
  • [10] Max-difference maximization criterion:a feature selection method for text categorization
    Lingbin JIN
    Li ZHANG
    Lei ZHAO
    [J]. Frontiers of Computer Science, 2023, 17 (01) : 231 - 233